Over a million developers have joined DZone.

Google Big Query Now Supports Pearson Correlation

DZone's Guide to

Google Big Query Now Supports Pearson Correlation

· Big Data Zone
Free Resource

Learn how you can maximize big data in the cloud with Apache Hadoop. Download this eBook now. Brought to you in partnership with Hortonworks.

During this year's I/O conference, Google announced the ability to perform correlation on its BigQuery platform.   As of late last week, that functionality is now available to all BigQuery users! 

By using the CORR() function in BigQuery, you can now output correlation values between two variables in your dataset within the same SELECT statement queries you are accustomed to with BigQuery.  A blog post on Google's Cloud Platform Blog demonstrates a great example of CORR() in action with some sample data taken during the I/O conference using various sensor readings for temperature, humidity, noise and other environmental data.  Your work ends with defining the two variables you are interested in within your query and then BigQuery will do the heavy lifting by performing the Pearson Correlation function across your dataset for you and providing it in the output.

Hortonworks DataFlow is an integrated platform that makes data ingestion fast, easy, and secure. Download the white paper now.  Brought to you in partnership with Hortonworks


Opinions expressed by DZone contributors are their own.

The best of DZone straight to your inbox.

Please provide a valid email address.

Thanks for subscribing!

Awesome! Check your inbox to verify your email so you can start receiving the latest in tech news and resources.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}