With the CORR() function, BigQuery can now return the correlation between two variables in your dataset from within the same SELECT statements you already use. A post on Google's Cloud Platform Blog demonstrates CORR() on sample data collected during the Google I/O conference, using sensor readings for temperature, humidity, noise, and other environmental measurements. All you need to do is name the two variables you are interested in. BigQuery does the heavy lifting, computing the Pearson correlation coefficient across your dataset and returning it in the query output.
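To make the idea concrete, here is a minimal sketch of what a CORR() query might look like and what the Pearson coefficient it returns actually measures. The table name `sensor_data.readings`, the column names, and the sample values are all hypothetical, made up for illustration and not taken from Google's blog post. The Python function is a plain reimplementation of the Pearson formula for checking intuition, not BigQuery's own code.

```python
from math import sqrt

# Hypothetical query: table and column names are assumptions for illustration.
QUERY = """
SELECT CORR(temperature, humidity) AS temp_humidity_corr
FROM sensor_data.readings
"""


def pearson_corr(xs, ys):
    """Return the Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length with at least 2 values")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations (covariance numerator).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sums of squared deviations (variance numerators).
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return cov / sqrt(var_x * var_y)


# Made-up sample readings: humidity falls steadily as temperature rises.
temperature = [20.0, 21.5, 23.0, 24.5, 26.0]
humidity = [60.0, 57.0, 54.0, 51.0, 48.0]

r = pearson_corr(temperature, humidity)
print(f"Pearson correlation: {r:.3f}")
```

Because the made-up humidity values fall in a straight line as temperature rises, the coefficient comes out at essentially -1, a perfect negative linear relationship. Real sensor data would normally land somewhere between -1 and 1.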
Google BigQuery Now Supports Pearson Correlation