Do You Trust Google Big Query with Your Big Data?
Join the DZone community and get the full member experience.
Join For Free
google has come up with a fantastic service to analyze large amounts of data. it’s called
bigquery
and it allows you to run analysis on big data on the cloud. as
expected, the tool has a superb, intuitive web ui. the data analysis
language uses sql like queries. (
hive
, anyone
). have a look at the
big query tutorial
, it looks pretty neat. so, now all you need to do to run queries is to upload your data to google using the form shown below.
it allows you to upload a file or point to it using google’s cloud storage.
now, the interesting question here is that to analyze using bigquery how much of that data are you willing to give google? and how long will that take? the answer won’t be “let me quickly upload a 500 gb file and run some queries”. that amount of data would definitely take some time to upload. so, effectively, this saas becomes pretty useless as more and more data volumes need to be uploaded for analysis.
everyone trusts google (
), so this concern might be easily ignored. but a potential other
problem i see is the “privacy policies” that are violated. usually, when
you want to analyze data, it can contain sensitive data such as user
behavior patterns and so forth. how comfortable will your customers be
if you hand that data over to google? even anonymizing this data might
not save you from a potential legal breach.
i still believe setting up your own data analysis and monitoring platform is the best way to go. thoughts? i’d love to hear them.
Published at DZone with permission of Tharindu Mathew, DZone MVB. See the original article here.
Opinions expressed by DZone contributors are their own.
Comments