The Big Data Zone is presented by Exaptive. Learn how rapid data application development can address the data science shortage.
Real-time Hadoop queries will become a reality in 2013 thanks to two new projects from Cloudera: Impala and Trevni.
Impala is an open source query engine modeled on Dremel, Google’s proprietary big data query solution. A first beta is available, and the production version is expected in Q1 2013.
Impala lets you run real-time queries on top of Hadoop’s HDFS, HBase and Hive. No migration is necessary.
However, the picture should get even better once Trevni, from Doug Cutting (the creator of Lucene, Hadoop, etc.), is integrated into Impala. Trevni is a new columnar storage format that promises superior read performance on large data sets.
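To make the idea of columnar storage concrete, here is a minimal Python sketch. It is not Trevni's actual file format or API; the function `to_columns` and the sample records are hypothetical, chosen only to illustrate why a query that reads a single field can benefit when values are grouped by column instead of by row.

```python
from typing import Any, Dict, List


def to_columns(rows: List[Dict[str, Any]]) -> Dict[str, List[Any]]:
    """Pivot row-oriented records into one list of values per column.

    Toy illustration only: assumes every row has the same keys.
    """
    columns: Dict[str, List[Any]] = {}
    for row in rows:
        for name, value in row.items():
            columns.setdefault(name, []).append(value)
    return columns


# Hypothetical sample data, stored row by row.
rows = [
    {"user": "ann", "country": "BE", "bytes": 120},
    {"user": "bob", "country": "US", "bytes": 300},
    {"user": "cy", "country": "BE", "bytes": 80},
]

columns = to_columns(rows)

# Row layout: summing one field still walks every full record.
total_row_layout = sum(row["bytes"] for row in rows)

# Column layout: the same sum reads only the "bytes" values.
total_col_layout = sum(columns["bytes"])
```

Both layouts give the same total; the difference is how much data has to be scanned. On disk, a columnar format can skip the `user` and `country` values entirely for this query, which is the kind of read pattern such formats are designed for.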
Together, Impala and Trevni promise real-time big data queries with multiple joins, with performance on par with Google’s Dremel and more functionality.
Published at DZone with permission of
, DZone MVB
Opinions expressed by DZone contributors are their own.