This Week in Hadoop: NiFi, Kafka, Spark, and More
Here is what's hot this week in the big world of Big Data.
Join the DZone community and get the full member experience.
Join For FreeThis week the big news is the coming HDF 2.0 and the release of Apache NiFi 1.0 Beta. NiFi has a redesigned UI, more processors and more production level features.
HDF 2.0 is released with a ton of improvements including Ambari integration, Spark 1.0, Zero Master Clustering, Zookeeper, Storm Ambari views, updated UI, multi-tenant authentication and more.
Apache NiFi has released 1.0.0-Beta which includes an incredible number of changes and a new very modern fast UI. I definitely recommend evaluating this interesting software.
- Using NiFi 1.0 to Processing Incoming Emails with Attachments.
- The New NiFi 1.0 UI
- Slowly Changing Dimensions in Hadoop with Phoenix and NiFi
HBase
Kylin
Apache Kylin is an interesting OLAP and Distributed Analytics Engine that provides fast SQL on Hadoop. See: Apache Kylin with HBase
Spark Machine Learning
- Spark and K-Means
- Simple Voronoi
- Spark Food Recommendations
- Spark Naive Bayes for Reuters Data
- Spark Streaming Log Aggregation
Web Tools
- Twitter Streams and HeatMaps (Github)
- HTML extraction with Goose
IoT in Java
A cool article for working with Sensors (IoT) using Intel Edison and Java.
Cool Big Data Articles From Spring One
Most Interesting Article of the Week
Opinions expressed by DZone contributors are their own.
Comments