DZone
Big Data Zone
Thanks for visiting DZone today,
Edit Profile
  • Manage Email Subscriptions
  • How to Post to DZone
  • Article Submission Guidelines
Sign Out View Profile
  • Post an Article
  • Manage My Drafts
Over 2 million developers have joined DZone.
Log In / Join
  • Refcardz
  • Trend Reports
  • Webinars
  • Zones
  • |
    • Agile
    • AI
    • Big Data
    • Cloud
    • Database
    • DevOps
    • Integration
    • IoT
    • Java
    • Microservices
    • Open Source
    • Performance
    • Security
    • Web Dev
DZone > Big Data Zone > This Week in Hadoop: NiFi, Kafka, Spark, and More

This Week in Hadoop: NiFi, Kafka, Spark, and More

Here is what's hot this week in the big world of Big Data.

Tim Spann user avatar by
Tim Spann
CORE ·
Aug. 12, 16 · Big Data Zone · Opinion
Like (7)
Save
Tweet
3.90K Views

Join the DZone community and get the full member experience.

Join For Free

This week the big news is the coming HDF 2.0 and the release of Apache NiFi 1.0 Beta.   NiFi has a redesigned UI, more processors and more production level features.   

HDF 2.0

HDF 2.0 is released with a ton of improvements including Ambari integration, Spark 1.0,  Zero Master Clustering, Zookeeper, Storm Ambari views, updated UI, multi-tenant authentication and more.

Apache NiFi has released 1.0.0-Beta which includes an incredible number of changes and a new very modern fast UI.   I definitely recommend evaluating this interesting software.

  • Using NiFi 1.0 to Processing Incoming Emails with Attachments.  
  • The New NiFi 1.0 UI
  • Slowly Changing Dimensions in Hadoop with Phoenix and NiFi

Image title

HBase

  • HBase at AirBnB
  • The Future of HBase

Kylin

Apache Kylin is an interesting OLAP and Distributed Analytics Engine that provides fast SQL on Hadoop. See: Apache Kylin with HBase

Spark Machine Learning 

  • Spark and K-Means 
  • Simple Voronoi
  • Spark Food Recommendations
  • Spark Naive Bayes for Reuters Data
  • Spark Streaming Log Aggregation 

Web Tools

  • Twitter Streams and HeatMaps (Github) 
  • HTML extraction with Goose

IoT in Java

A cool article for working with Sensors (IoT) using Intel Edison and Java.

Cool Big Data Articles From Spring One

  • Speed of Though Analytics on Hadoop
  • Streaming Live Data

Most Interesting Article of the Week

Uber's Case for Incremental Processing on Hadoop

hadoop kafka

Opinions expressed by DZone contributors are their own.

Popular on DZone

  • An Overview of DTrace and strace
  • Everything You Need to Know About Web Pentesting: A Complete Guide
  • Event Loop in JavaScript
  • Everything You Need to Know About Cloud Automation in 2022

Comments

Big Data Partner Resources

X

ABOUT US

  • About DZone
  • Send feedback
  • Careers
  • Sitemap

ADVERTISE

  • Advertise with DZone

CONTRIBUTE ON DZONE

  • Article Submission Guidelines
  • MVB Program
  • Become a Contributor
  • Visit the Writers' Zone

LEGAL

  • Terms of Service
  • Privacy Policy

CONTACT US

  • 600 Park Offices Drive
  • Suite 300
  • Durham, NC 27709
  • support@dzone.com
  • +1 (919) 678-0300

Let's be friends:

DZone.com is powered by 

AnswerHub logo