Over a million developers have joined DZone.

This Week in Hadoop and More: HDF 2, HDP 2.5, GearPump, LinkedIn's Ambry, and Paddle Paddle

DZone's Guide to

This Week in Hadoop and More: HDF 2, HDP 2.5, GearPump, LinkedIn's Ambry, and Paddle Paddle

There's lots of interesting updates and new projects going around including HDP 2.5, HDF 2.0, Python's Paddle Paddle and Apache GearPump.

· Big Data Zone ·
Free Resource

How to Simplify Apache Kafka. Get eBook.

HDF 2.0 released with 0ver 170 processors and now includes Ambari installation and management.

HDP 2.5 Sandbox Now available for download for VMWARE and VIRTUALBOX.   This includes supported integrated version of Apache Zeppelin, Apache Spark 1.6.2 and a preview of Spark 2.

A cool selection of Free Datasets linked at DataQuest.  You had me at Free.

Applying Kafka Streams is yet another streaming option.   See this link site has all the streaming, Awesome Streaming!

Spark Streaming Meetup Github.

Interesting new streaming engine, Apache GearPump based on Akka.  It's incubating at Apache. Read this tutorial to write your first app and deploy on HDP YARN.

Cool new book just came out, Practical Hive, on the latest Hive updates, with very knowledgeable writers.

Interesting framework:  StreamCQL is Continuous Query Language on RealTime Computation System build on Apache Storm and optionally Apache Kafka.  Another Huawei Big Data project, they are doing some interesting things with Spark and Big Data. 

Another interesting framework,  Python Paddle Paddle for Deep Learning

Yet another interesting data project from LinkedIn, Ambry, in scalable object store for lots of smaller objects.  

12 Best Practices for Modern Data Ingestion. Download White Paper.

hadoop ,hortonworks ,nifi ,spark ,big data

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}