Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

This Week in Hadoop and More: Deep Dive on NiFi, Presentations & Spark

DZone's Guide to

This Week in Hadoop and More: Deep Dive on NiFi, Presentations & Spark

Wrap up of the last 7 days of Big Data, Hadoop, and Spark activities.

· Big Data Zone ·
Free Resource

Hortonworks Sandbox for HDP and HDF is your chance to get started on learning, developing, testing and trying out new features. Each download comes preconfigured with interactive tutorials, sample data and developments from the Apache community.

This week, we have Big Data, Fast Data, Data in Motion, and the tools, languages, and techniques to enable better solutions with many types of data. A ton of great presentations on Apache Big Data projects came out of the Apache Big Data Conference that just completed in Vancouver. There's also a couple of interesting Hadoop tutorials for working with Zeppelin and Pig.

Mining Tez App Logs with Pig

Using ZeppelinHub Viewer

There's an interesting new video on working with Hadoop on Cisco: How to Accelerate Time to Value on Your Data (Cisco). A subproject of Apache NiFi - MiniFi for Small IoT Devices is on Github and is very interesting. It helps extend this former NSA project to smaller devices and platforms and turn NIFI into a real IOT powerhouse.

Pivotal has an excellent article on why Agility, Big Data, and Cloud work together in symbiosis.

An interesting use case that effects everyone is Building Big Data for Smart Cities.

As Big Data and Hadoop has added more and more projects and become more complex to get all the pieces in the correct versions to work together, the open consortium (ODPi) is really enabling standards and ODPi becomes Gold Sponsor of ASF. This is helping to push forward the open goals of both organizations.

A very interesting in-depth advanced Spark article this week on Spark RDD Partitioning.

The Apache Big Data 2016 event in Vancouver has produced an amazing stream of presentations on all facets of Spark, NiFi, HBase, Hadoop, Data Science, Zeppelin, YARN, HAWQ, SQL, ProtoBufs, Thrift, and more. If you have time this week, look at these presentations. Not to forget the NJ Hadoop meetup for having a great talk on Apache NiFi.

That's it for this week; next week we will have all the new items in Spark, NiFi, Hadoop, and related projects in Big Data as this will be a weekly column.

Hortonworks Community Connection (HCC) is an online collaboration destination for developers, DevOps, customers and partners to get answers to questions, collaborate on technical articles and share code examples from GitHub.  Join the discussion.

Topics:
hadoop ,java ,spark ,zeppelin

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}