Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

Building Big Data Pipelines with Hadoop

DZone's Guide to

Building Big Data Pipelines with Hadoop

· Big Data Zone ·
Free Resource

The open source HPCC Systems platform is a proven, easy to use solution for managing data at scale. Visit our Easy Guide to learn more about this completely free platform, test drive some code in the online Playground, and get started today.

Here's an in-depth JavaZone tutorial on building big data pipelines:


Hadoop is not an island. To deliver a complete Big Data solution, a data pipeline needs to be developed that incorporates and orchestrates many diverse technologies. Using an example of real-time weblog processing, in this session we will demonstrate how the open source Spring for Apache Hadoop project can be used to build manageable and robust pipeline solutions around Hadoop.


How to develop Big Data Pipelines for Hadoop from JavaZone on Vimeo.

Managing data at scale doesn’t have to be hard. Find out how the completely free, open source HPCC Systems platform makes it easier to update, easier to program, easier to integrate data, and easier to manage clusters. Download and get started today.

Topics:

Published at DZone with permission of

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}