Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

Batch vs. Stream Processing: Which Should You Choose and When? [Video]

DZone's Guide to

Batch vs. Stream Processing: Which Should You Choose and When? [Video]

Each approach has its pros and cons. At the end of the day, your choice of batch or streaming all comes down to your business use case.

· Big Data Zone ·
Free Resource

The open source HPCC Systems platform is a proven, easy to use solution for managing data at scale. Visit our Easy Guide to learn more about this completely free platform, test drive some code in the online Playground, and get started today.

We all know that enterprise data needs change constantly, and recently, that change has come at an increasing pace. Companies that were once processing all their big data on-prem have suddenly moved into the cloud. Frameworks we used to know and love suddenly become obsolete. However, an interesting debate that still rages on is how to get data processed faster. There are generally two heralded ways of processing data today:

  1. Batch processing
  2. Stream processing

Batch processing deals with non-continuous data. It's fantastic at handling datasets quickly but doesn't really get near the real-time requirements of most of today's business. Stream processing does deal with continuous data and is really the golden key to turning big data into fast data.

Each approach has its pros and cons. At the end of the day, your choice of batch or streaming all comes down to your business use case. However, there are questions and use cases to consider here when selecting your data processing approach. In our latest episode of Craft Beer and Data, Mark Balkenende and I dove deep into the debate of batch vs. streaming.

We answered some interesting questions like, "Is data ever really real-time?" We also debated if the lambda architecture is really dead, as well as sifted through some considerations you should take into account when deciding batch or stream processing.

Before we jump into the video (small plug), we are taking Craft Beer and Data on the road! Check out our events page and come attend an event in your area. We'd also love to hear your thoughts on the batch vs. streaming debate. Tweet me your thoughts @Nick_Piette.

Managing data at scale doesn’t have to be hard. Find out how the completely free, open source HPCC Systems platform makes it easier to update, easier to program, easier to integrate data, and easier to manage clusters. Download and get started today.

Topics:
big data ,batch processing ,stream processing

Published at DZone with permission of

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}