Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}
DZone's Guide to

Big Data Processing

How big is Big Data? The 2016 DZone Guide to Big Data Processing breaks down this question and provides insight on how to answer it. Dive into the thoughts of over 1,500 developers and see how they address growing data sets and associated pain points. Discover symptoms that may impact the health of your data, and how neural nets are used in machine learning.  This Guide digs deeper into what resources are needed to build a system for big and fast data, insight on the future of Big Data, and much more.

Free 38-page ebook

Table of Contents

2
Executive Summary
3
Key Research Findings
7
Checklist: Data Diagnosis: Is Your Data Healthy?
8
Overview of the Apache Spark Ecosystem
12
What Are Neural Nets?
14
The Do’s and Don’ts of Using Regexes With Big Data
16
Infographic: Building Big Data
20
Four MapReduce Design Patterns
22
Building High Performance Big Data Analytics Systems
25
Diving Deeper Into Big Data
26
Executive Insights on Big Data
28
Machine Learning: the Bigger Picture
31
Big Data Solutions Directory
34
Glossary

Interactive Preview

Publications

  • Featured
  • Latest
  • Popular
Continuous Delivery
The DZone 2014 Guide to Continuous Delivery provides data, ideas, and solutions that your organization can use to drastically improve its software production process.
The Java Ecosystem
The DZone Guide to the Java Ecosystem is an essential publication for understanding current research and trends surrounding Java development. It covers benefits of recent language updates, microservices and containers as they apply to Java, practical monitoring advice, and reactive programming principles.
Mobile Development
The DZone 2014 Guide to Mobile Development gives readers a full picture of the various approaches to mobile development, enabling them to overcome its biggest obstacles.
Enterprise Integration
DZone’s 2014 Guide to Enterprise Integration is a unique resource for developers and architects to learn how industry experts are handling integration in legacy enterprise systems, modern systems, and massive web-scale systems. It contains resources that will help you succeed with modern architectural patterns and application integration.
Internet of Things
DZone’s 2014 Guide to Internet of Things is an early mover’s map for navigating this bleeding edge space and finding your place in it.
Modern Java
The key to the modernization of Java is the energy and enthusiasm of the Java developer community at large. In the 2016 Guide to Modern Java, we cover how Java 8 improves the developer experience and preview features of Java 9. Discover how the JVM landscape is changing, 7 habits of super productive Java developers, and a checklist to build Java 8 APIs. Learn more about Jigsaw, its capabilities, and how to create Java 9 modules. We also explore implementing hash tables and reactive microservices for a flexible architecture.
Big Data Guide
DZone’s 2014 Guide to Big Data is the definitive resource for learning how industry experts are handling the massive growth and diversity of data. It contains resources that will help you navigate and excel in the world of Big Data management.
Cloud Platforms
The 2014 DZone Cloud Platform Research Report brings together worldwide cloud providers into one free, exclusive report that offers impartial insight into 39 specific cloud platform providers.
Java: Development and Evolution
Although some believe Java is dying, developments such as the upcoming release of Java 9 and the strength of the Java community tell another story. New JVM-based languages like Kotlin and exciting changes in Java 9 such as Project Jigsaw, Streams API improvements, and JShell prove a bright future ahead. The 2017 Guide to Java explores upcoming features of Java 9, how to make your apps backwards-compatible, a look into whether Microservices are right for you, and using the Futures API in Java.
Continuous Delivery
The DZone Guide to Continuous Delivery has more insight than ever into the status of DevOps in the enterprise and the obstacles facing developers, not only in their tooling, but within the organization as a whole.
DevOps: Continuous Delivery and Automation
DevOps has emerged to be the “new normal” in software development, helping companies react to user feedback real-time and setting higher standards for rapid development. Since becoming a permanent topic of discussion, thought leaders, developers, and businesses have pushed to adopt the necessary DevOps tools and methodologies. In the DZone Guide to DevOps: Continuous Delivery & Automation, we explore the state of DevOps in 2017 including industry challenges, best practices, and solutions. Dive into the best mental model for implementing microservices, implementing unambiguous code requirements, best practices for microservices and containers, and Continuous Delivery anti-patterns.
Code Quality and Software Agility
The DZone Guide to Code Quality and Software Agility is an invaluable resource for understanding the software quality trade-offs at both the code and organizational levels. It covers testing and monitoring strategies, requirements management, team agility, and decision making.
{{ card.title }}
{{card.downloads | formatCount }} {{card.views | formatCount }}