DZone
Thanks for visiting DZone today,
Edit Profile
  • Manage Email Subscriptions
  • How to Post to DZone
  • Article Submission Guidelines
Sign Out View Profile
  • Post an Article
  • Manage My Drafts
Over 2 million developers have joined DZone.
Log In / Join
Refcards Trend Reports
Events Video Library
Refcards
Trend Reports

Events

View Events Video Library

The Latest Big Data Topics

article thumbnail
Don't Use Apache Kafka Consumer Groups the Wrong Way!
Apache Kafka is great — but if you're going to use it, you have to be very careful not to break things. Here's how you can avoid the pain!
July 30, 2017
by Paolo Patierno
· 275,406 Views · 20 Likes
article thumbnail
How I Used Deep Learning to Train a Chatbot to Talk Like Me (Sorta)
See how to use a deep learning model to train a chatbot based on past social media conversations in hopes of getting the chatbot to respond to messages the way you would.
July 27, 2017
by Adit Deshpande
· 40,279 Views · 18 Likes
article thumbnail
Working With Centralized Logging With the Elastic Stack
See how Filebeat works with the Elastic, or ELK, stack to locate problems in distributed logs for an application with microservices architecture.
July 24, 2017
by Rafael Salerno
· 6,960 Views · 2 Likes
article thumbnail
How to Use the Kafka Streams API
The Kafka Streams API allows you to create real-time applications that power your core business. Here's everything you need to know about it!
July 23, 2017
by Anuj Saxena
· 111,736 Views · 15 Likes
article thumbnail
Data Flow Pipeline Using StreamSets
Learn about configuring JDBC Query Consumer, performing JDBC lookup with multiple tables, creating a data flow pipeline, and monitoring the stage and pipeline stats.
July 18, 2017
by Rathnadevi Manivannan
· 16,635 Views · 7 Likes
article thumbnail
Is ElasticSearch SET/GET Eventually Consistent?
The author takes a deep dive into the intricacies of SET/GET in ElasticSearch to determine if there is eventual consistency.
July 18, 2017
by Tomer Ben David
· 16,973 Views · 9 Likes
article thumbnail
Hadoop Cluster Capacity Planning of Data Nodes for Batch and In-Memory Processes
Let's talk about capacity planning for data nodes. We'll start with gathering the cluster requirements and end by learning about RAM requirements.
Updated July 14, 2017
by Mamta Chawla
· 48,549 Views · 8 Likes
article thumbnail
An Introduction to Kafka
Learn the basics of Apache Kafka, an open-source stream processing platform, and learn how to create a general single broker cluster.
July 14, 2017
by Prashant Sharma
· 75,233 Views · 51 Likes
article thumbnail
Filebeat vs. Logstash — The Evolution of a Log Shipper
This comparison of log shippers Filebeat and Logstash reviews their history, and when to use each one- or both together.
July 8, 2017
by Daniel Berman
· 30,609 Views · 12 Likes
article thumbnail
The Most Important Lessons Learned From Data Science Projects
The biggest advantage of data science over traditional statistics is that it can draw conclusions from a junk pile of the supposedly unrelated information.
July 7, 2017
by Chris Richardson
· 9,993 Views · 2 Likes
article thumbnail
Big Data Ingestion: Flume, Kafka, and NiFi
Flume, Kafka, and NiFi offer great performance, can be scaled horizontally, and have a plug-in architecture where functionality can be extended through custom components.
July 7, 2017
by Tony Siciliani
· 90,502 Views · 24 Likes
article thumbnail
How to Automatically Migrate All Tables From a Database to Hadoop With No Coding
This is a great tool for instantly moving over tables from relational databases. This is also a great way to quickly build up a data lake.
July 5, 2017
by Tim Spann DZone Core CORE
· 20,354 Views · 7 Likes
article thumbnail
Connecting Apache Kafka With Mule ESB
Learn about the capabilities of the Apache Kafka message queuing system and how to integrate it with Mule ESB in this tutorial.
July 4, 2017
by Jitendra Bafna
· 21,862 Views · 9 Likes
article thumbnail
Apache Spark Performance Tuning – Degree of Parallelism
Today we learn about improving performance and increasing speed through partition tuning in a Spark application running on YARN.
June 30, 2017
by Rathnadevi Manivannan
· 102,293 Views · 8 Likes
article thumbnail
Apache Spark on YARN: Resource Planning
Apache Spark is an in-memory distributed data processing engine and YARN is a cluster management technology. Learn how to use them effectively to manage your big data.
June 28, 2017
by Rathnadevi Manivannan
· 37,725 Views · 8 Likes
article thumbnail
4 Traits of Outstanding Data Engineers
Knowing what makes a great data engineer is a critical first step towards identifying and onboarding the right data engineers to make your enterprise succeed.
June 28, 2017
by Yaniv Leven
· 10,444 Views · 2 Likes
article thumbnail
Running Distributed TensorFlow on Slurm Clusters
Check out a thorough example that will help you in your experiments with TensorFlow on Slurm clusters with the use of a simple Python module.
June 27, 2017
by Tomasz Grel
· 8,528 Views · 1 Like
article thumbnail
Apache Spark on YARN – Performance and Bottlenecks
In this series, we learn about performance tuning and fixing bottlenecks in high-level APIs with an Apache Spark application on YARN.
June 27, 2017
by Rathnadevi Manivannan
· 31,206 Views · 12 Likes
article thumbnail
Top Sites to Learn the Internet of Things
This collection of sites and blogs will provide never-ending inspiration and learning opportunities for devs who want to master IoT.
June 26, 2017
by Francesco Azzola
· 21,359 Views · 9 Likes
article thumbnail
Reviewing Open-Source Business Intelligence Tools
Review some open-source Business Intelligence tools that are built to simplify planning, analysis, and reporting with one software suite.
June 21, 2017
by Luba Belokon
· 17,084 Views · 7 Likes
  • Previous
  • ...
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • ...
  • Next
  • RSS
  • X
  • Facebook

ABOUT US

  • About DZone
  • Support and feedback
  • Community research

ADVERTISE

  • Advertise with DZone

CONTRIBUTE ON DZONE

  • Article Submission Guidelines
  • Become a Contributor
  • Core Program
  • Visit the Writers' Zone

LEGAL

  • Terms of Service
  • Privacy Policy

CONTACT US

  • 3343 Perimeter Hill Drive
  • Suite 215
  • Nashville, TN 37211
  • [email protected]

Let's be friends:

  • RSS
  • X
  • Facebook
×