DZone
Thanks for visiting DZone today,
Edit Profile
  • Manage Email Subscriptions
  • How to Post to DZone
  • Article Submission Guidelines
Sign Out View Profile
  • Post an Article
  • Manage My Drafts
Over 2 million developers have joined DZone.
Log In / Join
Refcards Trend Reports
Events Video Library
Refcards
Trend Reports

Events

View Events Video Library

The Latest Big Data Topics

article thumbnail
Big Data Ingestion: Flume, Kafka, and NiFi
Flume, Kafka, and NiFi offer great performance, can be scaled horizontally, and have a plug-in architecture where functionality can be extended through custom components.
July 7, 2017
by Tony Siciliani
· 90,582 Views · 24 Likes
article thumbnail
How to Automatically Migrate All Tables From a Database to Hadoop With No Coding
This is a great tool for instantly moving over tables from relational databases. This is also a great way to quickly build up a data lake.
July 5, 2017
by Tim Spann DZone Core CORE
· 20,383 Views · 7 Likes
article thumbnail
Connecting Apache Kafka With Mule ESB
Learn about the capabilities of the Apache Kafka message queuing system and how to integrate it with Mule ESB in this tutorial.
July 4, 2017
by Jitendra Bafna
· 21,903 Views · 9 Likes
article thumbnail
Apache Spark Performance Tuning – Degree of Parallelism
Today we learn about improving performance and increasing speed through partition tuning in a Spark application running on YARN.
June 30, 2017
by Rathnadevi Manivannan
· 102,323 Views · 8 Likes
article thumbnail
Apache Spark on YARN: Resource Planning
Apache Spark is an in-memory distributed data processing engine and YARN is a cluster management technology. Learn how to use them effectively to manage your big data.
June 28, 2017
by Rathnadevi Manivannan
· 37,769 Views · 8 Likes
article thumbnail
4 Traits of Outstanding Data Engineers
Knowing what makes a great data engineer is a critical first step towards identifying and onboarding the right data engineers to make your enterprise succeed.
June 28, 2017
by Yaniv Leven
· 10,474 Views · 2 Likes
article thumbnail
Running Distributed TensorFlow on Slurm Clusters
Check out a thorough example that will help you in your experiments with TensorFlow on Slurm clusters with the use of a simple Python module.
June 27, 2017
by Tomasz Grel
· 8,546 Views · 1 Like
article thumbnail
Apache Spark on YARN – Performance and Bottlenecks
In this series, we learn about performance tuning and fixing bottlenecks in high-level APIs with an Apache Spark application on YARN.
June 27, 2017
by Rathnadevi Manivannan
· 31,258 Views · 12 Likes
article thumbnail
Top Sites to Learn the Internet of Things
This collection of sites and blogs will provide never-ending inspiration and learning opportunities for devs who want to master IoT.
June 26, 2017
by Francesco Azzola
· 21,394 Views · 9 Likes
article thumbnail
Reviewing Open-Source Business Intelligence Tools
Review some open-source Business Intelligence tools that are built to simplify planning, analysis, and reporting with one software suite.
June 21, 2017
by Luba Belokon
· 17,110 Views · 7 Likes
article thumbnail
NMEA Data Acquisition: An IoT Exercise With Python
This comprehensive post covers the basic data arc that many IoT projects have—exploration, modeling, filtering, and persistence—using Python.
June 21, 2017
by Steven Lott
· 14,095 Views · 4 Likes
article thumbnail
Spark Streaming vs. Kafka Streaming
If event time is very relevant and latencies in the seconds range are completely unacceptable, Kafka should be your first choice. Otherwise, Spark works just fine.
June 19, 2017
by Mahesh Chand Kandpal
· 141,013 Views · 31 Likes
article thumbnail
Streaming in Spark, Flink, and Kafka
There is a lot of buzz going on between when to use Spark, when to use Flink, and when to use Kafka. Get it all straight in this article.
June 18, 2017
by Shivangi Gupta
· 55,281 Views · 26 Likes
article thumbnail
What Is Variable Importance and How Is It Calculated?
Variable Importance (VI) helps data scientists weed out certain predictors that are contributing to nothing and that instead add time to processing.
June 15, 2017
by Avkash Chauhan
· 27,091 Views · 5 Likes
article thumbnail
How to Install the ELK Stack on Azure
Want to switch to the ELK Stack for your logging? Even better, want to get it running on your Azure cloud? This guide will walk you through setting up each component.
June 13, 2017
by PJ Hagerty
· 12,622 Views · 3 Likes
article thumbnail
Identifying Duplicate Files in AWS S3 With Apache Spark
Using Spark, you can identify duplicate files in your S3 storage by calculating checksums. It's a quick, easy way to ensure you aren't carrying extra weight.
June 12, 2017
by Nikhil Bhide
· 17,175 Views · 9 Likes
article thumbnail
Predictive Analytics and Machine Learning Explained Through Dog Memes
The way that memes go viral is very similar to the way that Machine Learning and predictive analytics work. How in the world could this be?!
June 4, 2017
by Gur Tirosh
· 8,871 Views · 1 Like
article thumbnail
Apache Flume: Regex Filtering
There's so much precious data out there that it can be difficult for humans to get meaning out of it sometimes. Apache Flume to the rescue!
Updated June 1, 2017
by Nikhil Bhide
· 13,860 Views · 13 Likes
article thumbnail
Advanced Analytics in Order to Cash Process
In this article, we'll take a look at the cases where advanced analytics can be implemented in an Order to Cash (O2C) process, going over potential use cases for anomaly detection, streaming analytics, and recommendation engines.
June 1, 2017
by Sachit Das
· 15,713 Views · 6 Likes
article thumbnail
How the Internet of Things Will Affect Database Management
The Internet of Things poses unprecedented challenges for database administrators in terms of scalability, flexibility, and connectivity.
May 31, 2017
by Darren Perucci
· 6,732 Views · 4 Likes
  • Previous
  • ...
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • ...
  • Next
  • RSS
  • X
  • Facebook

ABOUT US

  • About DZone
  • Support and feedback
  • Community research

ADVERTISE

  • Advertise with DZone

CONTRIBUTE ON DZONE

  • Article Submission Guidelines
  • Become a Contributor
  • Core Program
  • Visit the Writers' Zone

LEGAL

  • Terms of Service
  • Privacy Policy

CONTACT US

  • 3343 Perimeter Hill Drive
  • Suite 215
  • Nashville, TN 37211
  • [email protected]

Let's be friends:

  • RSS
  • X
  • Facebook
×