Big Data Resources

An Overview of the Kafka Distributed Message System (Part 2)

We finish this small series by exploring brokers, topics, partitions, and more in the open source Apache Kafka platform.

December 10, 2018

by Leona Zhang

· 15,190 Views · 9 Likes

An Overview of the Kafka Distributed Message System (Part 1)

Apache Kafka is one of the most popular tools for big data out there. If you're curious as to why, read on to take a look under the hood!

December 7, 2018

by Leona Zhang

· 33,629 Views · 17 Likes

Flink State Management and Fault Tolerance for Real-Time Computing

We look at the processing of stateful streaming data, state interfaces, and the implementation of state management and fault tolerance in Apache Flink.

December 5, 2018

by Leona Zhang

· 11,579 Views · 3 Likes

What Is Amazon Machine Learning: 8 Benefits of AWS ML

Explore eight benefits of Amazon Machine Learning.

Updated December 5, 2018

by Rinu Gour

· 11,114 Views · 4 Likes

Introduction to SparkSession

We go over how to use this new feature of Apache Spark 2.0, covering all the Scala and SQL code you'll need to get started.

December 5, 2018

by Abhishek Baranwal

· 66,384 Views · 3 Likes

Anomaly Detection With Isolation Forests Using H2O

There are multiple approaches to an unsupervised anomaly detection problem that try to exploit the differences between the properties of common and unique observations.

Updated December 3, 2018

by Martin Barus

· 13,932 Views · 4 Likes

Machine Learning and Pattern Recognition

What's the difference between ML and pattern recognition?

November 30, 2018

by Chandu Siva

· 90,947 Views · 4 Likes

Table Store Time Series Data Storage Architecture

Explore table store time series data storage architecture.

November 22, 2018

by Leona Zhang

· 31,807 Views · 9 Likes

Data Warehouse Solutions: On-Prem and Cloud-Based

Montague vs. Capulet. Gretzky vs. Lemieux. On-prem vs. cloud-based data warehouse solutions. The three biggest rivalries of all time.

November 21, 2018

by John Hammink

· 13,079 Views · 2 Likes

Building a Graph Database on a Key-Value Store?

Is the next generation of graph databases going to be based on key-value stores? Is there a renaissance on its way?

November 21, 2018

by Yu Xu

· 17,441 Views · 2 Likes

How to Execute Distributed MapReduce on Java Over Data Stored in Redis

What is MapReduce and why is it helpful?

November 21, 2018

by Nikita Koksharov

· 15,847 Views · 4 Likes

Real-Time Stock Processing With Apache NiFi and Apache Kafka, Part 1

A big data expert starts his series on using Kafka and NiFi for real-time data flow programming.

November 20, 2018

by Tim Spann

CORE

· 44,094 Views · 15 Likes

23 Useful Elasticsearch Example Queries

Don't forget to bookmark this article for quick reference when you need it!

Updated November 19, 2018

by Tim Ojo

· 904,348 Views · 90 Likes

Design Patterns for Microservice-To-Microservice Communication

Let's learn about design patterns for synchronous and asynchronous communication between microservices.

November 13, 2018

by Rajesh Bhojwani

· 69,016 Views · 34 Likes

Top 10 Machine Learning, Deep Learning, and Data Science Courses for Beginners (Python and R)

This article includes a list of some of the best courses to learn Data Science, Machine Learning, and Deep Learning.

November 12, 2018

by Javin Paul

· 17,733 Views · 10 Likes

CoAP Protocol: Step-by-Step Guide

Want to learn more about the CoAP protocol for IoT devices? Check out this post where we explore using CoAP and how it differs from MQTT.

Updated November 8, 2018

by Francesco Azzola

· 98,707 Views · 6 Likes

Top 10 Most Popular AI Models

While AI and ML provide ample possibilities for businesses to improve their operations and maximize their revenues, there is no such thing as a “free lunch.”

Updated November 8, 2018

by Vladimir Fedak

· 125,703 Views · 21 Likes

Reporting and Analysis With Elasticsearch

A software developer gives an overview of Elasticsearch and the Elastic Stack, while diving into her experiences with the big data platform and search engine.

November 8, 2018

by Veronika Rovnik

· 18,824 Views · 11 Likes

Message Producer and Consumer Using Golang on CloudAMQP

This article explains a bit about how asynchronous messaging operates and gives an example using Golang and RabbitMQ as the message broker.

November 7, 2018

by Sirish Kumar

· 13,273 Views · 3 Likes

Overview of the Data Science Pipeline

Interested in undertaking data science projects? Read on for a high-level overview of the data science process and the skills required.

November 6, 2018

by Vinit Saini

· 23,468 Views · 6 Likes

The Latest Big Data Topics