Big Data Resources

Word Count Program With MapReduce and Java

In this post, we provide an introduction to the basics of MapReduce, along with a tutorial to create a word count app using Hadoop and Java.

March 3, 2016

by Shital Kat

· 449,857 Views · 31 Likes

Linking Apache Ignite and Apache Kafka for Highly Scalable and Reliable Data Processing

Here's how to link Apache Kafka and Ignite, for maintaining scalability and reliability for data processing. We'll explore injecting data with KafkaStreamer, as well as IgniteSinkConnector.

March 3, 2016

by Roman Shtykh

· 24,482 Views · 13 Likes

Real-time Data Pipelines with Kafka Connect

Information about Kafka Connect sourced from Spark Summit East 2016.

February 19, 2016

by Tim Spann

CORE

· 16,697 Views · 5 Likes

Loading Data Into Azure SQL Data Warehouse

I’m not an ETL expert. In fact, I haven’t done any professional ETL work for several years. My skills are, at best, rusty. With this in mind, I knew I’d have a hard time extracting data from a local database in order to move it up to Azure SQL Data Warehouse. Read on to hear about my journey.

February 18, 2016

by Grant Fritchey

· 14,275 Views · 6 Likes

Making the Impossible Possible with Tachyon: Accelerate Spark Jobs from Hours to Seconds

Barclays Data Scientist Gianmario Spacagna and Harry Powell, Head of Advanced Analytics, describe how they iteratively process raw data directly from the central data warehouse into Spark and how Tachyon is their key enabling technology.

February 16, 2016

by Henry Powell

· 38,112 Views · 8 Likes

Report Finds Wearables Will Skyrocket in the Enterprise

Wearables have infiltrated the IoT space, but they're relegated mainly to the consumer market. However, 2016 may see wider adoption in the enterprise. Check out what APX Labs discovered in a recent report, "What's Next in Wearable Technology."

February 12, 2016

by Amy Groden-Morrison

· 4,493 Views · 1 Like

Executives' Perspectives on the Evolution of Data Management

Data management has evolved very rapidly since the introduction of Hadoop and big data.

February 12, 2016

by Tom Smith

CORE

· 5,713 Views · 1 Like

Solr vs Elasticsearch: Battle of The Query DSLs

Comparing Solr and Elasticsearch on how they control ranking, and how their query DSLs stack up against each other.

February 10, 2016

by Doug Turnbull

· 10,997 Views · 11 Likes

Fundamentals of Big Data Log Analytics

A presentation and slide deck on using several different tools including Graylog Splunk, and TIBCO to analyze log data.

February 8, 2016

by Kai Wähner

CORE

· 10,239 Views · 3 Likes

Problems Being Solved With Databases — Executives' Perspectives

Databases are enabling companies to use data to inform real-time decisions about their business as well as to use predictive analytics to make better informed, real-time decisions.

February 8, 2016

by Tom Smith

CORE

· 22,872 Views · 2 Likes

What Are the Most Important Elements of Databases?

The most important elements of the database depend upon the application at hand.

February 7, 2016

by Tom Smith

CORE

· 24,748 Views · 3 Likes

LogPacker: A New Log Management Platform

Interested in log management? Check out LogPacker, a new log management platform! Neat features include scanning, aggregation, clustering, and more!

January 30, 2016

by Vladislav Chernov

· 5,538 Views · 4 Likes

How Amazon Uses Its Own Cloud to Process Vast, Multidimensional Datasets

Big Data has permeated a number of industries. Check out how companies like Amazon are using Big Data to delver business value in a several neat cases.

January 26, 2016

by Ann

· 41,210 Views · 6 Likes