Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

The Best of the Week (Nov. 29): Big Data Zone

DZone's Guide to

The Best of the Week (Nov. 29): Big Data Zone

· Big Data Zone ·
Free Resource

The open source HPCC Systems platform is a proven, easy to use solution for managing data at scale. Visit our Easy Guide to learn more about this completely free platform, test drive some code in the online Playground, and get started today.

Make sure you didn't miss anything with this list of the Best of the Week in the Big Data Zone (Nov. 29 to Dec. 5). Here they are, in order of popularity:

1. Curing Cancer with Data Visualization

Everyone thought that The Netherlands Cancer Institute’s 12-year-old dataset on breast cancer was old news. That was until a researcher, Pek Lum, analyzed and visualized the dataset using topological data analysis (TDA) and advanced machine learning technology.

2. Text Mining with R: Comparing Word Counts in Two Text Documents

Here is a pair of code samples in R designed to compare word counts in two pieces of text. The first attempts to reinvent the wheel, while the second utilizes the capabilities of existing packages.

3. Adding Java 8 Lambda Goodness to JDBC

Data access, specifically SQL access from within Java, has never been nice. This is in large part due to the fact that the JDBC api has a lot of ceremony. In this article, you'll learn how to make SQL access easier in Java using Java 8 Lambda expressions and Streams.

4. Apache Lucene and Solr 4.6

Recently, Apache Lucene and Solr PMC announced another version of Apache Lucene library and Apache Solr search server numbered 4.6. This is a next release continuing the 4th version of both Apache Lucene and Apache Solr.

5. SolrCloud: What Happens When ZooKeeper Fails?

One of the questions the author tends to get is what happens with a SolrCloud cluster when ZooKeeper fails. Not a single ZooKeeper instance failure, but the whole ensemble not being accessible. Because the answer to this question is easy to verify, the author decided to show what happens when ZooKeeper fails.

Managing data at scale doesn’t have to be hard. Find out how the completely free, open source HPCC Systems platform makes it easier to update, easier to program, easier to integrate data, and easier to manage clusters. Download and get started today.

Topics:

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}