Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

The Best of the Week (May 30): Big Data Zone

DZone's Guide to

The Best of the Week (May 30): Big Data Zone

· Big Data Zone ·
Free Resource

The open source HPCC Systems platform is a proven, easy to use solution for managing data at scale. Visit our Easy Guide to learn more about this completely free platform, test drive some code in the online Playground, and get started today.

Make sure you didn't miss anything with this list of the Best of the Week in the Big Data Zone (May 30 to June 6). Here they are, in order of popularity:

1. Apache Spark v1.0 Solidifies Its Place Among Big Data Tools

The Apache Foundation announced today the release of Apache Spark v1.0, an open source large-scale data processing and advanced analytics engine.

2. The Problem with Hadoop in HPC

When it comes to handling big data, Hadoop is a major player – but it doesn't seem to have much traction in the high performance computing community.

3. Is there a future for Map/Reduce?

In my opinion, Map/Reduce is an idea whose time has come and gone – it won’t die in a day or a year, there is still a lot of working systems that use it and the alternatives are still maturing.

4. Understanding how Parquet Integrates with Avro, Thrift and Protocol Buffers

Parquet is a new columnar storage format that come out of a collaboration between Twitter and Cloudera. Parquet’s generating a lot of excitement in the community for good reason - it’s shaping up to be the next big thing for data storage in Hadoop.

5. Powers of Ten – Part I

Developing complex graph analytics over a multi-billion edge distributed graph represent the adventures that await.

Managing data at scale doesn’t have to be hard. Find out how the completely free, open source HPCC Systems platform makes it easier to update, easier to program, easier to integrate data, and easier to manage clusters. Download and get started today.

Topics:

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}