Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

The Best of the Week (June 13): Big Data Zone

DZone's Guide to

The Best of the Week (June 13): Big Data Zone

· Big Data Zone ·
Free Resource

The open source HPCC Systems platform is a proven, easy to use solution for managing data at scale. Visit our Easy Guide to learn more about this completely free platform, test drive some code in the online Playground, and get started today.

Make sure you didn't miss anything with this list of the Best of the Week in the Big Data Zone (June 13 to June 20). Here they are, in order of popularity:

1. Python 101: An Introduction to Python’s Debugger

Python comes with its own debugger module that is named pdb. This module provides an interactive source code debugger for your Python programs. You can set breakpoints, step through your code, inspect stack frames and more.

2. What is Search Relevancy?

Have you ever tried a site’s search and been underwhelmed with the accuracy of the results? Do you find yourself feeling frustrated and leaving when the search doesn’t return what you’re looking for?

3. Mining Patent Data to Understand the Nature of Invention

In an article on the MIT Technology Review , we get a glimpse of one project in which patent data – records that go back several centuries – is used to gain a better understanding of the process of invention.

4. Conjecture: Scalable Machine Learning in Hadoop with Scalding

When it comes to predictive modeling and machine learning, the most obvious product of engineering work that is seen client-side are those tailored ads: they scour your internet behavior and feed you content based on your preferences.

5. Anomaly Detection : A Survey

Anomaly detection refers to the problem of finding patterns in data that do not conform to expected behavior. These non-conforming patterns are often referred to as anomalies, outliers, discordant observations, exceptions, aberrations, surprises, peculiarities or contaminants.

Managing data at scale doesn’t have to be hard. Find out how the completely free, open source HPCC Systems platform makes it easier to update, easier to program, easier to integrate data, and easier to manage clusters. Download and get started today.

Topics:

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}