Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

The Best of the Week (Oct 17): Big Data Zone

DZone's Guide to

The Best of the Week (Oct 17): Big Data Zone

· Big Data Zone ·
Free Resource

Hortonworks Sandbox for HDP and HDF is your chance to get started on learning, developing, testing and trying out new features. Each download comes preconfigured with interactive tutorials, sample data and developments from the Apache community.

Make sure you didn't miss anything with this list of the Best of the Week in the Big Data Zone (October 17 - October 24). Here they are, in order of popularity:

1. FAQ of Executives Regarding Apache Hadoop

  • Apache Hadoop has slowly been infiltrating the mainstream business world, but many executives are still left with doubts about whether adopting Hadoop is a sound strategy for their organization. Is Hadoop enterprise friendly? Is it economical for an organization to use?

2. Understanding Information Retrieval by Using Apache Lucene and Tika - Part 2

  • A sequal of what was implemented in Part 1 of this tutorial; we continue indexing and improving search conditions through different features provided by the Apache Lucene library.

3. Understanding Information Retrieval by Using Apache Lucene and Tika - Part 1

  • This tutorial will explain the Lucene and Tika frameworks will be explained through their core concepts (parsing, mime detection, indexing, scoring, boosting) via illustrative examples that should be applicable to not only seasoned software developers but to beginners to content analysis and programming as well.

4. Understanding Information Retrieval by Using Apache Lucene and Tika - Part 3

  • This is a sequal of what was presented in part 1 and part 2 of this tutorial; after indexing and querying we can highlight the results of a search by making use of Highlighter(s).

5. Validate Configuration on Startup

  • Do you remember that time when you spent a whole day trying to fix a problem, only to realize that you have mistyped a configuration setting? Avoiding that is not trivial, as not only you, but also the frameworks that you use should take care. But let me outline my suggestion.

Hortonworks Sandbox for HDP and HDF is your chance to get started on learning, developing, testing and trying out new features. Each download comes preconfigured with interactive tutorials, sample data and developments from the Apache community.

Topics:

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}