The Best of the Week (Oct. 25): Big Data Zone

Make sure you didn't miss anything with this list of the Best of the Week in the Big Data Zone (Oct. 25 to Oct. 31). Here they are, in order of popularity:

1. Douglas Hofstadter, AI, and Machine Learning: The Gap Between Function and Exploration

Douglas Hofstadter is a pioneer in the field of AI, and this recent article uses his life and work as a frame through which to explore common perceptions of AI and machine learning, their origins and purposes, and their role in the Google-centric data-mining world of software today.

2. Building a Distributed Search Engine: Refactoring Story, Part 2

In the author's previous post, he gave an introduction to search engines and showed how an inverted index makes searches over large amounts of text quick and efficient. In this post, he discusses the issue of scaling out a search engine.

3. Data News: Free eBook on Machine Learning and Optimization, and More

In this installment of Arthur Charpentier's roundup of stats and data science-related links, we find "Bad Science" with false positives, data on when Americans use mobile apps and what apps they use, a free ebook on machine learning and optimization, and more.

4. How to Use Recursive SQL for Data Normalization 

Recursive SQL can be awesome, but it is a bit hard to read in its SQL-standard beauty. In this article, you'll find a quick tutorial on how to work with recursive SQL for data normalization.

5. Why is Multi-term Synonym Mapping so Hard in Solr?

One would hope that working with synonyms would be as simple as tossing a set of synonyms into the synonyms.txt file and just having Solr “do the right thing.”™ Unfortunately, as you get into more complex uses of synonyms, such as multi-term synonyms, there are several gotchas.
