Make sure you didn't miss anything with this list of the Best of the Week in the Big Data Zone (Oct. 25 to Oct. 31). Here they are, in order of popularity:
Douglas Hofstadter is a pioneer in the field of AI, and this recent article uses his life and work as a frame through which to explore common perceptions of AI and machine learning, their origins and purposes, and their role in the Google-centric data-mining world of software today.
In the author's previous post, he gave an introduction to search engines. He showed the concept of an Inverted Index and how it helps with performing searches on large amounts of texts quickly and efficiently. In this post, he'll be discussing the issue of scaling out a search engine.
In this installment of Arthur Charpentier's roundup of stats and data science-related links, we find "Bad Science" with false positives, data on when Americans use mobile apps and what apps they use, a free ebook on machine learning and optimization, and more.
Recursive SQL can be awesome, but a bit hard to read in its SQL standard beauty. In this article, you'll find a quick tutorial on how to work with recursive SQL for data normalization.
One would hope that working with synonyms should be as simple as tossing a set of synonyms into the synonyms.txt file and just having Solr “do the right thing.”™ Unfortunately, especially as you get into more complex uses of synonyms, such as multi-term synonyms, there are several gotchas.