Make sure you didn't miss anything with this list of the Best of the Week in the Big Data Zone (Apr. 04 to Apr. 10). Here they are, in order of popularity:
There is a LOT of hype around the IoT. Lots of vendors are selling proprietary solutions that have very little to do with the Internet of Things but everything to do with locking customers into a single solution. If we are going to have a truly open Internet of Things, the solutions will need to be interoperable.
Apache Spark is an increasingly popular alternative to MapReduce, offering a more performant execution engine while still using Hadoop HDFS as the storage engine for large data sets.
Continuing his trip into the Lucene codebase, the author is now looking into the indexing process as it happens. Interestingly enough, that is something that we never really had to look at before.
In this recap of a podcast with Bikas Saha and Arun Murthy, the author got to hear about some of what is in Hadoop 2.4 and what is coming in 2.5.
How do you sort on a field value? The answer is: not easily.