Make sure you didn't miss anything with this list of the Best of the Week in the Big Data Zone (Feb. 7 to Feb. 13). Here they are, in order of popularity:
It's a common problem: As data grows, performance suffers. That was the case in this article - an expanding dataset led to nodes that became unresponsive for seconds at a time, and even bigger problems - which was eventually solved by tuning the JVM to cooperate better with Cassandra.
VoltDB is welcoming Groovy into its ecosystem as its first inline procedure language. Code your procedure logic straight into the DDL, bypassing the Java procedure requirements to edit/compile Java source files separately. In this article, you'll learn how to work with VoltDB using Groovy.
One of the author's new year resolutions is to do a project with Neo4j Spatial, so he's kicked off his first blog post of the year with a gentle introduction to this awesome plugin.
Because MongoDB’s main data store is a flat file and secondary indexes store offsets into the flat file, if the location of a document changes, corresponding entries in secondary indexes must also change, which can be expensive. What interests the author about this problem is the underlying cause.
One of the features added into the more recent versions of Neo4j’s Cypher query language is the ability to tell Cypher which index you’d like to use in your queries. In this article, the author uses the football dataset to explore this feature.