Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone. This week's best include a TokuMX and MongoDB Oplog entry comparison, the release of MongoDB 2.6, announcing Couchbase Server 2.5.1, and more.
For a look at what's been happening outside of the NoSQL Zone, we've assembled a collection of links from around the web, including a look at MapReduce performance on SSDs, Oracle's intention to create NoSQL standards, a new data structure for Redis, Cassandra at one million writes per second, and more.
In most applications, the disk IO will typically end up being your main bottleneck, all other silly bottlenecks being worked out (CPU, number of connections, etc). And whether our competitors like it or not, the write locks are rarely the bottlenecks in a well designed application.
The author is excited to offer you Motor 0.2, release candidate zero. Motor is his non-blocking driver for MongoDB and Tornado.
In this post, the author will be discussing the various metrics that should be used to understand the sizing needs of a running Couchbase Server cluster. He'll also attempt to provide guidance around thresholds and alerts where appropriate. Grab a coffee, this is gonna be a long one...
I know what you're thinking: What would John Lennon think about NoSQL databases? Well, whether you're thinking that or not, you don't have to anymore, because Andy Tiffany has it covered with this NoSQL-themed rendition of John Lennon's "Imagine."
With Neo4j 2.0, we got automatic schema indexes based on labels and properties for exact lookups of nodes on property values. In this post, you'll learn to set up node auto-index as fulltext-index, enable node auto-index for certain properties, use the fulltext auto-index and the Java API, and more.
MongoDB is evolving rapidly. The 2.2 version introduced the aggregation framework as an alternative to the Map-Reduce query model. Generating aggregated reports is a recurrent requirement for enterprise systems, and MongoDB shines in this regard.
The author is pleased to announce the coordinated release of drivers in 9 languages in preparation for the release of MongoDB 2.6. This is the largest driver release in the history of MongoDB, both in terms of code changes as well as in terms of drivers released.
The @Resource annotation appeared for the first time in Java EE 5 specification. When you annotate an attribute with @Resource, it will be the container responsible for injecting the requested resource. The typical example is used with the DataSource class.
The author is currently working on a project using Voron, and was horrified to get the FatalExecutionEngineException that was discussed previously. After a lot of work, he managed to narrow it down to… not Voron. To be rather more exact, it is Voron that is causing it, but it isn’t Voron’s fault.
As much as I love geeking out about basketball stats, I want to put a MongoDB data set out there that’s a bit more app-friendly: the USDA SR25 nutrient database. You can download this data set from my S3 bucket here, and plug it into your MongoDB instance using mongorestore.
The author announces with satisfaction that they've released PyMongo 2.7, the successor to PyMongo 2.6.3. The bulk of the driver's changes are to support MongoDB 2.6, which is currently a release candidate.
Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone. This week's best include MongoDB's issues with scaling out, how to delete lots of data from Neo4j with Cypher, and tips on clustering with GridGain.
In the next series of posts, the author will describe how TokuMX replication keeps data in sync between machines in the replica set. In doing so, he will address the challenges they faced and algorithms they developed to address them.
We’re happy to announce the release of Couchbase Server 2.5.1 Enterprise Edition. As many of you may know, the 2.5 release introduced several exciting features like rack-zone awareness and, secure cross datacenter replication that provided better reliability and security for enterprises.
MongoDB replication has some great features. As the author discussed in his last post, MongoDB’s crash safety design is very elegant. Despite these great features, TokuMX replication is completely incompatible with MongoDB’s replication. As the saying goes, this is a feature and not a bug.
Sometimes you’ll find yourself looking at a traversal and thinking… “I’m going to be doing this one thing over and over again.” That sounds kind of wasteful and years of recycling have taught us not to be wasteful.
If a NoSQL database can be deployed on-premise or it can be deployed in the cloud, why can’t it be deployed on-premise and in the cloud? It can, and it should. This article highlights a variety of hybrid cloud use cases for NoSQL database deployments.
Recommendation is a hot topic these days. Using a graph database it is easy to create great recommendation queries. Here are a few steps for designing excellent recommendation queries for your applications.
Before diving deeper into what it means to easily cluster an application, let's start by defining what a cluster is. Wikipedia provides a high level definition that covers fault tolerance, load balancing, scheduling, etc. However, the real magic behind clustering is in making these complex operations seem easy.
The latest MLJS release is now available. Changes include updated API and tutorial docs, support for advanced rendering to the Search Results widget, heat maps on the OpenLayers widget, address lookup using Google geocode search, and more. Read on for all the details.
There was a question on the Neo4j Google Group about returning results in a random order in Cypher. So the author thought explaining it in a blog post (this) and an interactive GraphGist is better than just to answer the email.
Someone asked on Twitter today for a way to trigger a connection failure between MongoDB and the client. This would be terribly useful when you're testing your application's handling of network hiccups.
This is the second part of a two part series. In the first part, the author covered how important clocks are and how bad clocks can be in virtualized systems (like Amazon EC2) today. In today’s installment, he is going to cover some disadvantages of off-the-shelf NTP installations, and how to overcome them.