Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone. This week's best include MongoDB's issues with scaling out, how to delete lots of data from Neo4j with Cypher, and tips on clustering with GridGain.
In the next series of posts, the author will describe how TokuMX replication keeps data in sync between machines in the replica set. In doing so, he will address the challenges they faced and algorithms they developed to address them.
We’re happy to announce the release of Couchbase Server 2.5.1 Enterprise Edition. As many of you may know, the 2.5 release introduced several exciting features like rack-zone awareness and, secure cross datacenter replication that provided better reliability and security for enterprises.
MongoDB replication has some great features. As the author discussed in his last post, MongoDB’s crash safety design is very elegant. Despite these great features, TokuMX replication is completely incompatible with MongoDB’s replication. As the saying goes, this is a feature and not a bug.
Sometimes you’ll find yourself looking at a traversal and thinking… “I’m going to be doing this one thing over and over again.” That sounds kind of wasteful and years of recycling have taught us not to be wasteful.
If a NoSQL database can be deployed on-premise or it can be deployed in the cloud, why can’t it be deployed on-premise and in the cloud? It can, and it should. This article highlights a variety of hybrid cloud use cases for NoSQL database deployments.
Recommendation is a hot topic these days. Using a graph database it is easy to create great recommendation queries. Here are a few steps for designing excellent recommendation queries for your applications.
Before diving deeper into what it means to easily cluster an application, let's start by defining what a cluster is. Wikipedia provides a high level definition that covers fault tolerance, load balancing, scheduling, etc. However, the real magic behind clustering is in making these complex operations seem easy.
The latest MLJS release is now available. Changes include updated API and tutorial docs, support for advanced rendering to the Search Results widget, heat maps on the OpenLayers widget, address lookup using Google geocode search, and more. Read on for all the details.
There was a question on the Neo4j Google Group about returning results in a random order in Cypher. So the author thought explaining it in a blog post (this) and an interactive GraphGist is better than just to answer the email.
Someone asked on Twitter today for a way to trigger a connection failure between MongoDB and the client. This would be terribly useful when you're testing your application's handling of network hiccups.
This is the second part of a two part series. In the first part, the author covered how important clocks are and how bad clocks can be in virtualized systems (like Amazon EC2) today. In today’s installment, he is going to cover some disadvantages of off-the-shelf NTP installations, and how to overcome them.
Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone. This week's best include 10 tips on how to run MongoDB at scale, how to find duplicate keys with MongoDB's aggregation framework, running Spark on Cassandra, and more.
If you want to delete lots of data from a Neo4j database with Cypher, just stop the server and delete the directory and start again. However, there are some tricks to be aware of.
You may have heard that MongoDB has issues with scaling out. You may have heard that Viber is replacing MongoDB with Couchbase Server. Have you heard that MongoHQ has to scale up?
When developing web applications it becomes very useful to start by authoring an API over your data model in order to expose certain services to data-driven components of your front-end application. For Neo4j, the best API authoring solution out there is Neo4j Swagger.
At an extremely high level, replication in MongoDB and MySQL are similar. Underneath the covers, though, their implementations are quite different. And in peeking underneath the covers while developing TokuMX, the author learned more about his favorite thing in MongoDB replication: crash safety.
Neo4j 2.0 introduced the UNION (ALL) clause, which can join the results of 2 or more complete statements into a single result. In this article, you'll find some examples for using UNION and UNION ALL.
When you're looking to run analytics on complex data sets, you might instinctively reach for Hadoop. However, if your data’s in MongoDB, using the Hadoop connector seems like overkill if your data fits on your laptop. Luckily, MongoDB’s built-in aggregation framework offers a quick solution.
Cassandra is a highly-distributable NoSQL database with tunable consistency. What makes it highly distributable makes it also, in part, vulnerable: the whole deployment must run on synchronized clocks.
We all know that MongoDB is web scale, but how, exactly, does one run MongoDB at scale? This recent article provides a list of ten tips for doing so, each including a brief explanation and outlining potential pitfalls and optimization tricks for each.
Recently the author had a request for Cassandra-style counters at the mailing list. And as long as he is doing a feature design series, he thought that he could talk about how he would go about implementing this. Again, consider this fiction, he has no plans of implementing this at this time.
Quite frequently our users want to create a unique index on a data set but encounter some form of the following error because of duplicate key value(s). While MongoDB supports an option to drop duplicates, dropDups, during index builds, this option forces the creation of a unique index by way of deleting data.
Accessing your beloved Neo4j-Shell via RMI works OK on localhost or in your intranet. But over the internet, you don’t really want to expose RMI ports.