The author has recently spent a bit of time working with people on their graph commons, and a common pattern he's come across is that although the models have lots of relationships, there are often missing nodes.
Cassandra users looking to make their lives easier might benefit from using Cassandra on Apache Mesos. This recent post provides a tutorial on how to get started, arguing that the two technologies are a great fit for each other because of Cassandra's peer-to-peer architecture.
Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone! This week's best include debugging a failing unit-test which interacts with RavenDB, part two of a tutorial on building a recommendation engine in Neo4j, why Cassandra's plainness makes it better than MongoDB, and more!
In this series of blog posts, the author describe the most interesting changes in TokuMX 1.4.0 and how they’ll affect users. Part 3 covers performance improvements that were achieved by making two big changes to how updates are implemented.
Finding relationships that should not be there is a great use case for Neo4j, and today the author wants to highlight an example of why: One of the hardest things for SQL based systems to do is cross-check the incoming payment information against existing data looking for relationships that shouldn’t be there.
In MongoDB, the replication oplog is a capped collection, with a fixed size on disk, and therefore the amount of history (measured in days) varies as the application makes changes faster or slower. In TokuMX, capped collections are handled differently.
When your MongoDB becomes unresponsive, it’s imperative that you can quickly identify the cause. Although there can be many reasons for unresponsiveness, we sometimes find that particularly long-running and/or blocking operations (either initiated by a human or an application) are the culprit.
The author got tired of doing arbitrary performance testing, so he decided to work with the FreeDB dataset, a dataset used to look up CD information. It contains a lot of data, and it is production data. That means that it is dirty, which makes it perfect to run all sort of interesting scenarios.
Tokutek just released version 1.4.0 of TokuMX, our high-performance distribution of MongoDB. There are a lot of improvements in this version (release notes), the most of any release yet. In this series of blog posts, the authors describe the most interesting changes and how they’ll affect users.
This recent article presents a comparison of Cassandra and MongoDB, although it's pretty clearly weighted in one direction. Yes, saying bad things about MongoDB is nothing new, but this article is interesting in that the anti-MongoDB framing is really not the core of his point. Ultimately, this is all about Cassandra.
Last week, Couchbase Server 2.5 Enterprise Edition was released. In this article, you'll find an overview of the key highlights of the release, including Rack-Zone Awareness, secure cross datacenter replication (XDCR), optimized connection management, and more.
Developers familiar with relational databases and SQL but interested in moving to MongoDB might find this recent article to be helpful. It explores the structure and use of MongoDB in terms of relational databases and SQL, making comparisons in order to draw on existing knowledge.
The author has been playing around with calling Neo4j’s Java API from Python via Jython and immediately ran into the following exception when trying to create an embedded instance: java.lang.ExceptionInInitializerError
A few days ago, Bernie Hackett and the author shipped a release candidate for PyMongo 2.7, with substantial contributions from Amalia Hawkins and Kyle Erf. This version supports new features in the upcoming MongoDB 2.6, and includes major internal improvements in the driver code.
One of the critical aspects to using conditional updates is determining whether the update was applied. In this article, you'll learn how to do it.
A lot of people love the new Neo4j browser shipped with 2.0 and subsequent releases. This is a nicely-built locally-running web application running in your browser. At the top, users can easily type their Cypher code and see results after executing, either in tabular form or as a visualization enabling to click through.
Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone. This week's best include tips on how to tune the JVM to improve performance in Cassandra, VoltDB's embrace of Groovy, part 1 of a series on building a recommendation engine with Neo4j, and more!
In part 1 of this series, the author looked at how to get started with Neo4j Spatial and at some of the pieces we’ll use today to build a proof of concept application. The author is calling the application “Nom Nom Nom." So we’ll get data from Factual, from OpenTable, combine them, and import them into Neo4j.
This week the author is working with Particular Software of NServiceBus fame. There's a shiny new platform for designing, managing and debugging distributed systems coming up, and while doing some work on it, he hit the following failing test.
A couple of months ago the author wrote a blog post explaining how he’d applied Wes Freeman’s Cypher optimisation patterns to a query. Since then, Neo4j 2.0.0 has been released, and he's extended the model, so he thought he’d try again.
MongoDB hosting on AWS is a very popular scenario today. When researching a managed hosting provider on AWS, there are a number of aspects that need to be considered. In this article are five questions to ask your Mongo hosting provider to get a better understanding of your options
One of the author's new year resolutions is to do a project with Neo4j Spatial, so he's kicked off his first blog post of the year with a gentle introduction to this awesome plugin.
VoltDB is welcoming Groovy into its ecosystem as its first inline procedure language. Code your procedure logic straight into the DDL, bypassing the Java procedure requirements to edit/compile Java source files separately. In this article, you'll learn how to work with VoltDB using Groovy.
One of the features added into the more recent versions of Neo4j’s Cypher query language is the ability to tell Cypher which index you’d like to use in your queries. In this article, the author uses the football dataset to explore this feature.
As the author laid out in his previous post, there are a number of motivations to start using CQL. CQL has better support for batching, conditional updates, and collections. For those reasons, the author has started working with CQL for Storm/Cassandra integration.