NoSQL Zone is brought to you in partnership with:
  • submit to reddit
Alec Noller07/04/14
3210 views
0 replies

ArangoDB 2.2.0 Beta Released

The ArangoDB 2.2.0 beta has been released! We covered ArangoDB a while back, and the NoSQL database has been steadily improving since then. According to Frank Celler's release announcement, this most recent release covers a wide variety of bug fixes and changes, but focuses on three major items.

Nikhil Kuriakose07/03/14
631 views
0 replies

Implementing Word Ladder Game Using Neo4j

In a word ladder puzzle you must make the change occur gradually by changing one letter at a time. At each step you must transform one word into another word, you are not allowed to transform a word into a non-word. See how to neatly implement this using Neo4j.

Shane Johnson07/03/14
937 views
0 replies

Couchbase Blueprints in Apache Brooklyn

The cloud is the new operating system, and projects like Apache Brooklyn enable the enterprise to be more efficient. What if you could deploy a dozen instances of Couchbase Server to Amazon Web Servers with the click of a button?

Chris Chang07/02/14
3136 views
0 replies

Reporting Back from MongoDB World 2014, NYC, Planet JSON

Closely approaching the one year mark of when I first joined MongoLab (and the MongoDB community), I had the pleasure of attending the inaugural MongoDB World conference put together by the incredible MongoDB team. There was a lot to digest from the conference... here are the major highlights.

Mark Needham07/02/14
2449 views
0 replies

Neo4j & Cypher: Finding Movies by Decade

The author was recently asked how to find the number of movies produced per decade in the movie data set that comes with the Neo4j browser. In this article, you'll find out how.

Rishav Rohit07/01/14
3514 views
0 replies

Update Fixed number of MongoDB Documents

In a recent MongoDB and R project, the author faced a new problem. He was using R to process source data present in MongoDB, and if he gave a large number of documents to R for analysis, it was becoming slower and a bottleneck.

Michael Hunger07/01/14
968 views
0 replies

LOAD CSV into Neo4j Quickly and Successfully

Since version 2.1, Neo4j provides out-of-the box support for CSV ingestion. But hear my words of advice before you jump directly into using it. There are some tweaks and configuration aspects that you should know to be successful on the first run.

Alec Noller06/30/14
2895 views
0 replies

Why Shared Storage Hinders Performance in Cassandra

One question that keeps coming up, according to Jonathan Lacefield, is why shared storage is not recommended for Cassandra. The short version? Performance suffers, and it introduces a single point of failure. Lacefield's explanation, however, aims to clarify what "performance suffers" really means.

Michael Hunger06/30/14
1468 views
0 replies

Rendering a Neo4j Database in UbiGraph

UbiGraph is a graph rendering server that is controlled remotely and also interactively with a XML-RPC API (which is a weird choice). It comes with example clients in Java, Python, Ruby and C. In this article, you'll learn the basics of UbiGraph and how to render Neo4j with it.

Alec Noller06/29/14
3210 views
0 replies

The Best of the Week (June 20): NoSQL Zone

Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone. This week's best include a look at performing CRUD operations on MongoDB in a Node.js app, the need for DBaaS in the app economy, and thoughts on how to control an ever-expanding MongoDB database.

Alec Noller06/27/14
2476 views
0 replies

Cayley: An Open Source Graph Database Written in Go

If you're looking for something new and a bit different when it comes to NoSQL solutions, you might be interested in Cayley, an open source graph database written in Go and based on Freebase and Google's Knowledge Graph.

Shane Johnson06/27/14
1267 views
0 replies

MongoDB and DataStax, In the Rearview Mirror

Thumbtack published an excellent blog post highlighting the preliminary results of performance tests executed with Couchbase Server, MongoDB and DataStax Enterprise (Apache Cassandra). The final results will be included in a benchmark report.

Zardosht Kasheff06/26/14
948 views
0 replies

Use TokuMX Partitioned Collections in Place of TTL Indexes

Take the following scenario. You have a time-series data application for which you would like to store a rolling period of data. With basic MongoDB, you would likely create a collection with a “TTL”, or “time to live” index. While simple to use, this solution can run into performance problems.

Ravi Kiran06/25/14
4377 views
0 replies

Performing CRUD Operations on MongoDB in a Node.js Application Using MongoDB Driver

There are a number of drivers created by the community to interact with MongoDB from a Node.js app. The official mongodb driver seems to be the simplest of them. In this post, we will learn to perform simple CRUD operations on a MongoDB document store using the mongodb driver.

Alec Noller06/25/14
2163 views
0 replies

Defending the Cassandra Benchmark: What it Means to Compare NoSQL Performance

You may have heard about Jonathan Ellis criticizing Thumbtack Technology's NoSQL benchmarks - in short, he suggested that the benchmarks were improperly configured and understated Cassandra's performance. Well, Ben Engber at Thumbtack Technology heard about it, and according to his response, Ellis is way off.

Alec Noller06/24/14
1709 views
0 replies

Benchmarking Cassandra: The Right & Wrong Way to Do it

Everybody loves comparing databases. Not everybody agrees on how to do it, though. One prime example is Thumbtack Technology's benchmarks comparing Cassandra, Couchbase, MongoDB, and Aerospike. The problem, according to Jonathan Ellis, is that the benchmarks give Cassandra a raw deal.

A. Jesse Jiryu Davis06/24/14
542 views
0 replies

Rules of Thumb for Methods and Functions

The Python team at MongoDB is partially rewriting PyMongo. The next version, 3.0, aims to be faster, more flexible, and more maintainable than the current 2.x series. One strategy is to minimize methods, period. In this article, you'll find the author's rules of thumb when it comes to methods and functions.

Alec Noller06/23/14
2194 views
0 replies

How to Control Your Expanding MongoDB Database

The key question asked by DJ Walker-Morgan in this recent post from MongoHQ is an important one: do you actually know how big your database is? As Walker-Morgan points out, most people probably have a number they can point to, but the number may not be communicating exactly what they think it is.

Ketan Parmar06/23/14
796 views
0 replies

NoSQL: A Quick Guide

NoSQL is a buzzword now-a-days among the developers and software professionals. In this article, you'll find a quick guide to NoSQL, including what it is, where to use it, advantages and disadvantages, and some of the more popular NoSQL options.

Alec Noller06/22/14
1903 views
0 replies

The Best of the Week (June 13): NoSQL Zone

Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone. This week's best include a tutorial on building a TV show tracker with MongoDB, Node.js, AngularJS, a look at MongoDB and Grails, 16 of the top NoSQL and NewSQL databases, and more.

Kurt Mackey06/21/14
2234 views
0 replies

The App Economy Will Never Fully Take Flight Without DBaaS

Developers are continually upping the ante by creating better, smarter and more valuable apps. However, these apps also have increasingly sophisticated data requirements, and the ability to take them to the next level may be stymied by an archaic approach to databases.

Zardosht Kasheff06/20/14
1942 views
0 replies

Best Practices for Partitioned Collections and Tables in TokuDB and TokuMX

Last time, the author gave a technical explanation of the performance characteristics of partitioned collections in TokuMX 1.5 and partitioned tables in relational databases. Given those characteristics, in this post, he will present some best practices when using this feature in TokuMX or TokuDB.

Mark Needham06/20/14
606 views
0 replies

Neo4j: LOAD CSV - Handling Conditionals

While building up the Neo4j World Cup Graph, the author has been making use of the LOAD CSV function and he frequently found himself needing to do different things depending on the value in one of the columns. In this article, the author explores handling conditionals to do so.

Zardosht Kasheff06/19/14
1894 views
0 replies

Understanding the Performance Characteristics of Partitioned Collections

In TokuMX 1.5 that is right around the corner, the big feature will be partitioned collections. This feature is similar to partitioned tables in Oracle, MySQL, SQL Server, and Postgres. A question many have is “why should I use partitioned tables?” In short, it’s complicated.

Itamar Syn-hershko06/19/14
563 views
0 replies

RavenDB to Elasticsearch Replication for Real-Time Reporting and Data-Analysis

While replicating data from RavenDB to SQL Server or the like does make sense, every report can take a while to generate. Replicating to Elasticsearch provides real-time view of the data, and fast reporting capabilities on it. Now, how do we get data to it from a RavenDB database?