NoSQL Zone is brought to you in partnership with:
  • submit to reddit
Mark Needham03/05/14
1359 views
0 replies

Neo4j and Cypher: Set Based Operations

The author was recently reminded of a Neo4j Cypher query that he wrote a couple of years ago to find the colleagues that he hadn’t worked with in the ThoughtWorks London office. In this article, you'll find a model to help explain how to write such a query.

Alec Noller03/04/14
7387 views
0 replies

From MySQL to Cassandra: How to Deal with Application-Level Failure Scenarios

Moving from MySQL to Cassandra can be beneficial for a number of reasons, particularly when it comes to spreading out failure scenarios. However, there are still challenges to be faced. According to this recent blog post on the transition, the Rackspace team encountered a number of hiccups in the process.

Alec Noller03/04/14
5235 views
0 replies

Getting Started with Node.js, Express.js, and MongoDB

If you're looking for a practical application to help you get started with MongoDB (or Node.js, or Express.js, for that matter), you might be interested in this presentation from Karan Goel on getting started with Node.js, Express.js, and MongoDB. You can find the video below, and Goel's slides here.

Leif Walsh03/04/14
1206 views
0 replies

What’s New in TokuMX 1.4, Part 5: Faster Chunk Migrations

Sharding in MongoDB and TokuMX does a great job of scaling an application beyond what a single machine can do, but it also brings new challenges to the table. One of those challenges is how to deal with the impact of migrations on the running system.

Don Pinto03/03/14
4014 views
0 replies

Couchbase Java SDK 1.4.0: New and Noteworthy

Couchbase has released the first developer preview of the 1.4.0 Java SDK. Aside from the usual bugfixes and enhancements, this new release provides support for optimized connection management, which was recently introduced in Couchbase Server 2.5.0. This article provides more information on what's new here.

Moshe Kaplan03/03/14
62160 views
5 replies

When to Use MongoDB Rather than MySQL (or Other RDBMS): The Billing Example

NoSQL has been a hot buzz in the air for a pretty long time (well, it's not only a buzz anymore), and MongoDB has been a major player. However, when should we really use it?

Alec Noller03/03/14
4371 views
0 replies

Using Apache Cassandra for Real-Time Analytics

If you're interested in using Cassandra for real-time analytics, you might find something useful in this talk from Stephane Legay, CTO at LoopLogic, on LoopLogic's use case.

Alec Noller03/02/14
7507 views
0 replies

The Best of the Week (Feb. 21): NoSQL Zone

Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone. This week's best include 30 years of NBA data crunched with MongoDB, a response using PostgreSQL, thoughts on when to use GridFS on MongoDB, and more!

Alec Noller02/28/14
5300 views
0 replies

Find Bugs in MongoDB's New Release and Win Some Prizes

In the timeless words of a great man: "It's a bughunt." Last week, the MongoDB team released MongoDB 2.6.0-rc0, and they're running a contest to find bugs. Bug "quality" is judged on severity, impact, and prevalence, and as long as you get your bug reports in by March 4th, you'll be up for some prizes.

Mark Needham02/28/14
5098 views
0 replies

Neo4j: Creating Nodes and Relationships From a List of Maps

Last week Alistair and the author were porting some Neo4j cypher queries from 1.8 to 2.0, and one of the queries they had to change was an interesting one that created a bunch of relationships from a list/array of maps.

Alec Noller02/28/14
4239 views
0 replies

Clarango: A Clojure Driver for ArangoDB

Anybody working with ArangoDB might be interested in Stefan Edlich's work-in-progress Clojure driver, Clarango. The current version is 0.3.0, and 1.0 is expected in late 2014, so obviously there is still a lot to be done, but according to the GitHub, the features list is already pretty interesting.

Alec Noller02/27/14
4043 views
0 replies

Where Are All the DBAs in NoSQL?

Earlier this month, Gartner released survey results that suggest that there aren't too many DBAs in the NoSQL space. But why would that be? Quite a few people have weighed in, blaming everything from stick-in-the-mud DBAs to the "cool guys" of DevOps.

Dharshan Rangegowda02/27/14
11829 views
0 replies

When to use GridFS on MongoDB

GridFS is a simple file system abstraction on top of MongoDB. If you are familiar with Amazon S3, GridFS is a very similar abstraction. Now why does a document oriented database like MongoDB provide a file layer abstraction? Turns out there are some very good reasons

Brian O' Neill02/27/14
3841 views
0 replies

Storm and Cassandra: A Three Year Retrospective

The authors had made the decision to go forward with Cassandra, but didn't see any bridge between Storm and Cassandra -- so they built one. By December 2011, they had made enough progress on Storm-Cassandra that it made it into the Cassandra Summit, and they started building out their first topologies.

Alec Noller02/26/14
9324 views
2 replies

MongoDB vs. PostgreSQL for NBA Data Crunching

There is a long-ish tradition of comparing things to MongoDB. You know, MongoDB vs. Oracle, and MongoDB vs. Cassandra, and MongoDB vs. Redis and CouchDB. Now, Dmitri Fontaine at tapoueh.org has provided a new comparison: MongoDB vs. PostgreSQL.

Leif Walsh02/26/14
1777 views
0 replies

What’s New in TokuMX 1.4, Part 4: Smaller, Faster Sharded Clusters

In the first part of this series, the author introduced a new feature, the ability to define the primary key for a collection. Today, you’ll see how we use it to reduce the disk footprint of sharded clusters.

Shane Johnson02/26/14
2905 views
0 replies

The NoSQL Kiss

We have to categorize everything, so we categorized NoSQL implementations. There are several categories, but I will focus on three: Distributed Caches, Key / Value Stores, and Document Databases. What if all three requirements must be met? Keep it simple, stupid.

Ayende Rahien02/25/14
3834 views
0 replies

Voron & Time Series Data: Getting Real Data Outputs

So far, we have just put the data in and out. And we have had a pretty good track record doing so. However, what do we do with the data now that we have it? As you can expect, we need to read it out. Usually by specific date ranges.

Andreas Kollegger02/25/14
4289 views
0 replies

The Neo4j 2.1.0 Milestone 1 Release: Import and Dense Nodes

On the data import side, Neo4j now supports CSV import directly in the Cypher query language. For large, densely-connected graphs, Neo4j has changed the way relationships are stored to make navigating densely-connected nodes much quicker for common cases.

Alec Noller02/24/14
7375 views
0 replies

MongoDB Aggregation: How to Work with 30 Years of NBA Data

If you've been waiting for the day when MongoDB and basketball would finally intersect, here is some good news: This recent post has crunched 30 years worth of NBA data with MongoDB aggregation.

Mark Needham02/24/14
2311 views
0 replies

Neo4j: Value in Relationships, but Value in Nodes Too!

The author has recently spent a bit of time working with people on their graph commons, and a common pattern he's come across is that although the models have lots of relationships, there are often missing nodes.

Alec Noller02/24/14
2638 views
0 replies

How to Deploy Cassandra on Mesos

Cassandra users looking to make their lives easier might benefit from using Cassandra on Apache Mesos. This recent post provides a tutorial on how to get started, arguing that the two technologies are a great fit for each other because of Cassandra's peer-to-peer architecture.

Alec Noller02/23/14
7023 views
0 replies

The Best of the Week (Feb. 14): NoSQL Zone

Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone! This week's best include debugging a failing unit-test which interacts with RavenDB, part two of a tutorial on building a recommendation engine in Neo4j, why Cassandra's plainness makes it better than MongoDB, and more!

Leif Walsh02/22/14
5075 views
0 replies

What’s New in TokuMX 1.4, Part 3: Optimized Updates

In this series of blog posts, the author describe the most interesting changes in TokuMX 1.4.0 and how they’ll affect users. Part 3 covers performance improvements that were achieved by making two big changes to how updates are implemented.

Max De Marzi02/21/14
3754 views
0 replies

Online Payment Risk Management with Neo4j

Finding relationships that should not be there is a great use case for Neo4j, and today the author wants to highlight an example of why: One of the hardest things for SQL based systems to do is cross-check the incoming payment information against existing data looking for relationships that shouldn’t be there.