NoSQL Zone is brought to you in partnership with:
  • submit to reddit
Dharshan Rangegowda02/27/14
11574 views
0 replies

When to use GridFS on MongoDB

GridFS is a simple file system abstraction on top of MongoDB. If you are familiar with Amazon S3, GridFS is a very similar abstraction. Now why does a document oriented database like MongoDB provide a file layer abstraction? Turns out there are some very good reasons

Brian O' Neill02/27/14
3801 views
0 replies

Storm and Cassandra: A Three Year Retrospective

The authors had made the decision to go forward with Cassandra, but didn't see any bridge between Storm and Cassandra -- so they built one. By December 2011, they had made enough progress on Storm-Cassandra that it made it into the Cassandra Summit, and they started building out their first topologies.

Alec Noller02/26/14
9301 views
2 replies

MongoDB vs. PostgreSQL for NBA Data Crunching

There is a long-ish tradition of comparing things to MongoDB. You know, MongoDB vs. Oracle, and MongoDB vs. Cassandra, and MongoDB vs. Redis and CouchDB. Now, Dmitri Fontaine at tapoueh.org has provided a new comparison: MongoDB vs. PostgreSQL.

Leif Walsh02/26/14
1770 views
0 replies

What’s New in TokuMX 1.4, Part 4: Smaller, Faster Sharded Clusters

In the first part of this series, the author introduced a new feature, the ability to define the primary key for a collection. Today, you’ll see how we use it to reduce the disk footprint of sharded clusters.

Shane Johnson02/26/14
2892 views
0 replies

The NoSQL Kiss

We have to categorize everything, so we categorized NoSQL implementations. There are several categories, but I will focus on three: Distributed Caches, Key / Value Stores, and Document Databases. What if all three requirements must be met? Keep it simple, stupid.

Ayende Rahien02/25/14
3820 views
0 replies

Voron & Time Series Data: Getting Real Data Outputs

So far, we have just put the data in and out. And we have had a pretty good track record doing so. However, what do we do with the data now that we have it? As you can expect, we need to read it out. Usually by specific date ranges.

Andreas Kollegger02/25/14
4268 views
0 replies

The Neo4j 2.1.0 Milestone 1 Release: Import and Dense Nodes

On the data import side, Neo4j now supports CSV import directly in the Cypher query language. For large, densely-connected graphs, Neo4j has changed the way relationships are stored to make navigating densely-connected nodes much quicker for common cases.

Alec Noller02/24/14
7342 views
0 replies

MongoDB Aggregation: How to Work with 30 Years of NBA Data

If you've been waiting for the day when MongoDB and basketball would finally intersect, here is some good news: This recent post has crunched 30 years worth of NBA data with MongoDB aggregation.

Mark Needham02/24/14
2295 views
0 replies

Neo4j: Value in Relationships, but Value in Nodes Too!

The author has recently spent a bit of time working with people on their graph commons, and a common pattern he's come across is that although the models have lots of relationships, there are often missing nodes.

Alec Noller02/24/14
2611 views
0 replies

How to Deploy Cassandra on Mesos

Cassandra users looking to make their lives easier might benefit from using Cassandra on Apache Mesos. This recent post provides a tutorial on how to get started, arguing that the two technologies are a great fit for each other because of Cassandra's peer-to-peer architecture.

Alec Noller02/23/14
7005 views
0 replies

The Best of the Week (Feb. 14): NoSQL Zone

Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone! This week's best include debugging a failing unit-test which interacts with RavenDB, part two of a tutorial on building a recommendation engine in Neo4j, why Cassandra's plainness makes it better than MongoDB, and more!

Leif Walsh02/22/14
5046 views
0 replies

What’s New in TokuMX 1.4, Part 3: Optimized Updates

In this series of blog posts, the author describe the most interesting changes in TokuMX 1.4.0 and how they’ll affect users. Part 3 covers performance improvements that were achieved by making two big changes to how updates are implemented.

Max De Marzi02/21/14
3743 views
0 replies

Online Payment Risk Management with Neo4j

Finding relationships that should not be there is a great use case for Neo4j, and today the author wants to highlight an example of why: One of the hardest things for SQL based systems to do is cross-check the incoming payment information against existing data looking for relationships that shouldn’t be there.

Leif Walsh02/21/14
1696 views
0 replies

What’s New in TokuMX 1.4, Part 2: Partitioned oplog

In MongoDB, the replication oplog is a capped collection, with a fixed size on disk, and therefore the amount of history (measured in days) varies as the application makes changes faster or slower. In TokuMX, capped collections are handled differently.

Chris Chang02/20/14
4164 views
0 replies

Finding and Terminating Long-Running Operations in MongoDB

When your MongoDB becomes unresponsive, it’s imperative that you can quickly identify the cause. Although there can be many reasons for unresponsiveness, we sometimes find that particularly long-running and/or blocking operations (either initiated by a human or an application) are the culprit.

Ayende Rahien02/20/14
2146 views
0 replies

Voron and the FreeDB Dataset

The author got tired of doing arbitrary performance testing, so he decided to work with the FreeDB dataset, a dataset used to look up CD information. It contains a lot of data, and it is production data. That means that it is dirty, which makes it perfect to run all sort of interesting scenarios.

Leif Walsh02/20/14
1974 views
0 replies

What’s New in TokuMX 1.4, Part 1: Primary Keys

Tokutek just released version 1.4.0 of TokuMX, our high-performance distribution of MongoDB. There are a lot of improvements in this version (release notes), the most of any release yet. In this series of blog posts, the authors describe the most interesting changes and how they’ll affect users.

Alec Noller02/19/14
23774 views
0 replies

Cassandra vs. MongoDB: Why Your Database Should be Boring

This recent article presents a comparison of Cassandra and MongoDB, although it's pretty clearly weighted in one direction. Yes, saying bad things about MongoDB is nothing new, but this article is interesting in that the anti-MongoDB framing is really not the core of his point. Ultimately, this is all about Cassandra.

Don Pinto02/19/14
2707 views
0 replies

Announcing Couchbase Server 2.5

Last week, Couchbase Server 2.5 Enterprise Edition was released. In this article, you'll find an overview of the key highlights of the release, including Rack-Zone Awareness, secure cross datacenter replication (XDCR), optimized connection management, and more.

Alec Noller02/18/14
3600 views
2 replies

How to Use Relational Databases and SQL to Understand MongoDB

Developers familiar with relational databases and SQL but interested in moving to MongoDB might find this recent article to be helpful. It explores the structure and use of MongoDB in terms of relational databases and SQL, making comparisons in order to draw on existing knowledge.

Mark Needham02/18/14
2695 views
0 replies

Exceptions in Jython and Neo4j: java.lang.ExceptionInInitializerError

The author has been playing around with calling Neo4j’s Java API from Python via Jython and immediately ran into the following exception when trying to create an embedded instance: java.lang.ExceptionInInitializerError

A. Jesse Jiryu Davis02/18/14
2204 views
0 replies

Announcing PyMongo 2.7 Release Candidate

A few days ago, Bernie Hackett and the author shipped a release candidate for PyMongo 2.7, with substantial contributions from Amalia Hawkins and Kyle Erf. This version supports new features in the upcoming MongoDB 2.6, and includes major internal improvements in the driver code.

Brian O' Neill02/17/14
3681 views
0 replies

Determining if a Conditional Update was Applied with CQL Java-Driver

One of the critical aspects to using conditional updates is determining whether the update was applied. In this article, you'll learn how to do it.

Stefan Armbruster02/17/14
2224 views
0 replies

A Quick Tooling Tip for Hacking Cypher Statements on Linux

A lot of people love the new Neo4j browser shipped with 2.0 and subsequent releases. This is a nicely-built locally-running web application running in your browser. At the top, users can easily type their Cypher code and see results after executing, either in tabular form or as a visualization enabling to click through.

Alec Noller02/16/14
6919 views
0 replies

The Best of the Week (Feb. 7): NoSQL Zone

Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone. This week's best include tips on how to tune the JVM to improve performance in Cassandra, VoltDB's embrace of Groovy, part 1 of a series on building a recommendation engine with Neo4j, and more!