NoSQL Zone is brought to you in partnership with:
  • submit to reddit
Alec Noller01/27/14
0 replies

Quickly Removing Duplicates from MongoDB

If you've acquired some duplicates in MongoDB that you want to get rid of, this recent post provides a how-to on cleaning them up. The best option, obviously, is not to duplicate things in the first place - you're welcome - but the post is focused on solving the problem after the fact.

Alec Noller01/26/14
0 replies

The Best of the Week (Jan 17): NoSQL Zone

Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone! This week's best include a beginner's guide to MongoDB performance turbocharging, working with Node.js and N1QL, a look at an extremely configurable diff tool, and more.

Dharshan Rangegowda01/24/14
0 replies

10 Tips to Improve MongoDB Security

MongoDB provides a number of constructs to improve the security of your data. Here are 10 tips you can use to improve the security of your MongoDB servers on premise and in the cloud.

Mark Needham01/24/14
0 replies

Neo4j Backup: Store Copy and Consistency Check

One of the lesser-known things about the Neo4j online backup tool is that conceptually there are two parts to it: Copying the store files to a location of your choice, and verifying that those store files are consistent. Both of these run when you run the ‘neo4j-backup’ script, but it can be useful to run them separately.

Mark Needham01/23/14
0 replies

Neo4j Backup: Dealing with java.lang.ClassCastException

When using Neo4j’s online backup facility there are two ways of triggering java.lang.ClassCastException, either by using the ‘single://‘ or ‘ha://‘ syntax, and these behave slightly differently. In this article, you'll learn how to handle these exceptions.

A. Jesse Jiryu Davis01/22/14
0 replies

Five Things About Scaling MongoDB

There are a lot of articles about neat hacks for scaling MongoDB, but neat hacks are rarely necessary. MongoDB is designed to scale. Most applications just need to get these five things right.

Tom Harwood01/21/14
0 replies

Programmable Diff

From time to time, the author gets down deep inside the NuoSQL compiler and gives it a good hard shake, which can make comparing the new compiler to the old compiler's results a challenge. He would like a diff tool that's extremely configurable and written in Java, so in this article, he explores the way diff works.

Vlad Mihalcea01/20/14
0 replies

A Beginner’s Guide to MongoDB Performance Turbocharging

This is the second part of our MongoDB time series tutorial, and this post will be dedicated to performance tuning. In my previous post, I introduced you into our virtual project requirements. In short, we have 50M time events, spanning from the 1st of January 2012 to the 1st of January 2013.

Alec Noller01/19/14
0 replies

The Best of the Week (Jan. 10): NoSQL Zone

Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone. This week's best include a comparison of various NoSQL databases, a discussion of the myths surrounding the open source movement, a demonstration of MongoDB's real-life usage, and more.

Don Pinto01/17/14
0 replies

Node.js and N1QL

Recently, the author added support to their Node.js client for executing N1QL queries against your cluster. When he implemented it, he didn’t have very much to test against at the time, so decided to see how Node.js’s beer-sample example would look using N1QL queries, rather than using any views.

Mark Needham01/16/14
0 replies

Neo4j: Modeling ‘Series’ of Events

One of the things the author never worked out how to model in his football graph is a series of matches that allow him to answer questions like the following: How many goals has Robin Van Persie scored in his last 10 matches in the Barclays Premier League? In this article, the author explores some options.

Ayende Rahien01/16/14
0 replies

Early Lock Release: Transactions and Errors

There has been quite a lot of research on the notion of early lock release as a way to improve database concurrency. For a while, Voron actually supported that, mostly because the author thought that it was a good idea. Lately, however, he decided that it is anything but for modern databases.

Rajaraman Raghuraman01/15/14
0 replies

MongoDB Schema Design: How to Think Non-Relational

A nice video about MongoDB schema design. To all who are new to MongoDB, MongoDB is a leading NoSQL document oriented database. A table in RDBMS can be contrasted to a Collection in MongoDB, and a row in RDBMS table corresponds to a JSON document in MongoDB.

Ayende Rahien01/15/14
0 replies

Performance Testing with Voron OMG

There isn't a lot to say here, the author says, because the numbers speak for themselves. In this article, you'll find the current performance test of Voron vs. Esent on the author's machine, as well as details surrounding the comparison and some commentary on what it all means.

Vlad Mihalcea01/14/14
0 replies

MongoDB Time Series: Introducing the Aggregation Framework

In previous posts, the author talked about batch importing and the out-of-the-box MongoDB performance. Recently, though, MongoDB was awarded DBMS of the year, so he therefore decided to offer a more thorough analysis of its real-life usage.

Adam Fowler01/14/14
0 replies

How to Interpret NoSQL Funding Rounds

It’s all very tempting to gloat about the amount of money your company has raised in a funding round. Many commentators use the size of an investment to determine the likely value of the company. The author believes that approach is too simplistic, though.

Alec Noller01/13/14
0 replies

MongoDB: 5 Syntactic Weirdnesses to Keep in Mind

People like to complain about MongoDB. The debate gets so heated, though, that sometimes valid criticisms - and nothing is above criticism - are dismissed as bandwagon hatred. It's a problem that Slava Kim seems very aware of in this recent blog post on some of the syntactic weirdnesses of MongoDB.

Anders Karlsson01/13/14
0 replies

Some Myths of Open Source, the Way I See it

The Open Source movement is full of myths. There are different myths from inside the movement (i.e. those who think it's a good thing) and outside (i.e. those who do not). In this article, you'll find a few of those myths, as well as the author's interpretation and opinion of them.

Ayende Rahien01/13/14
0 replies

Voron and Time Series Data

One of the things that Voron does very well is the ability to read a lot of data fast. One of the interesting scenarios we deal with is when we want to deal with time series data. In this article, you'll see such a scenario.

Alec Noller01/12/14
0 replies

The Best of the Week (Jan. 3): NoSQL Zone

Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone. This week's best include an analysis of Cassandra for ETL, a discussion of the validity of the idea that SQL and NoSQL are two sides of the same coin, the top 10 articles from 2013 in the NoSQL Zone, and more.

Alec Noller01/10/14
0 replies

NoSQL Databases Compared: Differences, Strengths, and Examples

There's been a lot of comparison of SQL and NoSQL lately, but what is to be done if you're already sold on NoSQL and just don't know which database is for you, or your project in particular? To meet that need, check out this rather thorough comparison of NoSQL databases.

Barton George01/10/14
0 replies

An Interview with Jonathan Reams of MongoDB from DevOps Days NYC

Johnathan is systems engineer on Mongo’s DevOps team and is helping to make MongoDB, the NoSQL non-relational database, more appealing to operations. The interview covers Mongo's recent round of funding, the difference between Mongo and traditional relational DBs, the future of MongoDB, and more.

Konrad Malawski01/09/14
0 replies

Async Akka-Persistence-HBase Journal

I’ve recently been working mostly with HBase and Scalding, so picking up the new Akka Persistence APIs and implementing a plugin for HBase for it was a great idea for one afternoon or two.

Ayende Rahien01/09/14
0 replies

Working with the FreeDB Dataset in Voron

The FreeDB data set is a 3.32 million records, containing most of the albums that came out in the past few decades. The author created the following Voron database to handle it, and in this article, you'll see how to implement and work with it.

Mark Needham01/09/14
0 replies

Python: Scoping Variables to Use with Timeit

The author has been playing around with Python’s timeit library to help benchmark some Neo4j cypher queries, but he ran into some problems when trying to give it accessible to variables in my program. In this post, he tracks and explores the issue in detail.