If you've acquired some duplicates in MongoDB that you want to get rid of, this recent post provides a how-to on cleaning them up. The best option, obviously, is not to duplicate things in the first place - you're welcome - but the post is focused on solving the problem after the fact.
Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone! This week's best include a beginner's guide to MongoDB performance turbocharging, working with Node.js and N1QL, a look at an extremely configurable diff tool, and more.
MongoDB provides a number of constructs to improve the security of your data. Here are 10 tips you can use to improve the security of your MongoDB servers on premise and in the cloud.
One of the lesser-known things about the Neo4j online backup tool is that conceptually there are two parts to it: Copying the store files to a location of your choice, and verifying that those store files are consistent. Both of these run when you run the ‘neo4j-backup’ script, but it can be useful to run them separately.
When using Neo4j’s online backup facility there are two ways of triggering java.lang.ClassCastException, either by using the ‘single://‘ or ‘ha://‘ syntax, and these behave slightly differently. In this article, you'll learn how to handle these exceptions.
There are a lot of articles about neat hacks for scaling MongoDB, but neat hacks are rarely necessary. MongoDB is designed to scale. Most applications just need to get these five things right.
From time to time, the author gets down deep inside the NuoSQL compiler and gives it a good hard shake, which can make comparing the new compiler to the old compiler's results a challenge. He would like a diff tool that's extremely configurable and written in Java, so in this article, he explores the way diff works.
This is the second part of our MongoDB time series tutorial, and this post will be dedicated to performance tuning. In my previous post, I introduced you into our virtual project requirements. In short, we have 50M time events, spanning from the 1st of January 2012 to the 1st of January 2013.
Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone. This week's best include a comparison of various NoSQL databases, a discussion of the myths surrounding the open source movement, a demonstration of MongoDB's real-life usage, and more.
Recently, the author added support to their Node.js client for executing N1QL queries against your cluster. When he implemented it, he didn’t have very much to test against at the time, so decided to see how Node.js’s beer-sample example would look using N1QL queries, rather than using any views.
One of the things the author never worked out how to model in his football graph is a series of matches that allow him to answer questions like the following: How many goals has Robin Van Persie scored in his last 10 matches in the Barclays Premier League? In this article, the author explores some options.
There has been quite a lot of research on the notion of early lock release as a way to improve database concurrency. For a while, Voron actually supported that, mostly because the author thought that it was a good idea. Lately, however, he decided that it is anything but for modern databases.
A nice video about MongoDB schema design. To all who are new to MongoDB, MongoDB is a leading NoSQL document oriented database. A table in RDBMS can be contrasted to a Collection in MongoDB, and a row in RDBMS table corresponds to a JSON document in MongoDB.
There isn't a lot to say here, the author says, because the numbers speak for themselves. In this article, you'll find the current performance test of Voron vs. Esent on the author's machine, as well as details surrounding the comparison and some commentary on what it all means.
In previous posts, the author talked about batch importing and the out-of-the-box MongoDB performance. Recently, though, MongoDB was awarded DBMS of the year, so he therefore decided to offer a more thorough analysis of its real-life usage.
It’s all very tempting to gloat about the amount of money your company has raised in a funding round. Many commentators use the size of an investment to determine the likely value of the company. The author believes that approach is too simplistic, though.
People like to complain about MongoDB. The debate gets so heated, though, that sometimes valid criticisms - and nothing is above criticism - are dismissed as bandwagon hatred. It's a problem that Slava Kim seems very aware of in this recent blog post on some of the syntactic weirdnesses of MongoDB.
The Open Source movement is full of myths. There are different myths from inside the movement (i.e. those who think it's a good thing) and outside (i.e. those who do not). In this article, you'll find a few of those myths, as well as the author's interpretation and opinion of them.
One of the things that Voron does very well is the ability to read a lot of data fast. One of the interesting scenarios we deal with is when we want to deal with time series data. In this article, you'll see such a scenario.
Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone. This week's best include an analysis of Cassandra for ETL, a discussion of the validity of the idea that SQL and NoSQL are two sides of the same coin, the top 10 articles from 2013 in the NoSQL Zone, and more.
There's been a lot of comparison of SQL and NoSQL lately, but what is to be done if you're already sold on NoSQL and just don't know which database is for you, or your project in particular? To meet that need, check out this rather thorough comparison of NoSQL databases.
Johnathan is systems engineer on Mongo’s DevOps team and is helping to make MongoDB, the NoSQL non-relational database, more appealing to operations. The interview covers Mongo's recent round of funding, the difference between Mongo and traditional relational DBs, the future of MongoDB, and more.
I’ve recently been working mostly with HBase and Scalding, so picking up the new Akka Persistence APIs and implementing a plugin for HBase for it was a great idea for one afternoon or two.
The FreeDB data set is a 3.32 million records, containing most of the albums that came out in the past few decades. The author created the following Voron database to handle it, and in this article, you'll see how to implement and work with it.
The author has been playing around with Python’s timeit library to help benchmark some Neo4j cypher queries, but he ran into some problems when trying to give it accessible to variables in my program. In this post, he tracks and explores the issue in detail.