The Open Source movement is full of myths. There are different myths from inside the movement (i.e. those who think it's a good thing) and outside (i.e. those who do not). In this article, you'll find a few of those myths, as well as the author's interpretation and opinion of them.
One of the things that Voron does very well is read a lot of data fast. One interesting scenario is working with time series data. In this article, you'll see such a scenario.
Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone. This week's best include an analysis of Cassandra for ETL, a discussion of the validity of the idea that SQL and NoSQL are two sides of the same coin, the top 10 articles from 2013 in the NoSQL Zone, and more.
There's been a lot of comparison of SQL and NoSQL lately, but what is to be done if you're already sold on NoSQL and just don't know which database is for you, or your project in particular? To meet that need, check out this rather thorough comparison of NoSQL databases.
Johnathan is a systems engineer on Mongo’s DevOps team and is helping to make MongoDB, the non-relational NoSQL database, more appealing to operations. The interview covers Mongo's recent round of funding, the difference between Mongo and traditional relational DBs, the future of MongoDB, and more.
I’ve recently been working mostly with HBase and Scalding, so picking up the new Akka Persistence APIs and implementing an HBase plugin for them was a great idea for an afternoon or two.
The FreeDB data set contains 3.32 million records, covering most of the albums that came out in the past few decades. The author created a Voron database to handle it, and in this article, you'll see how to implement and work with it.
The author has been playing around with Python’s timeit library to help benchmark some Neo4j Cypher queries, but he ran into some problems when trying to give it access to variables in his program. In this post, he tracks and explores the issue in detail.
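As a rough illustration of the timeit variable-access problem (not the author's actual code or solution), the sketch below shows two common ways to let timeit see program variables: passing a callable that closes over them, or passing a statement string together with a `globals` mapping. The `run_query` function, the query text, and the parameters are hypothetical stand-ins for a real Cypher call.

```python
import timeit


def run_query(query, params):
    # Hypothetical stand-in for executing a Cypher query against Neo4j;
    # it only does a little string work so the example runs without a database.
    return query.replace("$name", repr(params["name"]))


query = "MATCH (n:Person) WHERE n.name = $name RETURN n"
params = {"name": "Alice"}

# Option 1: pass a callable. The lambda closes over `query` and `params`,
# so timeit does not need to resolve any names itself.
elapsed_callable = timeit.timeit(lambda: run_query(query, params), number=1000)

# Option 2: pass a statement string plus an explicit namespace.
# Without `globals`, the string would be evaluated in timeit's own namespace,
# where `run_query`, `query`, and `params` are undefined.
namespace = {"run_query": run_query, "query": query, "params": params}
elapsed_string = timeit.timeit(
    "run_query(query, params)", globals=namespace, number=1000
)

print(f"callable: {elapsed_callable:.6f}s, string: {elapsed_string:.6f}s")
```

The `globals` argument requires Python 3.5 or later; on older versions the callable form is usually the simpler choice.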
The author has been using Redis extensively, but one thing that has stuck out as problematic is isolating and segmenting parts of an application with Redis. In this article, you'll learn about SubRedis, a simple Python keyspace wrapper around Redis.
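To make the idea of a keyspace wrapper concrete, here is a minimal sketch that prefixes every key before handing it to an underlying store. It is not SubRedis's API: the `KeyspaceWrapper` and `InMemoryBackend` names are hypothetical, and the in-memory backend stands in for any client exposing `get`, `set`, and `delete`.

```python
class InMemoryBackend:
    """Hypothetical dict-backed stand-in for a Redis client."""

    def __init__(self):
        self.data = {}

    def set(self, key, value):
        self.data[key] = value
        return True

    def get(self, key):
        return self.data.get(key)

    def delete(self, key):
        # Return the number of keys removed (0 or 1).
        return 1 if self.data.pop(key, None) is not None else 0


class KeyspaceWrapper:
    """Hypothetical wrapper that isolates keys under a fixed prefix."""

    def __init__(self, backend, prefix):
        self.backend = backend
        self.prefix = prefix

    def _key(self, key):
        # Namespace the key so different wrappers cannot collide.
        return f"{self.prefix}:{key}"

    def set(self, key, value):
        return self.backend.set(self._key(key), value)

    def get(self, key):
        return self.backend.get(self._key(key))

    def delete(self, key):
        return self.backend.delete(self._key(key))


backend = InMemoryBackend()
sessions = KeyspaceWrapper(backend, "sessions")
cache = KeyspaceWrapper(backend, "cache")

sessions.set("42", "alice")
cache.set("42", "rendered-page")

print(sessions.get("42"))    # value stored under "sessions:42"
print(cache.get("42"))       # value stored under "cache:42"
print(sorted(backend.data))  # the raw, prefixed keys in the shared store
```

Both wrappers share one backend, yet the same logical key maps to different stored keys, which is the kind of isolation the blurb describes.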
In NuoDB's latest patch, v2.0.2, the focus remained on fortifying our 2.0 release. In this short development cycle there were nearly 100 feature improvements and fixes. In this article, you'll find out about a number of fixes, as well as a variety of new features in NuoDB v2.0.2.
According to this recent article, there are changes to come for NoSQL databases. The article looks at the origins of NoSQL - MongoDB in particular - and its reasons for existing, but the central argument is that there is an increasing and still-unmet need for performance to match the scalability of NoSQL databases.
Think about a social network; one of the things you may want to do is tell the user how big their friends network is. But why stop there? How about their friends of friends or even friends of friends of friends network? These are the kinds of questions graph databases excel at compared to relational databases.
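The friends-of-friends question amounts to counting distinct people within a fixed number of hops. The sketch below shows that logic as a plain breadth-first search over an in-memory adjacency list; it is only an illustration of the traversal, not how a graph database implements it, and the `network_size` function and sample data are made up for the example.

```python
from collections import deque


def network_size(graph, start, max_depth):
    """Count distinct people within max_depth hops of start, excluding start."""
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        person, depth = queue.popleft()
        if depth == max_depth:
            continue  # do not expand beyond the requested depth
        for friend in graph.get(person, ()):
            if friend not in seen:
                seen.add(friend)
                queue.append((friend, depth + 1))
    return len(seen) - 1


# Hypothetical sample data: each person maps to a list of direct friends.
friends = {
    "alice": ["bob", "carol"],
    "bob": ["alice", "dave"],
    "carol": ["alice", "dave", "erin"],
    "dave": ["bob", "carol", "frank"],
    "erin": ["carol"],
    "frank": ["dave"],
}

print(network_size(friends, "alice", 1))  # 2: friends
print(network_size(friends, "alice", 2))  # 4: plus friends of friends
print(network_size(friends, "alice", 3))  # 5: plus friends of friends of friends
```

In a relational schema, each extra hop typically means another self-join on the friendship table, which is where the comparison with graph databases comes from.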
Neo4j, as a graph database, features indexing as the preferred way to find start points for graph traversals. Over the years, multiple indexing approaches have been added. The goal of this article is to give an overview of them to avoid confusion, especially for those who have just gotten started with Neo4j.
In a recent debate about NoSQL vs. SQL on Hacker News, the author was made aware of an amusing paper by Erik Meijer and Gavin Bierman. Their claim is that SQL and NoSQL are duals of each other, i.e. two sides of the same coin.
Rather than the best of the week this week, let's see the most popular articles NoSQL had to offer in 2013. The year's best include 5 things for Java developers to consider, an aptly-titled critique of blogs that attack popular tech simply for the sake of the clicks, Martin Fowler's introduction to NoSQL, and much more.
A very common requirement with any database engine is to be able to store and query hierarchical data. RavenDB applications need to solve this kind of problem too, but a good solution often looks very different from what you'd expect.
According to this recent blog post, Cassandra is no good for ETL. That's not to say that Cassandra is not good at all - the author is a current Cassandra user and has good things to say about it - but the author takes issue with the perception of Cassandra as a do-all replacement for something like MySQL.
Neo4j’s powerful graph database can be used for analytics, recommendation engines, social graphs and many more applications. In the following example we demonstrate in a few steps how you can load Neo4j from your legacy relational SQL source.
Anyone can browse the database at MusicBrainz. With an account, you can contribute new data, fix existing records' details and track lengths, send in cover art scans of your favorite albums, etc. In this post (part 1) you will learn how to import the MusicBrainz data into Neo4j for some further analysis.
After publishing a small experiment with MongoDB, the author was challenged by the jOOQ team to match his results against Oracle. He will explore the specifics of that challenge in a later post, and in this one, he discusses a number of Small Data use-cases in which MongoDB was the right tool for the job.
Redis users might be interested in Lark, a new Python library designed to transform an HTTP request into a Redis command and provide a "RESTy" interface. Features include automatic JSON serialization and deserialization for Redis values, adapters for Flask and Django, and more.
What does “Scale” even mean in the context of databases? When talking about scaling, people have jumped to the vendor-induced conclusion that SQL doesn’t scale, while NoSQL scales. In this article, the author takes a look at database scalability by comparing Oracle benchmarks to MongoDB.
A new feature of MarkLogic 7's search API is range index scoring, which affects relevancy based on a value within a document. In this article, the author details a couple of use cases: one involving ratings, and one involving distance from the center point of a geospatial query.
In a couple of Neo4j talks, the author has been asked how long it takes to get used to modeling data in graphs and whether he felt it's simpler than alternative approaches. His experience closely mirrors what he believes is a fairly common curve when learning technologies that change the way you think.
The author was in the middle of upgrading a little test project to a newer version of Spring Data Neo4j and Neo4j itself when he came across a few little points that others might find useful. Here are a couple "gotchas" he encountered.