I’ve recently been working mostly with HBase and Scalding, so picking up the new Akka Persistence APIs and implementing an HBase plugin for them was a great idea for an afternoon or two.
The FreeDB data set contains 3.32 million records, covering most of the albums that came out in the past few decades. The author created the following Voron database to handle it, and in this article, you'll see how to implement and work with it.
The author has been playing around with Python’s timeit library to help benchmark some Neo4j Cypher queries, but he ran into some problems when trying to give it access to variables in his program. In this post, he tracks down and explores the issue in detail.
The author has been using Redis extensively, but one thing that has stuck out as problematic is isolating and segmenting parts of an application with Redis. In this article, you'll learn about SubRedis, a simple Python keyspace wrapper around Redis.
In NuoDB's latest patch, v2.0.2, the focus remained on fortifying our 2.0 release. In this short development cycle there were nearly 100 feature improvements and fixes. In this article, you'll find out about a number of fixes, as well as a variety of new features in NuoDB v2.0.2.
According to this recent article, there are changes to come for NoSQL databases. The article looks at the origins of NoSQL - MongoDB in particular - and its reasons for existing, but the central argument is that there is an increasing and still-unmet need for performance to match the scalability of NoSQL databases.
Think about a social network; one of the things you may want to do is tell the user how big their friends network is. But why stop there? How about their friends of friends or even friends of friends of friends network? These are the kinds of questions graph databases excel at compared to relational databases.
Neo4j, as a graph database, features indexing as the preferred way to find start points for graph traversals. Over the years, multiple indexing approaches have been added. The goal of this article is to give an overview of them to avoid confusion, especially for those who have only recently gotten started with Neo4j.
In a recent debate about NoSQL vs. SQL on Hacker News, the author was made aware of an amusing paper by Erik Meijer and Gavin Bierman. In it, they claim that SQL and NoSQL are duals of each other, i.e. two sides of the same coin.
Rather than the best of the week this week, let's see the most popular articles NoSQL had to offer in 2013. The year's best include 5 things for Java developers to consider, an aptly-titled critique of blogs that attack popular tech simply for the sake of the clicks, Martin Fowler's introduction to NoSQL, and much more.
A very common requirement with any database engine is to be able to store and query hierarchical data. While RavenDB applications need to solve this kind of problem, a good solution oftentimes looks very different from what you'd expect.
According to this recent blog post, Cassandra is no good for ETL. That's not to say that Cassandra is not good at all - the author is a current Cassandra user and has good things to say about it - but the author takes issue with the perception of Cassandra as a do-all replacement for something like MySQL.
Neo4j’s powerful graph database can be used for analytics, recommendation engines, social graphs and many more applications. In the following example we demonstrate in a few steps how you can load Neo4j from your legacy relational SQL source.
Anyone can browse the database at MusicBrainz. With an account, you can contribute new data, fix existing records' details and track lengths, send in cover art scans of your favorite albums, etc. In this post (part 1) you will learn how to import the MusicBrainz data into Neo4j for some further analysis.
After publishing a small experiment with MongoDB, the author was challenged by the jOOQ team to match his results against Oracle. He will explore the specifics of that challenge in a later post, and in this one, he discusses a number of Small Data use-cases in which MongoDB was the right tool for the job.
Redis users might be interested in Lark, a new Python library designed to transform an HTTP request into a Redis command and provide a "RESTy" interface. Features include automatic JSON serialization and deserialization for Redis values, adapters for Flask and Django, and more.
What does “Scale” even mean in the context of databases? When talking about scaling, people have jumped to the vendor-induced conclusion that SQL doesn’t scale, while NoSQL scales. In this article, the author takes a look at database scalability by comparing Oracle benchmarks to MongoDB.
A new feature of MarkLogic 7's search API is range index scoring – affecting relevancy based on a value within a document. In this article, the author details a couple of use cases: one involving ratings, and one involving distance from the center point of a geospatial query.
In a couple of Neo4j talks, the author has been asked how long it takes to get used to modeling data in graphs and whether he felt it's simpler than alternative approaches. His experience closely mirrors what he believes is a fairly common curve when learning technologies that change the way you think.
The author was in the middle of upgrading a little test project to a newer version of Spring Data Neo4j and Neo4j itself when he came across a few little points that others might find useful. Here are a couple "gotchas" he encountered.
One graph database vendor decided to divide the graph database space into non-native (i.e. square) and native (i.e. diamond) graph databases. Obviously, non-native is boring, or slow, or simply bad, and native is exciting, or fast, or simply good. Problem is: There is no such thing as a native graph database.
There isn’t much information out there about using MongoDB with SSL connections. If you are using MongoDB on a public network, all the data you transmit from the database to your application is completely unencrypted. Luckily, however, MongoDB offers the option to be compiled with SSL support.
One of the additional features that Neo4j Enterprise provides is access to JMX properties that describe various aspects of the database.
In MongoDB query analysis, you can use the built-in queries (after all, the profiling is saved in a MongoDB collection). However, Dex, a tool from MongoLab, can help you shorten the time to index...
Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone. This week's best include a discussion of the relationship between relational databases and their predecessors, thoughts about compression and storage in LevelDB and LMDB, and an argument that SQL is the new NoNoSQL.