OrientDB recently hit version 1.5. The open source, document graph database adds new features such as a new paginated local storage engine, a new and more performant Hash Index type, fetch plan support in SQL, and more.
Crowd Riff is kind of like a Klout competitor with a focus on customer and community analysis for businesses. Seems like a perfect situation for a graph databse...
Patrick McFadin wrote a good post recently on PlanetCassandra.org that covers three patterns for user profile data models in Cassandra.
Riak Pipe is most simply described as “UNIX pipes for Riak.” In much the same way you would pipe the output of one program to another on the command line, Riak Pipe allows you to pipe the output of a function on one vnode to the input of a function on another.
In the new data for this month's DB-Engines.com DBMS rankings, MongoDB has surpassed Microsoft Access in popularity and is only slightly behind IBM DB2 with a rank of 6 overall. It's also the most popular NoSQL data store.
If you're entirely new to MongoDB - this is just a very gentle introduction that gets you up and running with a sample database in just a few easy steps.
This is a MongoDB tutorial on using MongoDB in PHP which I gave as a webcast for O’Reilly last year. The tutorial covers everything from Installing MongoDB along with installing and configuring the MongoDB PHP driver.
Often considered one of the best NoSQL databases for using alongside Hadoop, HBase had it's own conference this year with several very interesting sessions. One of these sessions covered the compaction improvements in HBase.
There's an in-depth post the Datastax about doing lightweight transactions in Cassandra. It's an interesting read for those looking into the tradeoffs between availability and consistency in their database system.
In this tutorial, I am going to use the Spray-Client, DataStacks Cassandra driver and Akka to build an application that downloads tweets and then stores their id, text, name and date in a Cassandra table
As a followup to his "Fun with Music, Neo4j and Talend" post a few weeks ago, Rik Van Bruggen also posted a gist of his Last.fm dataset.
Need an introduction to graphs, graph databases, and a NoSQL graph database like neo4j along with a graph query language – Cypher?
The LMDB codebase is a very dense piece of code, but at the same time, it is also quite interesting. In particular, B-Trees are pretty much The Answer for a lot of on disk data structures, but they tend to be quite hard to handle properly.
Tungsten Replicator is a software package that allows replication to be established between MySQL and another database product. This blog post describes how to configure replication between MySQL and NuoDB.
Batch importing and processing is getting more and more popular it seems. This video, featuring DZone MVB Mark Needham, gives a brief tutorial on how to use the neo4j batch importer - a tool used to import large data sets.
When I was helping prepare for the CFSummit conference, we organized the sessions on Trello. The MongoDB aggregation framework is a relatively new addition to the platform. Using this framework, you can group, sort, calculate and handle information gathering in the aggregate sense. Here's how I did this for the Trello JSON data.
Flyclops, a game company, recently decided that MySQL wasn't cutting it for them anymore. They tried out a bunch of different databases including CouchDB, MongoDB, Cassandra, HBase, Neo4j, DynamoDB, PostgreSQL, and a bunch more. What they finally settled on was...
If you're looking to learn some real-world usage scenarios for all four of these technologies all in one sweep, then look no further.
Redis 2.8 hit release candidate 2 status this week with some of the major bugs finally getting fixed. The new release should be a welcome change given Redis' recent, and very public, incident with Twilio's billing system.
If you've tried Riak and didn't like it, or you're about to start using it, you'll want to take a look at this video to see if you were using the best practices when working with this database.
J Brisbin's work with NoSQL datastores over the last couple of years has given me some insight into the direction applications will inevitably take as NoSQL becomes the dominant data storage and retrieval method—at least for web and cloud-based applications.
A relatively new NoSQL data store on the scene is Aerospike. They're focused on pushing the limits of SSD/Flash in-memory data processing and providing "Storm speed" (perhaps he's referring to Twitter Storm?) which is about 10 million messages a second.
This article will show you how core MongoDB operations are made using the MongoDB Java Driver version 2.11.1.
Although it's not yet in a code release, Cloudant and the rest of the Apache CouchDB community has finally finished the merging of BigCouch, a HA, fault-tolerant, clustered version of CouchDB, with the primary CouchDB Apache project. The code is now in the testing phase, which you can help with.
Over the past few years I’ve seen the emergence of polyglot persistence i.e. using different data storage technologies for different data and in most situations we work that out up front. The main down side to this approach is that we now have to keep two data sources in sync but it’s interesting to think about whether this trade off is worthwhile...