A great intro to GridFS, a convention and API (not something native to your server), and how you can use it from Python
If you fail to indicate the primary key to sqoop, the following exception will be thrown; a similar issue arises without properly starting Cassandra.
In case you did not pay attention to the area of large scale distributed computing – there is a revolution going on! It is becoming increasingly evident that the software ecosystems built around so called Big Data are at the forefront of cloud computing innovation.
Brian leads design and development of a Master Data Management (MDM) solution for the Health Market industry. The solution runs on a Big Data platform powered by Cassandra. He recently contributed a RefCard on Cassandra to DZone.
In response to the following tweet: "What's your take on the filure to get Umbraco CMS v5 performing on top of NHibernate? The project folded today . . ."
Now we can use graph data in a business intelligence / business discovery solution like QlikView to do some more business related analytics.
Thinking about this problem imperatively, these nodes seem to belong together, but this changes when considering the problem functionally.
This time, garudad takes a look at MongoDB's built-in map-reduce functionality to solve a specific Chemoinformatics problem.
This post uses the above tools in order to page through the ATOM feed; includes all code.
In this article, I try to illustrate my point, by employing NoSQL, and more specifically MongoDB, to solve a specific Chemoinformatics problem in a truly elegant and efficient way.
The Neo4j 1.8 Milestone 4 updates capabilities of the core Traversal Framework as well as offering new ways to use paths in Cypher.
How nice it would be if one could transparently create/maintain relational record sets for document writes.
Including all the necessary code, just open solrconfig.xml and declare your ValueSourceParser . . .
After working with MongoDB and .NET, this user wanted to give Riak a try.
Learn how to integrate MongoDB with Hadoop for large-scale distributed
data processing. Using Hadoop’s MapReduce and Streaming you will learn
how to do analytics and ETL on large datasets with the ability to load
and save data against MongoDB. With...
RavenDB is a 2nd generation
document database for the .NET platform. It has been used in
production since 2010 and is probably the nicest database that you’ll
get to meet, even if
this is said by its author.
I have written articles before...
Entrance into the realm of NoSQL databases, specifically MongoDB, requires that you give up a sense of structure. SourceForge's Ming helps deal with this problem for Python users.
The GraphConnect conference, sponsored by Neotechnology, celebrates everything graphy.
I already said that my first research results have been submitted to SIGMOD conference to the social networks and graph databases track. Time to sum up the results and blog about them. you can find a demo of the system here
From Stockholm to Munich to Philly, RavenDB training is being offered on both sides of the pond.
How do they use Scribe, Hadoop / Pig, HBase, Cassandra, and FlockDB for data analytics?
A great post that crosses a few hot topics in the realms of databases and cloud computing. Check out how MongoDB, Windows Azure, and Node.js can be realated.
This writer describes how objects' circular references in MongoDB could lead to crashing your process.
From an OSCON data workshop in 2011, this slideshow offers plenty of useful resources for getting to know CouchDB
Feel free to sign up here for MongoDB and Python tutorial training classes with a former SourceForge employee.