Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

MongoDB Finds A Major Adopter In Craigslist

DZone's Guide to

MongoDB Finds A Major Adopter In Craigslist

· Java Zone ·
Free Resource

Verify, standardize, and correct the Big 4 + more– name, email, phone and global addresses – try our Data Quality APIs now at Melissa Developer Portal!

 MongoDB recently gained another adopter (possibly its largest yet).

The NoSQL data store is now being used to archive billions of records at Craigslist, the popular classifieds and job posting community that serves 570 cities in 50 countries.

Every post in the history of the site was previously held in a large MySQL cluster. Since Craigslist had a variety of database needs moving forward, ranging from wanting to add new machines without downtime to routing around dead machines without clients failing, the development team decided to initiate a major migration to a NoSQL solution. Mongo DB was the solution they chose.



Here are some basic numbers about the Craigslist MongoDB cluster from Jeremy Zawodny, one of the site's software engineers:

We’re sizing the install for around 5 billion documents. That’s from the initial 2 billion document import we need to do plus room to grow for a few years to come. Average document size is right around 2KB. (Five billion 2KB documents is 10TB of data.) We’re getting our feet wet with MongoDB so this particular task isn’t high throughput or growing in unpredictable ways.

We can put data into MongoDB faster than we can get it out of MySQL during the migration.


Watch a video where Zawodny explains the evolution of data storage at Craigslist and how MongoDB will fit into the future of the site's infrastructure. You'll also find out why Craigslist chose MongoDB over other data stores.

Developers! Quickly and easily gain access to the tools and information you need! Explore, test and combine our data quality APIs at Melissa Developer Portal – home to tools that save time and boost revenue. Our APIs verify, standardize, and correct the Big 4 + more – name, email, phone and global addresses – to ensure accurate delivery, prevent blacklisting and identify risks in real-time.

Topics:

Published at DZone with permission of

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}