DZone
Thanks for visiting DZone today,
Edit Profile
  • Manage Email Subscriptions
  • How to Post to DZone
  • Article Submission Guidelines
Sign Out View Profile
  • Post an Article
  • Manage My Drafts
Over 2 million developers have joined DZone.
Log In / Join
Refcards Trend Reports Events Over 2 million developers have joined DZone. Join Today! Thanks for visiting DZone today,
Edit Profile Manage Email Subscriptions Moderation Admin Console How to Post to DZone Article Submission Guidelines
View Profile
Sign Out
Refcards
Trend Reports
Events
Zones
Culture and Methodologies Agile Career Development Methodologies Team Management
Data Engineering AI/ML Big Data Data Databases IoT
Software Design and Architecture Cloud Architecture Containers Integration Microservices Performance Security
Coding Frameworks Java JavaScript Languages Tools
Testing, Deployment, and Maintenance Deployment DevOps and CI/CD Maintenance Monitoring and Observability Testing, Tools, and Frameworks
Culture and Methodologies
Agile Career Development Methodologies Team Management
Data Engineering
AI/ML Big Data Data Databases IoT
Software Design and Architecture
Cloud Architecture Containers Integration Microservices Performance Security
Coding
Frameworks Java JavaScript Languages Tools
Testing, Deployment, and Maintenance
Deployment DevOps and CI/CD Maintenance Monitoring and Observability Testing, Tools, and Frameworks
  1. DZone
  2. Data Engineering
  3. Databases
  4. Netflix Benchmarks on AWS Show Cassandra NoSQL Still Has the Goods

Netflix Benchmarks on AWS Show Cassandra NoSQL Still Has the Goods

Mitch Pronschinske user avatar by
Mitch Pronschinske
·
Nov. 06, 11 · News
Like (0)
Save
Tweet
Share
11.12K Views

Join the DZone community and get the full member experience.

Join For Free

A little more than a year ago, Apache Cassandra's reputation was untouchable.  It was blowing other NoSQL data stores out of the water in benchmarks and in our very own DZone popularity poll.  What else would you expect from the data solution that was originally designed to handle the data on Facebook.  How could it not be the top solution out there?

But last year, Cassandra's reputation seemed like it got a little tarnished by about its instability and difficult learning curve.  And then there were subsequent migrations which were induced by the emerging and the growing popularity of MongoDB.  What really hurt Cassandra was Twitter announcing that it would over to the NoSQL store.  It was still used to store geolocation data and data mining results that feed into things like local trends and @toptweets, but the damage was still done.

Fastforward to last month and we see the stability issues fade away as Apache Cassandra reaches a major milestone in version 1.0. And just this week there's been benchmarks done by Netflix which vindicate their 6-month migration to Cassandra. 

Adrian Cockroft authored an incredibly detailed blog post about the stess tests on Amazon EC2 instances:

To measure scalability, the same test was run with 48, 96, 144 and 288 instances, with 10, 20, 30 and 60 clients respectively. The load on each instance was very similar in all cases, and the throughput scaled linearly as we increased the number of instances. Our previous benchmarks and production roll-out had resulted in many application specific Cassandra clusters from 6 to 48 instances, so we were very happy to see linear scale to six times the size of our current largest deployment. This benchmark went from concept to final result in five days as a spare time activity alongside other work, using our standard production configuration of Cassandra 0.8.6, running in our test account. The time taken by EC2 to create 288 new instances was about 15 minutes out of our total of 66 minutes. The rest of the time was taken to boot Linux, start the Apache Tomcat JVM that runs our automation tooling, start the Cassandra JVM and join the "ring" that makes up the Cassandra data store. For a more typical 12 instance Cassandra cluster the same sequence takes 8 minutes. 

--Adrian Cockroft

The blog post is kindly divided into overview and TL;DR sections if you're not interested in the nitty-gritty performance details.  Cockroft even offered Netflix's infrastructure up for your testing curiosity... seriously:  "If you are the kind of performance geek that has read this far and wishes your current employer would let you spin up huge tests in minutes and open source the tools you build, perhaps you should give us a call..."

AWS NoSQL Data mining

Opinions expressed by DZone contributors are their own.

Popular on DZone

  • Data Mesh vs. Data Fabric: A Tale of Two New Data Paradigms
  • Problems of Cloud Cost Management: A Socio-Technical Analysis
  • Silver Bullet or False Panacea? 3 Questions for Data Contracts
  • Hackerman [Comic]

Comments

Partner Resources

X

ABOUT US

  • About DZone
  • Send feedback
  • Careers
  • Sitemap

ADVERTISE

  • Advertise with DZone

CONTRIBUTE ON DZONE

  • Article Submission Guidelines
  • Become a Contributor
  • Visit the Writers' Zone

LEGAL

  • Terms of Service
  • Privacy Policy

CONTACT US

  • 600 Park Offices Drive
  • Suite 300
  • Durham, NC 27709
  • support@dzone.com
  • +1 (919) 678-0300

Let's be friends: