Why Cassandra is No Good for ETL


According to a recent blog post from Aras Can Akin, Cassandra is no good for ETL. That's not to say that Cassandra is no good at all: Akin is a current Cassandra user and has good things to say about it, but he takes issue with the perception of Cassandra as a do-all replacement for something like MySQL. He says:

. . . Cassandra IS NOT the mysql replacement. Tech people need to know that. It’s a fantastic distributed key-value store, but currently, it’s nothing more than that. However, Cassandra developers keeps saying that it’s designed for time-series data or it’s good at ETL. However, it isn’t. It is only a scalable distributed key-value store, nothing more.

Most of the post is a case study of Akin's own experience migrating from MySQL to Cassandra, and of the problems and workarounds that came up along the way. Ultimately, Akin concludes that the workarounds are not solutions he is happy with.

What do you think? Is Cassandra anything more than a distributed key-value store? Is there misinformation out there as to its strengths?


Opinions expressed by DZone contributors are their own.

