Who Is Using Cassandra?
Apache Cassandra is a high-performance, extremely scalable, fault-tolerant (i.e., no single point of failure), distributed non-relational database solution. Cassandra provides benefits similar to Google Bigtable and Amazon Dynamo to handle the types of database management needs that traditional RDBMS vendors cannot support.
Cassandra is in use at Apple (75,000+ nodes), Spotify (3,000+ nodes), eBay, Capital One, Macy's, Bank of America, Netflix, Twitter, Urban Airship, Constant Contact, Reddit, Cisco, OpenX, Rackspace, Ooyala, and more companies that have large active data sets. The largest known Cassandra cluster has more than 300 TB of data across more than 400 machines (cassandra.apache.org).
RDBMS vs. Cassandra
|
Cassandra |
RDBMS |
Atomicity |
Success or failure for inserts/deletes in a single partition (one or more rows in a single partition). |
Enforced at every scope, at the cost of performance and scalability. |
Sharding |
Native share-nothing architecture, inherently partitioned by a configurable strategy. |
Often forced when scaling, partitioned by key or function. |
Consistency |
No tunable consistency in the ACID sense. Can be tuned to provide more consistency or to provide more availability. The consistency is configured per request. Since Cassandra is a distributed database, traditional locking and transactions are not possible. |
Favors consistency over availability tunable via isolation levels. |
Durability |
Writes are durable to a replica node being recorded in memory and the commit log before acknowledged. In the event of a crash, the commit log replays on restart to recover any lost writes before data is flushed to a disk. |
Typically, data is written to a single master node, sometimes configured with synchronous replication at the cost of performance and cumbersome data restoration. |
Multi-Datacenter Replication |
Native and out-of-the-box capabilities for data replication over lower bandwidth, higher latency, and less reliable connections. |
Typically, only limited long-distance replication to read-only followers receiving asynchronous updates. |
Security |
Coarse-grained and primitive, but authorization, authentication, roles, and data encryption are provided out of the box. |
Fine-grained access control to objects. |
{{ parent.title || parent.header.title}}
{{ parent.tldr }}
{{ parent.linkDescription }}
{{ parent.urlSource.name }}