How Does Twitter Use Scribe, Hadoop/Pig, HBase, Cassandra, and FlockDB for Data Analytics?
About the presentation
For those who are not aware of the said technologies…
Scribe – Log collection framework over Thrift, built and open sourced by Facebook
Hadoop – A software framework that supports data-intensive distributed applications from Apache
Pig – A platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs.
Cassandra – An open source distributed database management system
HBase – An open source, non-relational, distributed database modeled after Google’s BigTable and is written in Java
FlockDB – An open source distributed, fault-tolerant graph database for managing data at webscale