Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows.
This presentation takes a deep-dive approach and discusses performance tuning tips for Apache Flume.
For a high-level overview of Apache Flume and other Apache Hadoop-related technologies, check out the following two-part series on DZone:
Part 1 https://dzone.com/articles/techtalk-apache-hadoop-and-related-technologies-fo
Part 2 https://dzone.com/articles/the-hadoop-ecosystem-in-30-minutes-part-2