Real-Time Analytics at UBER Scale [Video]
In this video, technical lead on real-time data infrastructure at Uber shares how Uber supports millions of analytical queries daily across real-time data.
Join the DZone community and get the full member experience.Join For Free
At Strata+Hadoop World, James Burkhart, technical lead on real-time data infrastructure at Uber, shared how Uber supports millions of analytical queries daily across real-time data with Apollo, Uber’s internal analytics querying language.
James covers architectural decisions and lessons learned from building an exactly-once ingest pipeline that captures raw events across in-memory row storage and on-disk columnar storage. He also details how Uber uses a custom metalanguage and query layer, leveraging partial OLAP result, set caching, and query canonicalization. Putting all the pieces together provides thousands of Uber employees with subsecond p95 latency analytical queries spanning hundreds of millions of recent events.
Published at DZone with permission of Mason Hooten, DZone MVB. See the original article here.
Opinions expressed by DZone contributors are their own.