Back to Browse

Inside Uber's Large-Scale Real-Time Analytics Platform | Current 2025

1.2K views
Jun 9, 2025
46:38

In this talk, we cover the matured architecture for realtime analytics ecosystem powering Uber’s usecases that serve up to 10s of thousands of queries/sec, several million writes/sec and host up to tens of Petabytes of Pinot datasets. We also cover two critical business and observability usecase. 1. Realtime processing and ingestion using AthenaX(SQL based transformation on Apache Flink®), Flink and Apache Kafka® to provide analytics on realtime data. 2. Realtime Analytics powered by Apache Pinot to serve analytics at high QPS with sub-second latency 3. Disaster resiliency and disaster recovery strategies for Apache Pinot datasets. The talk covers Uber’s two usecases that solve realtime analytics challenges for business and observability: - Use case 1: Business usecase(rides/eats related) - Use case 2: Observability usecase (metrics/logs related) The audience will gain practical insights into designing real-time analytics systems centered around Apache Pinot and effectively leveraging complementary real-time technologies to build robust and high-performing solutions. At Uber, the EVA platform that drives substantial advancements in our real-time analytics capabilities, empowering various business use cases across marketing, engineering, data science, and operations and internal use cases around metrics, logs & query analytics. The platform features Apache Kafka for realtime data transport, Apache Flink for stream processing, Spark for batch processing, HDFS for deep storage needs, and Apache Pinot as the core analytics engine. Additionally, it features internal service Neutrion for Presto-like queries on Pinot and metadata service for dataset management. – CONNECT Subscribe, if you dare: https://www.youtube.com/@ConfluentDeveloper?sub_confirmation=1 Community Slack: https://confluentcommunity.slack.com X: https://x.com/confluentinc Linkedin: https://www.linkedin.com/company/confluent GitHub: https://github.com/confluentinc Site: https://developer.confluent.io ABOUT CONFLUENT DEVELOPER Confluent Developer provides comprehensive resources for developers looking to learn about Apache Kafka®, Apache Flink®, Confluent Cloud, Confluent Platform, and any other technology related to the broader Data Streaming Platform. Content on Confluent Developer includes courses, getting started guides, topical deep-dives, patterns, tutorials, and listings of community events. Learn more at https://developer.confluent.io. #current2025 #apachekafka #apacheflink #confluent

Download

0 formats

No download links available.

Inside Uber's Large-Scale Real-Time Analytics Platform | Current 2025 | NatokHD