Driving innovation at scale

Powering 41 million daily connections between the digital and physical worlds through a real-time marketplace built for global reliability.

NEWS Agentic Pods: embedding engineers to cut capital allocation from 15 hours to 30 minutes // OSS zap — 24.3k stars: Blazing fast structured logging for Go // HIRING Senior ML Engineer, AV Foundation — frontier ML for autonomous vehicles // NEWS Uber engineers named Agentic AI Foundation ambassadors, shaping open agent standards // BLOG GitFarm: Git-as-a-Service — sub-500ms checkouts, 80%+ resource savings // HIRING Staff ML Engineer, Applied AI — foundation models across Mobility and Delivery // NEWS Uber AV Labs at CVPR 2026: turning billions of real trips into training for Level 4 autonomy // RESEARCH Moirai: Lossless Straggler Mitigation — accepted at SOSP 2025 // BLOG uReview: GenAI code review analyzing 90% of Uber's ~65K weekly changes // NEWS Uber at the AI Engineer World's Fair: agentic SDLC, GenAI code review, and multimodal evals // HIRING Senior Security Incident Commander — lead Uber's critical security response // OSS RIBs — 7.9k stars: Cross-platform mobile architecture framework // NEWS Uber GenAI Meetup, Hyderabad: 80+ builders on trust, cost, and talent in the age of agents // BLOG Building Uber's Multi-Cloud Secrets Management Platform // NEWS UberMobiConf: 280+ engineers on reimagining mobile with AI and platform excellence // NEWS Agentic Pods: embedding engineers to cut capital allocation from 15 hours to 30 minutes // OSS zap — 24.3k stars: Blazing fast structured logging for Go // HIRING Senior ML Engineer, AV Foundation — frontier ML for autonomous vehicles // NEWS Uber engineers named Agentic AI Foundation ambassadors, shaping open agent standards // BLOG GitFarm: Git-as-a-Service — sub-500ms checkouts, 80%+ resource savings // HIRING Staff ML Engineer, Applied AI — foundation models across Mobility and Delivery // NEWS Uber AV Labs at CVPR 2026: turning billions of real trips into training for Level 4 autonomy // RESEARCH Moirai: Lossless Straggler Mitigation — accepted at SOSP 2025 // BLOG uReview: GenAI code review analyzing 90% of Uber's ~65K weekly changes // NEWS Uber at the AI Engineer World's Fair: agentic SDLC, GenAI code review, and multimodal evals // HIRING Senior Security Incident Commander — lead Uber's critical security response // OSS RIBs — 7.9k stars: Cross-platform mobile architecture framework // NEWS Uber GenAI Meetup, Hyderabad: 80+ builders on trust, cost, and talent in the age of agents // BLOG Building Uber's Multi-Cloud Secrets Management Platform // NEWS UberMobiConf: 280+ engineers on reimagining mobile with AI and platform excellence //

0 concurrent trips & orders

0 requests/sec across microservices

0 ETA predictions served per day

0 ML models in production

How We Build

Principles forged by the demands of real-time, physical-world systems.

Multi-sided Marketplace

Real-time forecasting, dynamic pricing & matching across earners, consumers and merchants — all in milliseconds.

Hyper-local Geospatial

H3 hex grids, sub-second ETAs, real-time routing and location intelligence powering every trip on the planet.

Platform First

Modular building blocks that let us launch new verticals — rides, eats, freight, autonomous — on shared infrastructure.

Global Adaptability

70+ countries, thousands of regulatory frameworks, dozens of languages — one platform that adapts everywhere.

Resiliency & Scalability

Fault-tolerant systems serving millions of concurrent users. Five 9s is not aspirational, it is mandatory.

Engineered for the Real World

Stories of building the systems that move the world in real time.

BACKEND

GitFarm: Git as a Service for Large-Scale Monorepos

Cloning Uber's Go monorepo took 15 minutes and 40GB of disk, and thousands of clone operations overwhelmed Git servers. GitFarm delivers Git-as-a-Service over gRPC, with warm bare clones and ephemeral sandbox pools acquired in under 500ms. A read-heavy ownership service cut CPU 77%, memory 90%+, and startup from 20 minutes to under one.

BACKEND

Modernizing Artifact Storage at Uber

Uber's decade-old on-prem artifact store — serving 5+ petabytes of downloads monthly — risked write outages, silent replication failures, and multi-hour manual rebalancing. The team migrated to multi-region cloud blob storage behind a validation proxy that checks freshness via conditional HTTP requests against a MySQL metadata store. Egress fell over 99%, cutting artifact egress costs nearly 90% at 99.99% availability.

BACKEND

Zero-Growth Stack, Real Gains: How Stack Allocation Can Save 10% CPU in Go

Go's default 2KB stack allocation triggered wasteful expansion cycles consuming 10% CPU in Uber's high-throughput services. The team built a profile-guided automation system reading CPU profiles and binary symbols to compute optimal per-service stack sizes. Result: CPU overhead dropped below 1%, enabling 16% efficiency gains while keeping memory overhead under 2% across 2M cores.

BACKEND

The 5 Layers Every Cloud Commitment Depends On

Uber's decade-long cloud commitment — a distributed systems problem spanning 10-figure spend — required mastering five physical layers. The challenge: regional topology constraints, inelastic power limits, and egress economics compound silently into cost variances. The solution: systematic qualification of compute, SKU selection, and network topology upfront. Impact: eliminating technical debt before it scales; enabling portability across zones and regions in minutes.

BACKEND

How Ansible Automation Powers the Uber Corporate Network at a Global Scale

Manual management of 5,000 network devices across six continents created unstable configurations and constant downtime. Uber implemented Ansible-driven "Daily Nightly Enforcement" — automating backups, golden config generation via Jinja templates, and standardized pushes across multi-vendor hardware. Unified network state across regions, eliminated untracked manual changes overnight, and transformed a chaotic distributed system into reliably predictable infrastructure.

BACKEND

Hybrid Core Allocation: From Overallocation to Reliable Sharing

Odin's vertical CPU scaler struggled with bursty workload patterns under pure dedicated core allocation. Uber engineers introduced a hybrid model combining dedicated cores for baseline performance with shared cores for elastic bursts, managed via Linux cpusets and cpu.shares. Result: reduced overprovisioning for variable workloads, increased fleet-wide CPU utilization, and maintained service-level guarantees without cross-host workload migration.

DATA

Simplifying Data and Product Integrations with a Data Abstraction Layer

Building a new advertiser report took weeks to months because queries were tightly coupled to constantly evolving data schemas fragmented across sources. Uber's Data Abstraction Layer, an RPC service, maps logical tables to physical ones via declarative config, resolving joins and aggregations across OLAP, Hive, and Docstore. Report time-to-market dropped from multi-week to under two days — roughly 90%.

DATA

Beyond Prediction: Solving the Multiple Knapsack Problem at Scale

Allocating millions of incentive offers across Mobility and Delivery under strict quarterly budgets relied on a manual First-In-First-Out heuristic with no optimization. Tarot frames the problem as a Multiple Knapsack Problem, pairing uplift models and a budget pacer with a CP-SAT solver. Budget utilization climbed from 68% to 99.99%, and 100,000-user problems now solve in minutes instead of 24+ hours.

DATA

Validating Bounding Box Annotations

Human annotators introduce tracking errors — ID swaps, position jumps, freeze errors, scale distortions — when labeling video bounding boxes. Uber deployed an XGBoost classifier analyzing 283 visual, motion, and coordinate features across an 11-frame window to catch these failures automatically. The system flags errors in real-time before submission, eliminating costly sequential review workflows while preserving data integrity.

DATA

Evolution and Scale of Uber's Delivery Search Platform

Lexical search broke on Uber Eats — failing on synonyms, typos, and multilingual queries like "pan" (bread or cookware?). The new semantic stack: a two-tower deep network with a Qwen LLM backbone, trained via DeepSpeed ZeRO-3. Matryoshka embeddings cut storage 50% with under 0.3% quality loss; scalar quantization halves latency.

DATA

Transforming Ads Personalization with Sequential Modeling and Hetero-MMoE

Uber's ads system flattened rich behavioral sequences into summary stats and used MLP-only experts that missed cross-feature interactions. A target-aware transformer with Multi-Head Latent Attention captures sequences at O(N×L), and a Hetero-MMoE blends MLP, DCN, and CIN experts. Production gains: +0.93% pCTR AUC and +0.66% pCTO AUC.

DATA

Database Federation: Decentralized Hive Databases

16,000 datasets in one monolithic Hive metastore meant shared-fate blast radius — one bad operation could affect every team. Uber decomposed it into domain-specific databases via pointer-level metadata manipulation, achieving zero-downtime migration. Result: over 1 PB saved and the organizational independence teams needed to own their data contracts.

MOBILE

How Uber Built an Agentic System to Automate Design Specs in Minutes

Manual design specs across UIKit, SwiftUI, Android XML, Compose, Web React, Go, and SDUI were a bottleneck causing constant documentation drift. uSpec combines AI agents with a Figma Console MCP bridge that reads real tokens and variants directly from Figma — running locally via Cursor over WebSocket. Screen reader specs across all 3 platforms now generate in under 2 minutes.

MOBILE

Standardized Mobile Analytics for Cross-Platform Insights

Over 40% of Uber's mobile events were ad-hoc custom logs, breaking cross-platform analysis and impression accuracy. The team standardized to three universal event types — tap, impression, scroll — using AnalyticsBuilder classes that capture metadata at the platform layer. Result: 30% less transient-impression noise and reliable iOS/Android parity.

MOBILE

Unified Checkout: Streamlining Uber's Payment Ecosystem

Every Uber line of business had built payments independently — duplicated logic, inconsistent UX, Apple Pay missing from half the flows. EU Strong Customer Authentication forced a reckoning. Uber built a centralized checkout orchestrator with modular components each LOB plugs into. Holdout results: 3% higher conversion, 4.5% better session recovery, hundreds of millions in incremental gross bookings.

MOBILE

Measuring Design System Adoption at Scale

Uber's Rider app launches features across hundreds of screens and thousands of feature flags — making manual design system audits impossible. Design System Observability adds a deterministic component scanner that flags non-Base elements, plus a daily screenshot pipeline that auto-files Jira tickets for violations. Teams using Base report 3x faster development and 50% less code.

MOBILE

Transforming Executive Travel: Delegate Booking

Letting executive assistants book rides for executives meant rethinking trip ownership, identity, billing, and notifications across 30+ backend services and 5 client platforms. Uber introduced a "participant model" extending every booking, tracking, and billing touchpoint to support multiple user profiles per trip — with full audit trails for both EA and executive.

MOBILE

Uber's Live Activity on iOS

Live Activities run sandboxed with no network access — yet Uber needed real-time driver location on the lock screen. App Groups share on-disk state between the main app and Live Activity, a lightweight DSL syncs content logic across iOS Live Activities and Android push, and an OOA backend debounces updates. Result: 2.26% fewer driver and 2.13% fewer rider cancellations at pickup.

AI / ML

Cart Assistant: Agentic Grocery Shopping on Uber Eats

Turning handwritten lists, recipe screenshots, and vague meal plans into a grocery cart forces users to hand-translate intent into dozens of searches. Uber Eats' Cart Assistant decomposes the task into an eight-stage state graph — LLM planning, candidate retrieval, semantic relevance judging, and deterministic price and quantity guardrails — processing items concurrently to keep the agentic shopping flow fast.

AI / ML

Building a File Semantic Analyzer: Guarding Outbound Data at Scale with AI

Keyword-based data-loss prevention drowned analysts in false positives while missing real exfiltration across thousands of employees' files. Uber's File Semantic Analyzer uses fine-tuned LLMs — OCR, chunking, summarization, and entity extraction — to reason over file meaning and intent, with transparent justifications. False positives fell 97%, incident response dropped from hours to minutes, saving four person-years annually across 150,000 files.

AI / ML

Scaling Real-Time Traffic Forecasting with a Graph-Aware Transformer

Uber's legacy traffic system couldn't adapt to incidents, weather, and sparse rural roads where segment errors compound on long trips. DeepETT, a multi-view transformer, tokenizes pre-aggregated spatial and temporal observations for constant-time inference, with a Flink pipeline calibrating drift in real time. It serves 2M+ forecasts per second, improving long-trip accuracy 6% and variance explained 19% — $100M annualized value.

AI / ML

Lessons from Building a First-Pass AI PRD Reviewer at Uber

Product review processes surfaced critical gaps too late, forcing teams into costly rework on unsupported assumptions and blind dependencies. Uber built an AI-powered PRD Evaluator that assembles contextual knowledge across linked documents, prior experiments, and company-specific frameworks to audit proposals before high-level review. Early adoption by dozens of PMs revealed stronger artifact quality entering checkpoints and sharper, faster review discussions.

AI / ML

AI Prototyping Is Changing How We Build Products at Uber

Cross-functional alignment used to take weeks of meetings and PRDs. AI prototyping tools — Lovable, Figma Make, Claude Code, Cursor — compressed a merchant team's four-week discussion into two hours, and let a PM explore six concepts in 20 minutes. Nearly 40% of Uber's global hackathon submissions now incorporate these tools.

AI / ML

Open Source and In-House: How Uber Optimizes LLM Training

Training LLMs for Eats, support, and code gen at Uber means squeezing every GPU cycle. The stack: PyTorch, Ray, DeepSpeed ZeRO-3 CPU Offload (34% memory reduction, 2-7x batch sizes), and Flash Attention (50% memory savings). On H100, Mixtral 8x7b achieves 3x A100 throughput, scaling linearly to batch 64.

SECURITY

Solving the Identity Crisis for AI Agents

When AI agents delegate across systems, the original human's identity vanishes, breaking audit trails and fine-grained access control. Uber extended its Zero Trust architecture with an Agent Registry, a Security Token Service issuing short-lived single-hop JWTs carrying the full actor chain, and an MCP Gateway enforcing policy. Deployed across thousands of agents, token exchange stays under 40ms at P99.

SECURITY

Superuser Gateway: Guardrails for Privileged Command Execution

A misplaced flag in a privileged `rm -r` could silently delete a production dataset with no audit trail. Superuser Gateway removes superuser credentials from engineers' machines entirely, routing privileged commands through a Git-backed PR workflow with CLI submission, CI validation, peer approval, and controlled remote execution. Now standard for all data platform admins.

SECURITY

Determinism and Safety in IAM Policy Changes

An accidental IAM change on a critical gateway once stopped Uber Eats customers from modifying orders — and ~10% of monthly policy changes involve risky privilege removal. The Policy Simulator pulls 30–90 days of access logs from Apache Pinot and replays them through current vs. proposed policies. Cadence-orchestrated, sub-minute impact analysis before anything ships.

SECURITY

Automating Kerberos Keytab Rotation at Uber

Rotating 100,000+ Kerberos keytabs is risky: rotation invalidates the previous key immediately, leaving applications without valid credentials. Uber's solution generates keytabs with both old and new versions during transitions, drops fetch intervals to 30s, and integrates with the Secret Management Platform. Now rotates 30,000+ keytabs monthly with zero disruptions.

SECURITY

Multi-Cloud Secrets Management Platform

150,000 secrets across 25 fragmented vaults — no centralized detection, rotation, or attribution. Uber consolidated into 6 managed vaults, deployed real-time scanning across git/Slack/CI, and built a Cadence-orchestrated Secret Lifecycle Manager. A team of 10 now drives 20,000 automated monthly rotations with 90% fewer secrets exposed in pipelines.

SECURITY

Security for Hadoop Data Lake on Google Cloud Storage

Migrating 160+ PB from HDFS to GCS meant bridging Kerberos delegation tokens and GCP OAuth 2.0 — without changing any of thousands of analytical jobs. The Storage Access Service intercepts FileSystem calls, exchanges Hadoop tokens for time-bound GCP credentials, and caches across three layers. Handles 500,000+ RPS at 0.026ms average latency.

View all posts

News

The latest from Uber Engineering and beyond.

Inside Uber's Agentic Pods

Uber embeds engineers alongside operators to hunt for AI wins hiding in plain sight. The results: capital allocation from 15 hours to 30 minutes, financial reports from two days to ten.

Uber Engineers Named Agentic AI Foundation Ambassadors

Uber engineers Gaurav Goel and Sanjeev Suresh are named Agentic AI Foundation ambassadors — recognizing the technical leadership shaping open standards for how autonomous agents are built, secured, and governed.

Uber AV Labs at CVPR 2026

Uber AV Labs took its work to CVPR 2026, showing how billions of real trips become training signal for Level 4 autonomy — mining long-tail driving scenarios at a scale few can match.

Uber at the AI Engineer World's Fair

Fresh from the AI Engineer World's Fair, Uber engineers recap their sessions on the agentic SDLC, GenAI code review, and multimodal agent evaluation — how AI now runs through Uber's engineering.

Uber GenAI Meetup: Hyderabad

Over 80 AI practitioners, engineers, and builders filled Uber's Hyderabad office for technical talks and a panel on "Beyond the Prototype: Trust, Cost, and Talent in the Age of Agents."

UberMobiConf: Mobile, Reimagined

Uber India Tech's largest external event yet brought 280+ engineers together in person, plus a global webinar audience, to tackle AI agents, developer productivity, and app performance — "Mobile Reimagined."

Research

Browse research and articles authored by our team members.

ICSE-SEIP 2026

Scaling Mobile Chaos Testing with AI-Driven Test Execution

Mobile applications in large-scale distributed systems are susceptible to backend service failures, yet traditional chaos engineering approaches cannot scale mobile testing due to the complexity of end-to-end test environments.

Continue reading → SOSP 2025

Moirai: Optimizing Placement of Data and Compute in Hybrid Clouds

The deployment of large-scale data analytics between on-premise and cloud sites requires careful partitioning of both data and computation to avoid massive networking costs and performance degradation.

Continue reading → ICSE 2025

CI at Scale: Lean, Green, and Fast

Maintaining a "green" mainline branch — where all builds pass successfully — is crucial but challenging in fast-paced, large-scale software development environments, particularly with concurrent code changes in large monorepos.

Continue reading →