Elastic Deep Learning with Horovod on Ray
Introduction
In 2017, we introduced Horovod, an open source framework for scaling deep learning training across hundreds of GPUs in parallel. At the time, most...
How Uber Deals with Large iOS App Size
The App Size Problem
Uber’s iOS mobile apps for Rider, Driver, and Eats are large in size. The choice of Swift as our primary programming...
Evolving Schemaless into a Distributed SQL Database
Introduction
In 2016 we published blog posts (I, II) about Schemaless - Uber Engineering’s Scalable Datastore. We went over the design of Schemaless as well...
Fast and Reliable Schema-Agnostic Log Analytics Platform
At Uber, we provide a centralized, reliable, and interactive logging platform that empowers engineers to work quickly and confidently at scale. The logs are...
Uber’s Real-time Data Intelligence Platform At Scale: Improving Gairos Scalability/Reliability
Background
Real-time data (number of ride requests, number of drivers available, weather, game) enables operations teams to make informed decisions like surge pricing, maximum dispatch...
The Journey Towards Metric Standardization
At Uber, business metrics are vital for discovering insights about how we perform, gauging the impact of new products, and optimizing the decision making...
Uber’s Real-Time Push Platform
Uber builds multi-sided marketplaces handling millions of trips every day across the globe. We strive to build real-time experiences for all our users. The nature...
Horovod v0.21: Optimizing Network Utilization with Local Gradient Aggregation and Grouped Allreduce
We originally open-sourced Horovod in 2017, and since then it has grown to become the standard solution in industry for scaling deep learning training...
Turning Metadata Into Insights with Databook
Every day in over 10,000 cities around the world, millions of people rely on Uber to travel, order food, and ship cargo. Our apps...
Meet the 2020 Safety Engineering Interns: COVID Edition
About the Safety team & What we do
Uber is dedicated to keeping people safe on the road. The Safety and Insurance Engineering team is...
Operating Apache Pinot @ Uber Scale
Introduction
Uber has a complex marketplace consisting of riders, drivers, eaters, restaurants, and so on. Operating that marketplace at a global scale requires real-time intelligence...
Building from the Baltics: Meet the Uber Engineering Team in Vilnius, Lithuania
The Uber Vilnius office is home to members of our Production Engineering, Infrastructure, Storage Platform, and Developer Tools teams.
Ludwig v0.3 Introduces Hyperparameter Optimization, Transformers and TensorFlow 2 support
In February 2019, Uber released Ludwig, an open source, code-free deep learning (DL) toolbox that gives non-programmers and advanced machine learning (ML) practitioners alike...
Revolutionizing Money Movements at Scale with Strong Data Consistency
Uber as a platform invites its users to leverage it, earn from it, and be delighted by it. Serving more than 18 million requests...
Spearheading Open Source: A Conversation with Jim Jagielski, Staff Technical Program Manager with the...
Jim Jagielski's fascination with open source software began out of necessity. He was working at NASA Goddard in the 1980s, and the agency had...
Designing Edge Gateway, Uber’s API Lifecycle Management Platform
The making of Edge Gateway, the highly available and scalable self-serve gateway to configure, manage, and monitor APIs of every business domain at Uber. Evolution of...
Standing for Safety: Meet the Uber Sao Paulo Tech Team
Located in the heart of Latin America’s largest city, the Uber Sao Paulo Tech Center was founded in late 2018 as a company-wide hub...
Introducing Domain-Oriented Microservice Architecture
Introduction
Recently there has been substantial discussion around the downsides of service-oriented architectures and microservice architectures in particular. While only a few years ago,...
Engineering Failover Handling in Uber’s Mobile Networking Infrastructure
Millions of users use Uber’s applications every day across the globe, accessing seamless transportation or meal delivery at the push of a button. To achieve...
Fiber: Distributed Computing for AI Made Simple
Project Homepage: GitHub
Over the past several years, increasing processing power of computing machines has led to an increase in machine learning advances. More and...