Tag: Reliability

Scalable Systems & Scalable Careers: A Chat with Uber’s Sumbry

What do Site Reliability Engineering (SRE) and mentorship have in common? According to Uber SRE manager Sumbry, both areas focus on growth.

Building Reliable Reprocessing and Dead Letter Queues with Apache Kafka

The Uber Insurance Engineering team extended Kafka’s role in our existing event-driven architecture by using non-blocking request reprocessing and dead letter queues (DLQ) to achieve decoupled, observable error-handling without disrupting real-time traffic.

Engineering More Reliable Transportation with Machine Learning and AI at Uber

In this article, we highlight how Uber leverages machine learning and artificial intelligence to tackle engineering challenges at scale.

Engineering Uber’s On-Call Dashboard

Uber Engineering's On-Call Dashboard provides real-time incident response, shift maintenance, and post-mortem analysis for an improved on-call experience.

Engineering NullAway, Uber’s Open Source Tool for Detecting NullPointerExceptions on Android

Uber Engineering built and open sourced NullAway, our fast and practical tool for eliminating NPEs, to help others deploy more reliable Android apps.

My Site Reliability Engineering Internship Experience with Uber

What did you do this summer? In this article, intern Mitali Palekar reflects on her experience as a member of Uber's Site Reliability Engineering team.

Presenting the Engineering Behind Uber at Our Technology Day

A daylong event at Uber’s Palo Alto office, sponsored by our LadyEng group, showcased the technical work across Uber Engineering as well as the people who are leading and building these projects. Here are some of the resulting presentations.

Popular Articles