We implemented a Kappa architecture at Uber to effectively backfill streaming data at scale, ensuring accurate data in our platform.
In 2019, Uber's Data Platform team leveraged data science to improve the efficiency of our infrastructure, enabling us to compute optimum datastore and hardware usage.
We redesigned the Uber Freight app with RIBs, our open source plugin architecture, to enable quicker feature rollouts and an improved user experience.
Data science helps Uber determine which tables in a database should be off-boarded to another source to maximize the efficiency of our data warehouse.
With zero downtime, Uber's Payments Engineering team embarked on a migration that would allow authorization hold logic to be written once and used across existing and future payments products.
On May 3, 2019, Uber’s Programming Systems Team hosted the Programming Systems and Tools Track of the company’s Second Uber Science Symposium, featuring a full day of talks by leading researchers and practitioners in the the field.
Aarhus Engineering Internship: Building Aggregation Support for YQL, Uber’s Graph Query Language for Grail
Uber intern Lau Skorstengaard shares his experience working on YQL, the graph query language for our in-house infrastructure state aggregation platform.
Uber built Makisu, our open source Docker image builder, to enable the quick, reliable generation of Dockerfiles in Mesos and Kubernetes ecosystems.
Uber’s Observability team built a robust, scalable metrics and alerting pipeline to detect, mitigate, and notify engineers of issues as they occur.
Uber developed Peloton to help us balance resource use, elastically share resources, and plan for future capacity needs.
Responsible for cleaning, storing, and serving over 100 petabytes of analytical data, Uber's Hadoop platform ensures data reliability, scalability, and ease-of-use with minimal latency.
M3, Uber's open source metrics platform for Prometheus, facilitates scalable and configurable multi-tenant storage for large-scale metrics.
Databook, Uber's in-house platform for surfacing and exploring contextual metadata, makes dataset discovery and exploration easier for teams across the company.
Shan He, the technical lead on Uber's kepler.gl framework, discusses her journey to data visualization and why she believes open source is such an important part of her team's work.
Uber's Data Infrastructure team overhauled our approach to scaling our storage infrastructure by incorporating several new features and functionalities, including ViewFs, NameNode garbage collection tuning, and an HDFS load management service.
Written in Haskell, Queryparser is Uber Engineering's open source tool for parsing and analyzing SQL queries that makes it easy to identify foreign-key relationships in large data warehouses.
Migrating our Schemaless sharding layer from Python to Go while in production demonstrated that it was possible for us to rewrite the frontend of a massive datastore with zero downtime.
The Uber Insurance Engineering team extended Kafka’s role in our existing event-driven architecture by using non-blocking request reprocessing and dead letter queues (DLQ) to achieve decoupled, observable error-handling without disrupting real-time traffic.
Uber's mobile engineers leverage code generation to make our applications more reliable and boost developer productivity.
Get to know Uber Aarhus Engineering and the work they do behind the scenes to build and maintain our storage and compute platforms.
12Page 1 of 2