Skip to main content
Uber logo

Schedule rides in advance

Reserve a rideReserve a ride

Schedule rides in advance

Reserve a rideReserve a ride
Data / ML, Engineering

Making Apache Spark Effortless for All of Uber

July 17, 2019 / Global
Featured image for Making Apache Spark Effortless for All of Uber
Figure 1: uSCS enables technical teams at Uber to easily prototype with Spark notebooks.
Figure 2: Users can schedule Spark notebooks and inspect results for each run.
Figure 3: Uber’s internal data administration website displays information about Spark applications.
Figure 4: At Uber, each of our infrastructure’s geographic regions possess their own storage services and multiple mutually-isolated compute clusters.
Figure 5: The uSCS Gateway can choose to run a Spark application on any cluster in any region, by forwarding the request to that cluster’s Apache Livy deployment.
Figure 6: The path that a customer’s Spark application launch request follows through uSCS.
Abhishek Modi

Abhishek Modi

Modi is a software engineer on Uber’s Data Platform team. Modi helps unlock new possibilities for processing data at Uber by contributing to Apache Spark and its ecosystem.

Adam Hudson

Adam Hudson

Adam is a senior software engineer on Uber’s Data Platform team. Adam works on solving the many challenges raised when running Apache Spark at scale.

Posted by Abhishek Modi, Adam Hudson