Tag: Data Ingestion
Solving Big Data Challenges with Data Science at Uber
How engineers and data scientists at Uber came together to come up with a means of partially replicating Vertica clusters to better scale our data volume.
Queryparser, an Open Source Tool for Parsing and Analyzing SQL
Written in Haskell, Queryparser is Uber Engineering's open source tool for parsing and analyzing SQL queries that makes it easy to identify foreign-key relationships in large data warehouses.
Engineering Data Analytics with Presto and Apache Parquet at Uber
Snap your fingers and presto! How Uber Engineering built a fast, efficient data analytics system with Presto and Parquet.
Streamific, the Ingestion Service for Hadoop Big Data at Uber Engineering
Here we look at Hadoop data ingestion, and how Uber Engineering streams diverse data into a cohesive layer for querying in near real-time using our in-house developed Streamific.