
Kai Jiang

Kai Jiang is a Software Engineer at Uber. He focuses on big data file format efficiency. He is also a contributor to Apache Beam, Parquet, and Spark.

Engineering Blog Articles

Cost Efficiency @ Scale in Big Data File Format

 

Background

Our Apache Hadoop®-based data platform ingests hundreds of petabytes of analytical data with minimal latency and stores it in a data lake built on top of the Hadoop Distributed File System (HDFS). We use Apache Hudi