From Batch to Streaming: Accelerating Data Freshness in Uber's Data LakeDecember 12, 2025 by Uber Engineeringstreamingapache flinkdatalakeapache hudiuber
Why Uber Built Hudi: The Strategic Decision Behind a Custom Table FormatJuly 3, 2025 by ThamizhElango NatarajanblogApache HudiApache IcebergLakehouseuse-caseUberdet
Scaling Complex Data Workflows at Uber Using Apache HudiJune 30, 2025 by Ankit Shrivastava in collaboration with DipankarApache HudiUberCommunity
Setting Uber’s Transactional Data Lake in Motion with Incremental ETL Using Apache HudiMarch 16, 2023 by Vinoth Govindarajan, Saketh Chintapalli, Yogesh Saswade and Aayush Barejaincremental processingdatalakeapache hudimedallion architectureuber
Cost Efficiency @ Scale in Big Data File FormatJanuary 25, 2022 by Xinli Shang, Kai Jiang, Zheng Shao and Mohammad Islamblogcost efficiencycompressionanalytics at scaleuber
Cost-Efficient Open Source Big Data Platform at UberAugust 11, 2021 by Zheng Shao and Mohammad Islamcost efficiencyoptimizationbigdatadata platformincremental processinguber
Building a Large-scale Transactional Data Lake at Uber Using Apache HudiJune 9, 2020 by Nishith Agarwaluse-casedatalakeanalytics at scaleuber
Hoodie: Uber Engineering's Incremental Processing Framework on HadoopMarch 12, 2017 by Prasanna Rajaperumal and Vinoth Chandaruse-caseincremental processinguber