Practice of Apache Hudi in building real-time data lake at station B | October 21, 2021 | by Yu Zhaojing | use-case, real-time datalake, developpaper
How Amazon Transportation Service enabled near-real-time event analytics at petabyte scale using AWS Glue with Apache Hudi | October 14, 2021 | by Madhavan Sriram, Diego Menin, Gabriele Cacciola and Kunal Gautam | use-case, near real-time analytics, analytics at scale, amazon
Building an ExaByte-level Data Lake Using Apache Hudi at ByteDance | September 1, 2021 | by Ziyue Guan, translated to English by yihua | use-case, apache hudi
Improving Marker Mechanism in Apache Hudi | August 18, 2021 | by yihua | design, timeline-server, markers, apache hudi
Schema evolution with DeltaStreamer using KafkaSource | August 16, 2021 | by sbernauer | design, deltastreamer, schema, apache hudi, apache kafka
Cost-Efficient Open Source Big Data Platform at Uber | August 11, 2021 | by Zheng Shao and Mohammad Islam | cost efficiency, optimization, bigdata, data platform, incremental processing, uber
MLOps Wars: Versioned Feature Data with a Lakehouse | August 3, 2021 | by David Bzhalava and Jim Dowling | use-case, mlops, feature store, incremental processing, time travel query, logicalclocks
Baixin bank’s real-time data lake evolution scheme based on Apache Hudi | July 26, 2021 | use-case, real-time datalake, incremental processing, developpaper