The Art of Building Open Data Lakes with Apache Hudi, Kafka, Hive, and DebeziumDecember 31, 2021 byGary Staffordhow-todatalakemedium
Hudi Z-Order and Hilbert Space Filling CurvesDecember 29, 2021 byAlexey Kudinkin and Tao Mengdesignclusteringdata skippingapache hudi
New features from Apache Hudi 0.7.0 and 0.8.0 available on Amazon EMRDecember 20, 2021 byUdit MehrotraandGagan Brahmiblogamazon
Lakehouse Concurrency Control: Are we too optimistic?December 16, 2021 byvinothblogconcurrency-controlapache hudi
How GE Aviation built cloud-native data pipelines at enterprise scale using the AWS platformNovember 16, 2021 byAlcuin WeidusandSuresh Patnamuse-caseanalytics at scaleamazon
Practice of Apache Hudi in building real-time data lake at station BOctober 21, 2021 byYu Zhaojinguse-casereal-time datalakedeveloppaper
How Amazon Transportation Service enabled near-real-time event analytics at petabyte scale using AWS Glue with Apache HudiOctober 14, 2021 byMadhavan Sriram,Diego Menin,Gabriele CacciolaandKunal Gautamuse-casenear real-time analyticsanalytics at scaleamazon
Building an ExaByte-level Data Lake Using Apache Hudi at ByteDanceSeptember 1, 2021 byZiyue Guan, translated to English by yihuause-caseapache hudi