How Zoom implemented streaming log ingestion and efficient GDPR deletes using Apache Hudi on Amazon EMR | May 16, 2023 | by Sekar Srinivasan, Amit Kumar Agrawal, Chandra Dhandapani and Viral Shah | Tags: use-case, streaming ingestion, gdpr deletion, delete, amazon
Lakehouse at Fortune 1 Scale | May 3, 2023 | by Samuel Guleff | Tags: use-case, comparison, performance, walmartglobaltech
Build Your First Hudi Lakehouse with AWS S3 and AWS Glue | December 19, 2022 | by Nadine Farah | Tags: how-to, use-case, apache hudi, aws
How Hudl built a cost-optimized AWS Glue pipeline with Apache Hudi datasets | November 10, 2022 | by Indira Balakrishnan, Ramzi Yassine and Swagat Kulkarni | Tags: use-case, cost-efficiency, incremental-processing, near real-time analytics, amazon
Implementation of SCD-2 (Slowly Changing Dimension) with Apache Hudi & Spark | August 24, 2022 | by Jayasheel Kalgal, Esha Dhing and Prashant Mishra | Tags: use-case, scd2, walmartglobaltech
How NerdWallet uses AWS and Apache Hudi to build a serverless, real-time analytics platform | August 9, 2022 | by Kevin Chun and Dylan Qu | Tags: use-case, near real-time analytics, incremental-processing, amazon
The story of building a data lake that can be deleted on a record-by-record basis using Apache Hudi | May 25, 2022 | by Shota Ejima | Tags: use-case, gdpr deletion, yahoo
Key Learnings on Using Apache HUDI in building Lakehouse Architecture @ Halodoc | April 4, 2022 | by Jitendra Shah | Tags: use-case, lakehouse, incremental-processing, halodoc
Fresher Data Lake on AWS S3 | February 17, 2022 | by Balaji Varadarajan | Tags: use-case, incremental-processing, robinhood
Hudi powering data lake efforts at Walmart and Disney+ Hotstar | January 20, 2022 | by Sean Michael Kerner | Tags: use-case, techtarget
How GE Aviation built cloud-native data pipelines at enterprise scale using the AWS platform | November 16, 2021 | by Alcuin Weidus and Suresh Patnam | Tags: use-case, analytics at-scale, amazon
Practice of Apache Hudi in building real-time data lake at station B | October 21, 2021 | by Yu Zhaojing | Tags: use-case, real-time-datalake, developpaper
How Amazon Transportation Service enabled near-real-time event analytics at petabyte scale using AWS Glue with Apache Hudi | October 14, 2021 | by Madhavan Sriram, Diego Menin, Gabriele Cacciola and Kunal Gautam | Tags: use-case, near real-time analytics, analytics at-scale, amazon
Building an ExaByte-level Data Lake Using Apache Hudi at ByteDance | September 1, 2021 | by Ziyue Guan, translated to English by yihua | Tags: use-case, apache hudi
MLOps Wars: Versioned Feature Data with a Lakehouse | August 3, 2021 | by David Bzhalava and Jim Dowling | Tags: use-case, mlops, feature-store, incremental-processing, time-travel, logicalclocks
Baixin bank’s real-time data lake evolution scheme based on Apache Hudi | July 26, 2021 | Tags: use-case, real-time-datalake, incremental-processing, developpaper
Time travel operations in Hopsworks Feature Store | February 24, 2021 | Tags: use-case, incremental-processing, feature-store, time-travel, hopsworks
Building High-Performance Data Lake Using Apache Hudi and Alluxio at T3Go | December 1, 2020 | by t3go | Tags: use-case, near real-time analytics, incremental-processing, caching, apache hudi
Origins of Data Lake at Grofers | October 19, 2020 | by Akshay Agarwal | Tags: use-case, datalake, change-data-capture, cdc, grofers
Building a Large-scale Transactional Data Lake at Uber Using Apache Hudi | June 9, 2020 | by Nishith Agarwal | Tags: use-case, datalake, analytics at-scale, uber
Hoodie: Uber Engineering's Incremental Processing Framework on Hadoop | March 12, 2017 | by Prasanna Rajaperumal and Vinoth Chandar | Tags: use-case, incremental-processing, uber
The Case for incremental processing on Hadoop | August 4, 2016 | by Vinoth Chandar | Tags: use-case, incremental-processing, oreilly