Build Slowly Changing Dimensions Type 2 (SCD2) with Apache Spark and Apache Hudi on Amazon EMRApril 12, 2021 byDavid Greenshteinhow-toscd2amazon
Build a data lake using amazon kinesis data stream for amazon dynamodb and apache hudiMarch 4, 2021 byDhiraj Thakur,Dylan QuandSaurabh Shrivastavahow-tostreaming ingestionamazon
Streaming Responsibly - How Apache Hudi maintains optimum sized filesMarch 1, 2021 byshivnarayandesignfile sizingapache hudi
Data Lakehouse: Building the Next Generation of Data Lakes using Apache HudiMarch 1, 2021 byRyan D'SouzaandBrandon Stanleyblogdata-lakehousemedium
Time travel operations in Hopsworks Feature StoreFebruary 24, 2021use-caseincremental processingfeature storetime travel queryhopsworks
Optimize Data lake layout using Clustering in Apache HudiJanuary 27, 2021 bysatish.kothadesignclusteringapache hudi
Building High-Performance Data Lake Using Apache Hudi and Alluxio at T3GoDecember 1, 2020 byt3gouse-casenear real-time analyticsincremental processingcachingapache hudi
Can Big Data Solutions Be Affordable?November 29, 2020blogbig-datanear real-time analyticsanalyticsinsight
Employing the right indexes for fast updates, deletes in Apache HudiNovember 11, 2020 byvinothhow-toindexingapache hudi