Schema evolution with DeltaStreamer using KafkaSource (August 16, 2021, by sbernauer). Tags: design, deltastreamer, schema, apache hudi, apache kafka
Cost-Efficient Open Source Big Data Platform at Uber (August 11, 2021, by Zheng Shao and Mohammad Islam). Tags: cost efficiency, optimization, bigdata, data platform, incremental processing, uber
MLOps Wars: Versioned Feature Data with a Lakehouse (August 3, 2021, by David Bzhalava and Jim Dowling). Tags: use-case, mlops, feature store, incremental processing, time travel query, logicalclocks
Baixin bank’s real-time data lake evolution scheme based on Apache Hudi (July 26, 2021). Tags: use-case, real-time datalake, incremental processing, developpaper
Part 1: Query Apache Hudi dataset in an Amazon S3 data lake with Amazon Athena: Read optimized queries (July 16, 2021, by Dhiraj Thakur, Sameer Goel and Imtiaz Sayed). Tags: how-to, read optimized query, amazon
Employing correct configurations for Hudi's cleaner table service (June 10, 2021, by pratyakshsharma). Tags: how-to, cleaner, apache hudi
Build Slowly Changing Dimensions Type 2 (SCD2) with Apache Spark and Apache Hudi on Amazon EMR (April 12, 2021, by David Greenshtein). Tags: how-to, scd2, amazon