Apache Hudi: Managing Partition on a petabyte-scale tableFebruary 4, 2024 byKrishna Prasadblogapache hudimediumintermediatepartitionaws glueapache sparkaws s3
Use Amazon Athena with Spark SQL for your open-source transactional table formatsJanuary 24, 2024 byPathik Shah, Raj Devnathblogapache hudiawsbeginneraws glueaws athenatime travel queryclusteringcompactionaws s3apache icebergdelta lake
Data Engineering: Bootstrapping Data lake with Apache HudiJanuary 20, 2024 byKrishna Prasadblogapache hudimediumbeginnerETLaws glueapache sparkaws s3
Enforce fine-grained access control on Open Table Formats via Amazon EMR integrated with AWS Lake FormationJanuary 17, 2024 byRaymond Lai, Aditya Shah, Bin Wang, and Melody Yangblogapache hudiawsintermediateamazon emraws lake formationaws glueaws s3amazon sagemakeraws cloud9amazon athenaaccess control
In-House Data Lake with CDC Processing, Hudi, DockerJanuary 11, 2024 byRahulblogapache hudimediumintermediatedockercdcapache kafkadebeziumapache sparkaws s3
Build Your First Hudi Lakehouse with AWS S3 and AWS GlueDecember 19, 2022 byNadine Farahhow-touse-caseapache hudiaws s3aws glue