Apache Hudi: Managing Partition on a petabyte-scale table. February 4, 2024, by Krishna Prasad. Tags: blog, apache hudi, medium, intermediate, partition, aws glue, apache spark, aws s3
Leverage Partition Paths of your data lake tables to Optimize Data Retrieval Costs on the cloud. January 30, 2024, by Krishna Prasad. Tags: blog, apache hudi, medium, intermediate, aws glue, cost, apache spark, partition
Enforce fine-grained access control on Open Table Formats via Amazon EMR integrated with AWS Lake Formation. January 17, 2024, by Raymond Lai, Aditya Shah, Bin Wang, and Melody Yang. Tags: blog, apache hudi, aws, intermediate, amazon emr, aws lake formation, aws glue, aws s3, amazon sagemaker, aws cloud9, amazon athena, access control
In-House Data Lake with CDC Processing, Hudi, Docker. January 11, 2024, by Rahul. Tags: blog, apache hudi, medium, intermediate, docker, cdc, apache kafka, debezium, apache spark, aws s3