Developer Guide: How to Submit Hudi PySpark(Python) Jobs to EMR Serverless (7.1.0) with AWS Glue Hive MetaStore | September 4, 2024 | by Soumil Shah | Tags: blog, apache hudi, pyspark, python, amazon emr, aws glue, linkedin
Use AWS Data Exchange to seamlessly share Apache Hudi datasets | May 22, 2024 | by Saurabh Bhutyani, Ankith Ede, and Chandra Krishnan | Tags: blog, apache hudi, aws data exchange, amazon emr, amazon s3, amazon athena, data sharing, amazon
Cost Optimization Strategies for scalable Data Lakehouse | March 22, 2024 | by Suresh Hasundi | Tags: blog, apache hudi, amazon s3, amazon emr, apache spark, lakehouse, cost optimization, halodoc
Building Data Lakes on AWS with Kafka Connect, Debezium, Apicurio Registry, and Apache Hudi | February 27, 2024 | by Gary A. Stafford | Tags: blog, apache hudi, itnext, beginner, apache kafka, kafka connect, debezium, apicurio registry, aws, apache spark, deltastreamer, hudi streamer, amazon rds, amazon msk, amazon eks, aws glue, amazon emr
Enforce fine-grained access control on Open Table Formats via Amazon EMR integrated with AWS Lake Formation | January 17, 2024 | by Raymond Lai, Aditya Shah, Bin Wang, and Melody Yang | Tags: blog, apache hudi, aws, intermediate, amazon emr, aws lake formation, aws glue, aws s3, amazon sagemaker, aws cloud9, amazon athena, access control