Apache Hudi does XYZ (1/10): File pruning with multi-modal indexJune 16, 2025 by Shiyan Xuapache sparkdata lakehouse
Use open table format libraries on AWS Glue 5.0 for Apache SparkDecember 4, 2024 by Sotaro Hikita and Noritaka Sekiyamaannouncementapache sparktable formataws
Mastering Slowly Changing Dimensions with Apache Hudi & Spark SQLOctober 7, 2024 by Sameer Shaikscdapache spark
Apache Hudi, Spark and Minio: Hands-on Lab in DockerOctober 2, 2024 by Sanjeet Shuklaapache sparkminiodocker
Developer Guide: How to Submit Hudi PySpark(Python) Jobs to EMR Serverless (7.1.0) with AWS Glue Hive MetaStoreSeptember 4, 2024 by Soumil Shahapache sparkpythonaws
Cost Optimization Strategies for scalable Data LakehouseMarch 22, 2024 by Suresh Hasundiawsapache sparkdata lakehouseperformancehalodoc
Building Data Lakes on AWS with Kafka Connect, Debezium, Apicurio Registry, and Apache HudiFebruary 27, 2024 by Gary A. Staffordbeginnerapache kafkadebeziumapicurio registryawsapache sparkhudi streamer
Building an Open Source Data Lake House with Hudi, Postgres Hive Metastore, Minio, and StarRocksFebruary 6, 2024 by Soumil Shahbeginnerapache sparkapache hiveminiostarrocksdockerpythonpostgres