Open Table Formats (part-1): Apache Hudi (Hadoop Upserts Deletes and Incrementals)March 16, 2024 by Vivek L Alexbeginner
Building Data Lakes on AWS with Kafka Connect, Debezium, Apicurio Registry, and Apache HudiFebruary 27, 2024 by Gary A. Staffordbeginnerapache kafkadebeziumapicurio registryawsapache sparkhudi streamer
How a POC became a production-ready Hudi data lakehouse through close team collaborationFebruary 12, 2024 by Xiaoxiao Rey and Hussein Awalaleboncoinbeginnergdprdml
Building an Open Source Data Lake House with Hudi, Postgres Hive Metastore, Minio, and StarRocksFebruary 6, 2024 by Soumil Shahbeginnerapache sparkapache hiveminiostarrocksdockerpythonpostgres
Use Amazon Athena with Spark SQL for your open-source transactional table formatsJanuary 24, 2024 by Pathik Shah, Raj Devnathbeginnerqueryingclusteringcompactionapache icebergawsdelta lake
Data Engineering: Bootstrapping Data lake with Apache HudiJanuary 20, 2024 by Krishna Prasadbeginneretlawsapache spark
Learn How to Move Data From MongoDB to Apache Hudi Using PySparkJanuary 20, 2024 by Soumil Shahbeginnermongodbapache spark
Deleting Items from Apache Hudi using Delta Streamer in UPSERT Mode with Kafka Avro MessagesJanuary 18, 2024 by Soumil Shahbeginnerhudi streamerapache kafkaapache avrodml