
25 posts tagged with "apache spark"
View All Tags

Use open table format libraries on AWS Glue 5.0 for Apache Spark

Apache Hudi, Spark and Minio: Hands-on Lab in Docker

Hands-on with Apache Hudi and Spark

Apache Hudi: From Zero To One (10/10)

Apache Hudi: From Zero To One (9/10)

Building Data Lakes on AWS with Kafka Connect, Debezium, Apicurio Registry, and Apache Hudi

Building an Open Source Data Lake House with Hudi, Postgres Hive Metastore, Minio, and StarRocks

Apache Hudi: Managing Partition on a petabyte-scale table

Leverage Partition Paths of your data lake tables to Optimize Data Retrieval Costs on the cloud

Data Engineering: Bootstrapping Data lake with Apache Hudi
