AWS and Apache Hudi Workshop Overview: Build a ride share lakehouse platformMay 31, 2023 byOnehouseworkshoplakehousedata-lakehouseamazon s3aws glueamazon dynamodbamazon athenaamazon quicksightapache hudi
How to Set Up AWS Glue Locally with Docker: Accessing Glue Database & Table in Your LocalEnvironmentMay 21, 2023 bySoumil Shahguideapache hudidockeraws gluedevelopment setupdatabase
Mastering File Sizing in Hudi: Boosting Performance and EfficiencyMay 20, 2023 bySoumil Shahguideapache hudifile sizinghudi performacnequeryspeedapache parquetamazon s3
Hands-On Lab: Unleashing Efficiency and Flexibility with Partial Updates in Apache HudiMay 19, 2023 bySoumil Shahguideapache hudihands on labincremental processingdata updateapache spark
Unify Your Event Data:Guide to Mapping Events to Standardized Format with Incremental ETL using HudiMay 16, 2023 bySoumil Shahguideapache hudiapache sparkincremental etldata unificationdata processing
EMR Serverless Made Easy: Submitting Hive SQL Queries for Beginners with NYC Taxi DatasetMay 13, 2023 bySoumil Shahguideapache hudiapache hiveamazon emremr serverlesshive sqlhive metastore
EMR Serverless for Beginners: | Ingest Data incrementally | Submit Spark Job with EMR-CLI |Data lakeMay 11, 2023 bySoumil Shahguideapache hudiamazon emremr Serverlessapache sparkdata lakeincremental data processing
Maximizing Efficiency DataLake(Hudi) Glue ETL Jobs with Templated Approach &Serverless ArchitectureMay 7, 2023 bySoumil Shahguideapache hudiaws glueetltemplated architectureserverless
How to Build Your Own Version of AWS Glue Bookmark to get Only New Incremental FilesMay 6, 2023 bySoumil Shahguideapache hudiaws glueincremental processingglue bookmarks
Build, deploy, and run Spark jobs on Amazon EMR with the open-source EMR CLI toolMay 3, 2023 bySoumil Shahguideamazon emr cliapache sparkamazon emr serverlessapache hudiamazon emrcommand line interface