Build Hudi Date Dimension in Minutes with Spark SQL Minio and Query with Trino
May 23, 2024 by Soumil Shah
Tags: guide, beginner, apache hudi, minio, trino, apache hive, hive metastore, spark sql

How to perform Backfilling jobs with Hudi DeltaStreamer and Spark SQL using SqlSource Class
March 20, 2024 by Soumil Shah
Tags: guide, beginner, apache hudi, hudi streamer, deltastreamer, spark sql, backfilling

How to Query Apache Hudi tables from Glue Interactive Notebook for AdHoc Analysis
March 1, 2024 by Soumil Shah
Tags: guide, beginner, apache hudi, aws glue, spark sql, glue notebook, amazon s3

Simplifying Big Data: Setting Up Spark SQL, Hive Thrift Server, and Hudi with Beeline in Minutes
December 11, 2023 by Soumil Shah
Tags: guide, beginner, apache hive, apache thrift, spark sql, apache hudi, beeline, hive metastore

Removing Duplicates in Hudi Partitions with Insert_Overwrite API and Spark SQL
July 28, 2023 by Soumil Shah
Tags: guide, duplicates, de-duplicate, insert overwrite, spark-sql, partition, apache hudi, beginner

Building Lakehouse using Hudi | Apache Hudi | Data Lakehouse | Hudi | Apache
July 1, 2023 by DataCouch
Tags: guide, lakehouse, data lakehouse, spark sql, apache hudi, aws glue, beginner

Joining Hudi Raw Tables for Powerful Data Analysis with Spark SQL
April 25, 2023 by Soumil Shah
Tags: guide, joins, spark sql, apache hudi

Build Datalakes on S3 with Apache HUDI in a easy way for Beginners with hands on labs | Glue
December 11, 2022 by Soumil Shah
Tags: guide, aws glue, amazon athena, apache hudi, spark-sql, amazon s3, beginner