182 posts tagged with "guide"
![](https://hudi.apache.org/assets/images/video_blogs/2024-06-21-Four-Different-Ways-to-fetch-Apache-Hudi-Commit-time-in-Python-and-PySpark.png)
Four Different Ways to fetch Apache Hudi Commit time in Python and PySpark
![](https://hudi.apache.org/assets/images/video_blogs/2024-06-18-learn-how-to-ingest-xml-files-with-aws-glue-into-hudi-datalakes.png)
Learn How to Ingest XML files with AWS Glue into Hudi Datalakes | Step by Step guide
![](https://hudi.apache.org/assets/images/video_blogs/2024-06-16-hudi-with-spark-sql-for-beginners-insert-updates-delete-incremental-query-stored-procedures.png)
Hudi with Spark SQL for Beginners | Insert | Updates | Delete | Incremental Query | Stored Procedures
![](https://hudi.apache.org/assets/images/video_blogs/2024-06-15-how-we-utilized-hudis-time-travel-query-to-investigate-bid-and-spend.png)
How we Utilized Hudi's Time Travel Query to Investigate Bid and Spend | Going Back in Time with Hudi
![](https://hudi.apache.org/assets/images/video_blogs/2024-06-12-hudi-cleaning-process-hoodie.keep.min.commits-and-hoodie.keep.max.commits-explained.png)
Hudi Cleaning Process | hoodie.keep.min.commits and hoodie.keep.max.commits Explained
![](https://hudi.apache.org/assets/images/video_blogs/2024-06-05-multiple-spark-writers-to-hudi-tables.png)
Multiple Spark Writers to Hudi tables | Hands on Labs
![](https://hudi.apache.org/assets/images/video_blogs/2024-05-25-learn-how-to-ingest-data-from-pulsar-topic-into-hudi-with-deltastreamer.png)
Learn How to Ingest data from Pulsar Topic into Hudi with DeltaStreamer | Hands on Labs
![](https://hudi.apache.org/assets/images/video_blogs/2024-05-23-build-hudi-date-dimension-in-minutes-with-spark-sql-minio-and-query-with-trino.png)
Build Hudi Date Dimension in Minutes with Spark SQL Minio and Query with Trino
![](https://hudi.apache.org/assets/images/video_blogs/2024-05-22-hudi-delta-streamer-implementing-slowly-changing-dimension-and-query-that-using-trino.png)
Demo Video : Hudi Delta Streamer Implementing Slowly Changing Dimension and Query that using Trino
![](https://hudi.apache.org/assets/images/video_blogs/2024-05-22-hudi-streamer-implementing-slowly-changing-dimension-type-2-and-query-real-time-trino.png)
Hudi Streamer implementing Slowly Changing Dimension Type 2 and Query Real Time Trino | Hands on
![](https://hudi.apache.org/assets/images/video_blogs/2024-05-20-deltastreamer-with-incremental-etl-and-broadcast-joins-for-faster-etl.png)
DeltaStreamer with incremental ETL and Broadcast Joins for Faster ETL
![](https://hudi.apache.org/assets/images/video_blogs/2024-05-18-Learn-How-to-use-Cloudwatch-metrics-with-Hudi-AWS-Glue-Jobs.png)
Learn How to use Cloudwatch metrics with Hudi AWS Glue Jobs
![](https://hudi.apache.org/assets/images/video_blogs/2024-05-12-Unleashing-the-Power-of-Serverless-Serving-Gold-Hudi-Tables-with-AWS-Lambda.png)
Unleashing the Power of Serverless: Serving Gold Hudi Tables with AWS Lambda
![](https://hudi.apache.org/assets/images/video_blogs/2024-05-08-How-to-read-Hudi-Dataset-Using-AWS-Glue-Ray-and-Glue-Notebooks-without-Spark.png)
How to read Hudi Dataset Using AWS Glue Ray and Glue Notebooks (without Spark)
![](https://hudi.apache.org/assets/images/video_blogs/2024-05-04-Learn-How-to-Display-Data-From-Hudi-Tables-to-your-Frontend-with-Flask-and-Daft-NO-SPARK-NEEDED.png)
Learn How to Display Data From Hudi Tables to your Frontend with Flask and Daft (NO SPARK NEEDED)
![](https://hudi.apache.org/assets/images/video_blogs/2024-04-22-Hudi-with-Kyuubi-a-distributed-and-multi-tenant-gateway-to-provide-serverless-SQL-on-lakehouses.png)
Hudi with Kyuubi, a distributed & multi-tenant gateway, to provide serverless SQL on lakehouses
![](https://hudi.apache.org/assets/images/video_blogs/2024-04-10-Build-Universal-Data-lake-with-MySQL-+-Debezium+Kafka+DeltaSTreamer-+-Minio+HiveMetastore+Trino.png)
Build Universal Data lake with MySQL + Debezium + Kafka + DeltaStreamer + MinIO + HiveMetastore + Trino
![](https://hudi.apache.org/assets/images/video_blogs/2024-04-06-Build-Universal-Data-lake-with-Posgres-+-Debezium+Kafka+DeltaSTreamer-+-Minio+HiveMetastore+Trino.png)
Build Universal Data lake with Postgres + Debezium + Kafka + DeltaStreamer + MinIO + HiveMetastore + Trino
![](https://hudi.apache.org/assets/images/video_blogs/2024-04-03-Reading-Data-from-Hudi-INC-and-Joining-with-Delta-Tables-using-HudiStreamer-and-SQL-Based-Transformer.png)
Reading Data from Hudi INC & Joining with Delta Tables using HudiStreamer & SQL-Based Transformer
![](https://hudi.apache.org/assets/images/video_blogs/2024-03-30-Building-DataLakeHouse-using-XTableMinIO-StarRocks-DeltaStreamer---Interoperating-Hudi-IceBerg-and-Delta.png)
Building DataLakeHouse: XTable, MinIO, StarRocks, DeltaStreamer - Interoperating Hudi, Iceberg, Delta
![](https://hudi.apache.org/assets/images/video_blogs/2024-03-29-Open-Lakehouse-Evolution-Powering-the-Future-with-YugabyteDB-and-Apache-Hudi-Episode-102.png)
Open Lakehouse Evolution: Powering the Future with YugabyteDB & Apache Hudi | Episode 102
![](https://hudi.apache.org/assets/images/video_blogs/2024-03-20-How-to-perform-Backfilling-jobs-with-Hudi-DeltaStreamer-and-Spark-SQL-using-SqlSource-Class.png)
How to perform Backfilling jobs with Hudi DeltaStreamer and Spark SQL using SqlSource Class
![](https://hudi.apache.org/assets/images/video_blogs/2024-03-18-Mastering-Incremental-ETL-with-DeltaStreamer-and-SQL-Based-Transformer.png)
Mastering Incremental ETL with DeltaStreamer and SQL-Based Transformer
![](https://hudi.apache.org/assets/images/video_blogs/2024-03-12-Managing-Updates-&-Deletes-in-Glue-Hudi-Spark-Jobs-with-CDC-Data:-Using-_hoodie_is_deleted-Flag.png)
Managing Updates & Deletes in Glue Hudi Spark Jobs with CDC Data
![](https://hudi.apache.org/assets/images/video_blogs/2024-03-11-Getting-Started-Tutorial-Building-a-Data-Lakehouse-With-StarRocks-Apache-Hudi-and-MinIO.png)
Getting Started Tutorial: Building a Data Lakehouse With StarRocks, Apache Hudi, and MinIO
![](https://hudi.apache.org/assets/images/video_blogs/2024-03-01-How-to-Query-Apache-Hudi-tables-from-Glue-Interactive-Notebook-for-AdHoc-Analysis.png)
How to Query Apache Hudi tables from Glue Interactive Notebook for AdHoc Analysis
![](https://hudi.apache.org/assets/images/video_blogs/2024-02-27-Learn-How-you-can-run-DeltaStreamer-Running-on-AWS-Glue-with-Hudi-0-14-Step-by-Step-Guide.png)
Learn How you can run DeltaStreamer Running on AWS Glue with Hudi 0.14 Step by Step Guide
![](https://hudi.apache.org/assets/images/video_blogs/2024-02-23-Getting-Started-with-Open-Data-lineage-Marquez-Project-Apache-Hudi-Spark-jobs.png)
Getting Started with Open Data lineage | Marquez Project | Apache Hudi Spark jobs
![](https://hudi.apache.org/assets/images/video_blogs/2024-02-18-Build-Incremental-ETL-pipeline-with-Hudi-and-Airflow-and-MinIO.png)
Build Incremental ETL pipeline with Hudi and Airflow and MinIO
![](https://hudi.apache.org/assets/images/video_blogs/2024-02-17-Learn-How-to-Integerate-Hudi-Spark-job-with-Airflow-and-MinIO-Hands-on-Labs.png)
Learn How to Integrate Hudi Spark job with Airflow and MinIO | Hands on Labs
![](https://hudi.apache.org/assets/images/video_blogs/2024-02-10-Data-Ingestion-to-Visualization-Hudi-MinIO-StarRocks-HiveMetaStore-Apache-SuperSet-Hands-on-Guide.png)
Data Ingestion to Visualization: Hudi + MinIO + StarRocks + HiveMetaStore + Apache SuperSet Hands on Guide
![](https://hudi.apache.org/assets/images/video_blogs/2024-02-07-Building-an-Open-Source-Data-Lake-House-with-Hudip-Postgres-Hive-Metastore-Minio-and-StarRocks.png)
Building an Open Source Data Lake House with Hudi, Postgres Hive Metastore, Minio, and StarRocks
![](https://hudi.apache.org/assets/images/video_blogs/2024-02-03-Apache-Hudi-Table-Services-Export-Services-HoodieSnapshotExporter-Hands-on-labs.png)
Apache Hudi Table Services | Export Services | HoodieSnapshotExporter | Hands on labs
![](https://hudi.apache.org/assets/images/video_blogs/2024-02-03-Apache-Hudi-Table-Services-Offline-Compaction-HoodieCompactor-Hands-on-labs.png)
Apache Hudi Table Services | Offline Compaction | HoodieCompactor | Hands on labs
![](https://hudi.apache.org/assets/images/video_blogs/2024-01-21-Learn-How-to-Move-Data-From-MongoDB-to-Apache-Hudi-Using-PySpark.png)
Learn How to Move Data From MongoDB to Apache Hudi Using PySpark
![](https://hudi.apache.org/assets/images/video_blogs/2024-01-17-How-to-Delete-Items-from-Hudi-using-Delta-Streamer-operating-in-UPSERT-Mode-with-Kafka-Avro-MSG-12.png)
How to Delete Items from Hudi using Delta Streamer operating in UPSERT Mode with Kafka Avro MSG #12
![](https://hudi.apache.org/assets/images/video_blogs/2024-01-13-Setup-HUDI-with-AWS-Glue-and-MINIO-locally-using-Docker-Container-in-Minutes.png)
Setup HUDI with AWS Glue and MINIO locally using Docker Container in Minutes
![](https://hudi.apache.org/assets/images/video_blogs/2024-01-06-Dynamic-Delta-Streamer-Jobs-with-JDBC-Puller-for-Postgres-Bring-all-Tables-from-particular-Schema-full-video.png)
Dynamic Delta Streamer Jobs with JDBC Puller for Postgres | Bring all Tables from particular Schema- Full Video
![](https://hudi.apache.org/assets/images/video_blogs/2024-01-06-Dynamic-Delta-Streamer-Jobs-with-JDBC-Puller-for-Postgres-Bring-all-Tables-from-particular-Schema.png)
Dynamic Delta Streamer Jobs with JDBC Puller for Postgres | Bring all Tables from particular Schema
![](https://hudi.apache.org/assets/images/video_blogs/2024-01-01-Data-Lake-to-Microservices-Apache-Hudi-Record-Index-FastAPI-Spark-Connect-with-Swagger-UI.png)
Data Lake to Microservices: Apache Hudi's Record Index, FastAPI, Spark Connect with Swagger UI
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-31-What-is-Spark-Connect-and-Getting-started-Spark-Connect-Hello-World.png)
What is Spark Connect and Getting started Spark Connect Hello World
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-30-Step-by-step-guide-on-How-to-Migrate-legacy-COW-Table-on-S3-to-MOR-Table-using-Hudi-CLI.png)
Step by step guide on How to Migrate legacy COW Table on S3 to MOR Table using Hudi CLI
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-29-Get-Started-with-Hudi-CLI-Locally-Using-Docker-in-Minutes-and-Connect-to-Your-S3-Data.png)
Get Started with Hudi CLI Locally Using Docker in Minutes and Connect to Your S3 Data
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-25-Hudi-DBT-Spark-Glue-Hive-MetaStore-Join-two-hudi-tables-Labs-with-Exercise-Files.png)
Hudi + DBT + Spark + Glue Hive MetaStore | Join two hudi tables Labs with Exercise Files
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-24-Apache-Hudi-Spark-DBT-Glue-Hive-MetaStore-Setup-Locally-in-Minutes-Hands-On-Exercise.png)
Apache Hudi, Spark, DBT, Glue Hive MetaStore Setup | Locally | in Minutes – Hands-On Exercise!
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-19-How-to-Use-Apache-Hudi-0-14-and-RLI-on-AWS-Glue-Step-by-Step-Guide.png)
How to Use Apache Hudi 0.14 and RLI (record level index) on AWS Glue Step by Step Guide
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-16-Learn-How-to-Setup-Hudi-on-EMR-with-Hive-and-Query-Data-using-Hue-and-Presto-CLI-Hands-on-Labs.png)
Learn How to Setup Hudi on EMR with Hive and Query Data using Hue and Presto CLI Hands on Labs
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-12-Apache-Hudi-DeltaStreamer-in-Action-Python-Publishing-and-AvroKafkaSource-Consumption-11-Guide.png)
Apache Hudi Delta Streamer in Action: Python Publishing and AvroKafkaSource Consumption (#11 Guide)
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-11-Simplifying-Big-Data-Setting-Up-SparkSQL-Hive-Thrift-Server-and-Hudi-with-Beeline-in-Minutes.png)
Simplifying Big Data: Setting Up Spark SQL, Hive Thrift Server, and Hudi with Beeline in Minutes
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-09-Learn-How-to-use-DBT-with-Spark-and-Thrift-Server-on-Local-Machine-for-Begineers-Easy-Setup.png)
Learn How to use DBT with Spark and Thrift Server on Local Machine for Beginners Easy Setup
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-08-How-to-use-DeltaStreamer-to-Read-Data-From-Hudi-Source-in-Incremental-Fashion-Bronze-to-Silver-10.png)
How to use DeltaStreamer to Read Data From Hudi Source in Incremental Fashion (Bronze to Silver) #10
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-30-Learn-How-to-use-MinIO-and-Apache-Hudi-DeltaStreamer-with-Hands-on-Lab-9.png)
Learn How to use MinIO and Apache Hudi Delta Streamer with Hands on Lab #9
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-27-Hudi-Metadata-table-Record-Level-Index-HBase-Index.png)
Hudi Metadata table, Record Level Index, HBase Index
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-27-Learn-How-to-Run-Clustering-in-Async-Mode-with-DeltaStreamer-in-Continuous-Mode-Hands-on-Labs-8.png)
Learn How to Run Clustering in Async Mode with Delta Streamer in Continuous Mode | Hands on Labs | #8
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-26-real-time-data-postgres-debezium-kafka-schema-registry-deltastreamer-7a.png)
Real-Time Data: Postgres, Debezium, Kafka, Schema Registry, Delta Streamer #7A
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-26-real-time-data-postgres-debezium-kafka-schema-registry-deltastreamer-7b.png)
Real-Time Data: Postgres, Debezium, Kafka, Schema Registry, DeltaStreamer #7B
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-24-Learn-How-to-use-DeltaStreamer-and-ingest-data-from-Kafka-Topic-Hands-on-Labs-6.png)
Learn How to use DeltaStreamer and ingest data from Kafka Topic Hands on Labs #6
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-24-hudi-table-types.png)
Hudi Table Types
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-23-Learn-How-to-Ingest-Data-Into-Hudi-Table-using-DeltaStreamer-in-continous-Mode-and-SQL-transformer-5.png)
Learn How to Ingest Data Into Hudi Table using Delta Streamer in continuous Mode & SQL transformer #5
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-21-RFC-14-Step-by-Step-Guide-for-Incremental-Data-Pull-from-Postgres-to-Hudi-using-deltastreamer.png)
RFC-14: Step-by-Step Guide for Incremental Data Pull from Postgres to Hudi using DeltaStreamer (#4)
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-20-Hudi-Streamer-Hands-On-Guide-Local-Ingestion-from-CSV-Source-2.png)
Hudi Streamer (Delta Streamer) Hands-On Guide: Local Ingestion from CSV Source #2
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-20-Learn-How-to-Ingest-Multiple-Tables-using-Hudi-MultiTable-Delta-Streamer-3.png)
Learn How to Ingest Multiple Tables using Hudi MultiTable Delta Streamer #3
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-19-Hudi-Streamer-Hands-On-Guide-Local-Ingestion-from-Parquet-Source-1.png)
Hudi Streamer (Delta Streamer) Hands-On Guide: Local Ingestion from Parquet Source #1
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-17-Maximizing-Efficiency-by-Templating-Serverless-Architecture-in-Hudi-Data-Lakes.png)
Maximizing Efficiency by Templating Serverless Architecture in Hudi Data Lakes
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-08-A-Glide-Skip-or-a-Jump-Efficiently-Stream-Data-into-Your-Medallion-Architecture-with-Apache-Hudi.png)
A Glide, Skip or a Jump: Efficiently Stream Data into Your Medallion Architecture with Apache Hudi
![](https://hudi.apache.org/assets/images/video_blogs/2023-10-28-How-to-Unlock-Data-Insights-from-Hudi-Metrics-for-Your-Data-Lake-using-Elastic-Search-and-Kibana.png)
How to Unlock Data Insights from Hudi Metrics for Your Data Lake using Elastic Search and Kibana
![](https://hudi.apache.org/assets/images/video_blogs/2023-10-21-Full-Apache-Hudi-Course-for-beginner-Operations-Type-Part-5.png)
Full Apache Hudi Course for beginners | Operations Type | Part 5
![](https://hudi.apache.org/assets/images/video_blogs/2023-10-16-Hudi-0-14-0-Deep-Dive-Record-Level-Index.png)
[LIVE] Hudi 0.14.0 Deep Dive: Record Level Index
![](https://hudi.apache.org/assets/images/video_blogs/2023-10-14-Accelerating-Data-Processing-Leveraging-Apache-Hudi-with-DynamoDB-for-Faster-Commit-Time-Retrieval.png)
Accelerating Data Processing: Leveraging Apache Hudi with DynamoDB for Faster Commit Time Retrieval
![](https://hudi.apache.org/assets/images/video_blogs/2023-10-07-Hudi-Latest-Feature-Auto-Generating-Primary-Keys-for-Modern-Data-Lakes.png)
Hudi's Latest Feature: Auto-Generating Primary Keys for Modern Data Lakes
![](https://hudi.apache.org/assets/images/video_blogs/2023-09-27-Learn-How-to-Use-Apache-Flink-with-Kafka-Build-Transactional-Datalakes-on-S3-using-PyFLink-Locally.png)
Learn How to Use Apache Flink with Kafka & Build Transactional Datalakes on S3 using PyFlink Locally
![](https://hudi.apache.org/assets/images/video_blogs/2023-09-26-How-to-Ingest-Data-from-PostgreSQL-into-Hudi-Tables-on-S3-with-Apache-Flink-CDC-Connector-Python.png)
How to Ingest Data from PostgreSQL into Hudi Tables on S3 with Apache Flink CDC Connector & Python
![](https://hudi.apache.org/assets/images/video_blogs/2023-09-25-How-to-Use-Apache-Hudi-with-Flink-1-15-on-AWS-Managed-Apache-Flink-Hands-on-Guide-for-Beginners.png)
How to Use Apache Hudi with Flink 1.15 on AWS Managed Apache Flink | Hands on Guide for Beginners
![](https://hudi.apache.org/assets/images/video_blogs/2023-09-23-Flink-with-POSTGRES-RealTime-Stream-Data-Processing-with-Python-Hands-on-Labs.png)
Flink (CDC) with POSTGRES RealTime Stream Data Processing with Python Hands on Labs
![](https://hudi.apache.org/assets/images/video_blogs/2023-08-29-From-Zero-to-Data-Hero-Building-Dynamic-Data-Platforms-Like-a-Pro-Final-Part-Demo.png)
From Zero to Data Hero: Building Dynamic Data Platforms Like a Pro 🚀📊 Final Part Demo
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Easy Step by Step Guide for Beginner Ingest CSV Files into Hudi with AWS Glue | Hands on Labs
![](https://hudi.apache.org/assets/images/video_blogs/2023-08-06-Easy_Step_by_Step_Guide_for_Beginner_Setup_AWS_Transfer_Family_SFTP_with_S3.png)
Easy Step by Step Guide for Beginner Setup AWS Transfer Family - SFTP with S3
![](https://hudi.apache.org/assets/images/video_blogs/2023-08-03-Powering_EventDriven_Workloads_with_Hudi_Read_Stream_AWS_Glue_Streaming_JOBS.png)
Powering Event-Driven Workloads with Hudi Read Stream & AWS Glue Streaming JOBS!
![](https://hudi.apache.org/assets/images/video_blogs/2023-08-01-Building_and_Automating_Hudi_Medallion_Architecture_with_AWS_Glue_Workflow_Hands_on_Labs_StepbyStep.png)
Building and Automating Hudi Medallion Architecture with AWS Glue Workflow Hands on Labs StepbyStep
![](https://hudi.apache.org/assets/images/video_blogs/2023-07-28-Removing_Duplicates_in_Hudi_Partitions_with_InsertOverwrite_API_and_Spark_SQL.png)
Removing Duplicates in Hudi Partitions with Insert_Overwrite API and Spark SQL
![](https://hudi.apache.org/assets/images/video_blogs/2023-07-22-learn_How_to_use_AWS_Glue_Crawler_with_Hudi_Tables_to_Catlog_the_Data.png)
Learn How to use AWS Glue Crawler with Hudi Tables to Catalog the Data
![](https://hudi.apache.org/assets/images/video_blogs/2023-07-09-Develop_Incremental_ETL_Pipeline_From_Hudi_Tables_to_Redshift_Using_AWS_Glue_and_Spark.png)
Develop Incremental ETL Pipeline From Hudi Tables to Redshift Using AWS Glue and Spark
![](https://hudi.apache.org/assets/images/video_blogs/2023-07-09-Incremental_Data_Extraction_from_Postgres_using_Triggers_and_PySpark.png)
Incremental Data Extraction from Postgres using Triggers and PySpark
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Building Lakehouse using Hudi | Apache Hudi | Data Lakehouse | Hudi | Apache
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Learn About Apache Hudi Pre Commit Validator with Hands on Lab
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
SNS + Lambda: How to Trigger Lambda Functions from SNS using Message Filtering
![](https://hudi.apache.org/assets/images/video_blogs/2023-06-10-How_to_read_data_from_Multiple_Hudi_Tables_Join_them_and_insert_into_DynamoDB_with_AWS_Glue.png)
How to read data from Multiple Hudi Tables Join them and insert into DynamoDB with AWS Glue
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
How Data Scientist & Data Engineer Can Query Hudi Tables with Athena Spark Notebook for Adhoc Analysis
![](https://hudi.apache.org/assets/images/video_blogs/2023-06-07-Learn_How_to_delete_Partition_in_Apache_Hudi_on_AWS_Glue_Hands_on.png)
Learn | How to delete Partition in Apache Hudi on AWS Glue | Hands on
![](https://hudi.apache.org/assets/images/video_blogs/2023-06-05-How_to_JOIN_Hudi_Tables_in_Incremental_fashion_with_DynamoDB_in_AWS_GLue_Hands_on_Lab_for_Begineer.png)
How to JOIN Hudi Tables in Incremental fashion with DynamoDB in AWS Glue | Hands on Lab for Beginner
![](https://hudi.apache.org/assets/images/video_blogs/2023-05-27-Automate_alerting_and_reporting_for_AWS_Glue_job_resource_usage.png)
Automate alerting and reporting for AWS Glue job resource usage
![](https://hudi.apache.org/assets/images/video_blogs/2023-05-21-How_to_Set_Up_AWS_Glue_Locally.png)
How to Set Up AWS Glue Locally with Docker: Accessing Glue Database & Table in Your Local Environment
![](https://hudi.apache.org/assets/images/video_blogs/2023-05-20-Mastering_File_Sizing_in_Hudi_Boosting_Performance_and_Efficiency.png)
Mastering File Sizing in Hudi: Boosting Performance and Efficiency
![](https://hudi.apache.org/assets/images/video_blogs/2023-05-19-Hands-On-Lab_-Unleashing-Efficiency-and-Flexibility-with-Partial-Updates-in-Apache-Hudi.png)
Hands-On Lab: Unleashing Efficiency and Flexibility with Partial Updates in Apache Hudi
![](https://hudi.apache.org/assets/images/video_blogs/2023-05-16-Unify-Your-Event-Data_Guide-to-Mapping-Events-to-Standardized-Format-with-Incremental-ETL-using-Hudi.png)
Unify Your Event Data: Guide to Mapping Events to Standardized Format with Incremental ETL using Hudi
![](https://hudi.apache.org/assets/images/video_blogs/2023-05-13-EMR-Serverless-Made-Easy_-Submitting-Hive-SQL-Queries-for-Beginners-with-NYC-Taxi-Dataset.png)
EMR Serverless Made Easy: Submitting Hive SQL Queries for Beginners with NYC Taxi Dataset
![](https://hudi.apache.org/assets/images/video_blogs/2023-05-11-EMR_Serverless_for_Beginners_Ingest_Data_incrementally_Submit_Spark_Job_with_EMRCLI_Data_lake.png)
EMR Serverless for Beginners | Ingest Data incrementally | Submit Spark Job with EMR-CLI | Data lake
![](https://hudi.apache.org/assets/images/video_blogs/2023-05-07-Maximizing_Efficiency_DataLake_Hudi_Glue_ETL_Jobs_with_Templated_Approach_Serverless_Architecture.png)
Maximizing Efficiency DataLake (Hudi) Glue ETL Jobs with Templated Approach & Serverless Architecture
![](https://hudi.apache.org/assets/images/video_blogs/2023-05-06-How_to_Build_Your_Own_Version_of_AWS_Glue_Bookmark_to_get_Only_New_Incremental_Files.png)
How to Build Your Own Version of AWS Glue Bookmark to get Only New Incremental Files
![](https://hudi.apache.org/assets/images/video_blogs/2023-05-03-Build_deploy_and_run_Spark_jobs_on_Amazon_EMR_with_the_opensource_EMR_CLI_tool.png)
Build, deploy, and run Spark jobs on Amazon EMR with the open-source EMR CLI tool
![](https://hudi.apache.org/assets/images/video_blogs/2023-05-03-Mastering_Slowly_Changing_Dimension_with_Hudi_A_StepbyStep_Guide_to_Efficient_Data_Management.png)
Mastering Slowly Changing Dimension with Hudi: A Step-by-Step Guide to Efficient Data Management
![](https://hudi.apache.org/assets/images/video_blogs/2023-05-01-Building_a_Scalable_and_Resilient_Streaming_ETL_Pipeline_with_Hudi_s_Incremental_Processing_1.png)
Building a Scalable and Resilient Streaming ETL Pipeline with Hudi's Incremental Processing #1
![](https://hudi.apache.org/assets/images/video_blogs/2023-04-29-Efficiently_Managing_Ride_Late_Arriving_Tips_Data_with_Incremental_ETL_using_Apache_Hudi_Hands_On.png)
Efficiently Managing Ride & Late Arriving Tips Data with Incremental ETL using Apache Hudi: Hands On
![](https://hudi.apache.org/assets/images/video_blogs/2023-04-26-From_Raw_Data_to_Insights_Building_a_Lake_House_with_Hudi_and_Star_Schema_Step_by_Step_Guide.png)
From Raw Data to Insights: Building a Lake House with Hudi and Star Schema | Step by Step Guide
![](https://hudi.apache.org/assets/images/video_blogs/2023-04-25-Joining_Hudi_Raw_Tables_for_Powerful_Data_Analysis_with_Spark_SQL.png)
Joining Hudi Raw Tables for Powerful Data Analysis with Spark SQL
![](https://hudi.apache.org/assets/images/video_blogs/2023-04-20-Effortlessly_Sync_Your_JDBC_Source_to_Hudi_Transactional_Datalake_No_DMS_or_Debezium_Required.png)
Effortlessly Sync Your JDBC Source to Hudi Transactional Datalake: No DMS or Debezium Required!
![](https://hudi.apache.org/website/static/assets/images/video_blogs/2023-04-12-Efficient_Data_Ingestion_with_Glue_Concurrency_and_Hudi_Data_Lake.png)
Efficient Data Ingestion with Glue Concurrency and Hudi Data Lake
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Journey to Hudi Transactional Data Lake Mastery: How I Learned and Succeeded
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Learn about Apache Hudi Transformers with Hands on Lab
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Bootstrapping in Apache Hudi on EMR Serverless with Lab
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Understanding Clustering in Apache Hudi and the Benefits of Asynchronous Clustering
![](https://hudi.apache.org/assets/images/video_blogs/2023-04-07-Advantages_of_Metadata_Indexing_and_Asynchronous_Indexing_in_Hudi_Hands_on_Lab.png)
Advantages of Metadata Indexing and Asynchronous Indexing in Hudi Hands on Lab
![](https://hudi.apache.org/assets/images/video_blogs/2023-04-06-Efficient_Data_Lake_Management_with_Apache_Hudi_Cleaner_Benefits_of_Scheduling_Data_Cleaning_1.png)
Efficient Data Lake Management with Apache Hudi Cleaner: Benefits of Scheduling Data Cleaning #1
![](https://hudi.apache.org/assets/images/video_blogs/2023-04-06-Efficient_Data_Lake_Management_with_Apache_Hudi_Cleaner_Benefits_of_Scheduling_Data_Cleaning_1.png)
Efficient Data Lake Management with Apache Hudi Cleaner: Benefits of Scheduling Data Cleaning #2
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Getting Alerts when Hudi Delta Streamer Fails with Event Driven Approach using Lambdas & Event Bridge
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Running Apache Hudi Delta Streamer On EMR Serverless Hands on Lab step by step guide
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Learn How to Integrate Apache Hudi with Redshift Spectrum Hands on Labs with Code
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-30-Project_Using_Apache_Hudi_Deltastreamer_and_AWS_DMS_Hands_on_Lab_Part_1.png)
Project: Using Apache Hudi Deltastreamer and AWS DMS Hands on Lab # Part 5
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-30-Project_Using_Apache_Hudi_Deltastreamer_and_AWS_DMS_Hands_on_Lab_Part_1.png)
Project: Using Apache Hudi Deltastreamer and AWS DMS Hands on Lab # Part 1
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-30-Project_Using_Apache_Hudi_Deltastreamer_and_AWS_DMS_Hands_on_Lab_Part_1.png)
Project: Using Apache Hudi Deltastreamer and AWS DMS Hands on Lab # Part 2
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-30-Project_Using_Apache_Hudi_Deltastreamer_and_AWS_DMS_Hands_on_Lab_Part_1.png)
Project: Using Apache Hudi Deltastreamer and AWS DMS Hands on Lab # Part 3
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-30-Project_Using_Apache_Hudi_Deltastreamer_and_AWS_DMS_Hands_on_Lab_Part_1.png)
Project: Using Apache Hudi Deltastreamer and AWS DMS Hands on Lab # Part 4
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
How to use Apache Hudi with AWS Glue Studio Visual Editor | Hands on Lab
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-25-Build_CDC_Pipeline_from_Microsoft_SQL_Server_into_Apache_Hudi_with_AWS_DMS_PART_1.png)
Build CDC Pipeline from Microsoft SQL Server into Apache Hudi with AWS DMS | PART 1
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-25-Build_CDC_Pipeline_from_Microsoft_SQL_Server_into_Apache_Hudi_with_AWS_DMS_PART_1.png)
Build CDC Pipeline from Microsoft SQL Server into Apache Hudi with AWS DMS | PART 2
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-25-Build_CDC_Pipeline_from_Microsoft_SQL_Server_into_Apache_Hudi_with_AWS_DMS_PART_1.png)
Build CDC Pipeline from Microsoft SQL Server into Apache Hudi with AWS DMS | PART 3
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-25-Build_CDC_Pipeline_from_Microsoft_SQL_Server_into_Apache_Hudi_with_AWS_DMS_PART_1.png)
Build CDC Pipeline from Microsoft SQL Server into Apache Hudi with AWS DMS | PART 4
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-25-Build_CDC_Pipeline_from_Microsoft_SQL_Server_into_Apache_Hudi_with_AWS_DMS_PART_1.png)
Build CDC Pipeline from Microsoft SQL Server into Apache Hudi with AWS DMS | PART 5
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-25-Build_CDC_Pipeline_from_Microsoft_SQL_Server_into_Apache_Hudi_with_AWS_DMS_PART_1.png)
Weekend Project | Build CDC Pipeline from Microsoft SQL Server into Apache Hudi #1
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Data Analysis for Apache Hudi Blogs on Medium with Pandas
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-21-RFC_42_Consistent_Hashing_in_Apache_Hudi_MOR_Tables.png)
RFC 42: Consistent Hashing in Apache Hudi MOR Tables
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
RFC - 18: Insert Overwrite in Apache Hudi with Example
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Push Hudi Commit Notification TO HTTP URI with Callback
![](https://hudi.apache.org/assets/images/blog/hudi-lakehouse-architecture-uber.png)
Setting Uber’s Transactional Data Lake in Motion with Incremental ETL Using Apache Hudi
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-15-Learn_About_Bucket_Index_SIMPLE_In_Apache_Hudi_with_lab.png)
Learn About Bucket Index (SIMPLE) In Apache Hudi with lab
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-11-How_do_I_read_data_from_Cross_Account_S3_Buckets_and_Build_Hudi_Datalake_in_Datateam_Account.png)
How do I read data from Cross Account S3 Buckets and Build Hudi Datalake in Datateam Account
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-11-Query_crossaccount_Hudi_Glue_Data_Catalogs_using_Amazon_Athena.png)
Query cross-account Hudi Glue Data Catalogs using Amazon Athena
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
How to Rollback to Previous Checkpoint during Disaster in Apache Hudi using Glue 4.0 Demo
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-04-Develop_Incremental_Pipeline_with_CDC_from_Hudi_to_Aurora_Postgres_Demo_Video.png)
Develop Incremental Pipeline with CDC from Hudi to Aurora Postgres | Demo Video
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Python helper class which makes querying incremental data from Hudi Data lakes easy
![](https://hudi.apache.org/assets/images/video_blogs/2023-02-25-RFC51_Change_Data_Capture_in_Apache_Hudi_like_Debezium_and_AWS_DMS_Hands_on_Labs.png)
RFC-51 Change Data Capture in Apache Hudi like Debezium and AWS DMS Hands on Labs
![](https://hudi.apache.org/assets/images/video_blogs/2023-02-22-Use_Glue_40_to_take_regular_save_points_for_your_Hudi_tables_for_backup_or_disaster_Recovery.png)
Use Glue 4.0 to take regular save points for your Hudi tables for backup or disaster Recovery
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Streaming Ingestion from MongoDB into Hudi with Glue, kinesis&Event bridge&MongoStream Hands on labs
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Create Your Hudi Transaction Datalake on S3 with EMR Serverless for Beginners in fun and easy way
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
How do I Ingest Extremely Small Files into Hudi Data lake with Glue Incremental data processing
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Learn How to restrict Intern from accessing Certain Column in Hudi Datalake with lake Formation
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Writing data quality and validation scripts for a Hudi data lake with AWS Glue and pydeequ| Hands on Lab
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
How to detect and Mask PII data in Apache Hudi Data Lake | Hands on Lab
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
How do I identify Schema Changes in Hudi Tables and Send Email Alert when New Column added/removed
![](https://hudi.apache.org/assets/images/video_blogs/2023-01-17-Cleaner_Service_Save_up_to_40_on_data_lake_storage_costs_Hudi_Labs.png)
Cleaner Service: Save up to 40% on data lake storage costs | Hudi Labs
![](https://hudi.apache.org/assets/images/video_blogs/2023-01-17-Global_Bloom_Index_Remove_duplicates_guarantee_uniquness_Hudi_Labs.png)
Global Bloom Index: Remove duplicates & guarantee uniqueness | Hudi Labs
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
How businesses use Hudi Soft delete features to do soft delete instead of hard delete on Datalake
![](https://hudi.apache.org/assets/images/video_blogs/2023-01-17-Leverage_Apache_Hudi_incremental_query_to_process_new_updated_data_Hudi_Labs.png)
Leverage Apache Hudi incremental query to process new & updated data | Hudi Labs
![](https://hudi.apache.org/assets/images/video_blogs/2023-01-17-Leverage_Apache_Hudi_upsert_to_remove_duplicates_on_a_data_lake_Hudi_Labs.png)
Leverage Apache Hudi upsert to remove duplicates on a data lake | Hudi Labs
![](https://hudi.apache.org/assets/images/video_blogs/2023-01-17-Precomb_Key_Overview_Avoid_dedupes_Hudi_Labs.png)
Precomb Key Overview: Avoid dedupes | Hudi Labs
![](https://hudi.apache.org/assets/images/video_blogs/2023-01-17-Use_Apache_Hudi_for_hard_deletes_on_your_data_lake_for_data_governance_Hudi_Labs.png)
Use Apache Hudi for hard deletes on your data lake for data governance | Hudi Labs
![](https://hudi.apache.org/assets/images/video_blogs/2023-01-15-Real_Time_Streaming_Data_Pipeline_From_Aurora_Postgres_to_Hudi_with_DMS_Kinesis_and_Flink_DEMO.png)
Real Time Streaming Pipeline From Aurora Postgres to Hudi with DMS , Kinesis and Flink |Hands on Lab
![](https://hudi.apache.org/assets/images/video_blogs/2023-01-15-Real_Time_Streaming_Data_Pipeline_From_Aurora_Postgres_to_Hudi_with_DMS_Kinesis_and_Flink_DEMO.png)
Real Time Streaming Data Pipeline From Aurora Postgres to Hudi with DMS , Kinesis and Flink |DEMO
![](https://hudi.apache.org/assets/images/video_blogs/2023-01-13-Build_Real_Time_Low_Latency_Streaming_pipeline_from_DynamoDB_to_Apache_Hudi_using_Kinesis_FlinkLab.png)
Build Real Time Low Latency Streaming pipeline from DynamoDB to Apache Hudi using Kinesis,Flink|Lab
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Build Real Time Streaming Pipeline with Apache Hudi Kinesis and Flink | Hands on Lab
![](https://hudi.apache.org/assets/images/video_blogs/2023-01-01-Streaming_ETL_using_Apache_Flink_joining_multiple_Kinesis_streams_Demo.png)
Streaming ETL using Apache Flink joining multiple Kinesis streams | Demo
![](https://hudi.apache.org/assets/images/video_blogs/2023-01-01-Streaming_ETL_using_Apache_Flink_joining_multiple_Kinesis_streams_Demo.png)
Transaction Hudi Data Lake with Streaming ETL from Multiple Kinesis Streams & Joining using Flink
![](https://hudi.apache.org/assets/images/video_blogs/2022-12-30-Step_by_Step_guide_how_to_setup_VPC_Subnet_Get_Started_with_HUDI_on_EMR_Installation_Guide.png)
Step by Step guide how to setup VPC & Subnet & Get Started with HUDI on EMR | Installation Guide |
![](https://hudi.apache.org/assets/images/video_blogs/2022-12-27-Bring_Data_from_Source_using_Debezium_with_CDC_into_Kafka_S3Sink_Build_Hudi_Datalake_Hands_on_lab.png)
Bring Data from Source using Debezium with CDC into Kafka&S3Sink &Build Hudi Datalake | Hands on lab
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Apache Hudi on Windows Machine Spark 3.3 and hadoop2.7 Step by Step guide and Installation Process
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Let's Build Streaming Solution using Kafka + PySpark and Apache HUDI Hands on Lab with code
![](https://hudi.apache.org/assets/images/video_blogs/2022-12-23-Apache_Hudi_with_DBT_Hands_on_LabTransform_Raw_Hudi_tables_with_DBT_and_Glue_Interactive_Session.png)
Apache Hudi with DBT Hands on Lab.Transform Raw Hudi tables with DBT and Glue Interactive Session
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Learn Schema Evolution in Apache Hudi Transaction Datalake with hands on labs
![](https://hudi.apache.org/assets/images/video_blogs/2022-12-20-Getting_started_with_Kafka_and_Glue_to_Build_Real_Time_Apache_Hudi_Transaction_Datalake.png)
Getting started with Kafka and Glue to Build Real Time Apache Hudi Transaction Datalake
![](https://hudi.apache.org/assets/images/video_blogs/2022-12-19-Build_Production_Ready_Alternative_Data_Pipeline_from_DynamoDB_to_Apache_Hudi_PROJECT_DEMO.png)
Build Production Ready Alternative Data Pipeline from DynamoDB to Apache Hudi | PROJECT DEMO
![](https://hudi.apache.org/assets/images/video_blogs/2022-12-19-Build_Production_Ready_Alternative_Data_Pipeline_from_DynamoDB_to_Apache_Hudi_Step_by_Step_Guide.png)
Build Production Ready Alternative Data Pipeline from DynamoDB to Apache Hudi | Step by Step Guide
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Insert|Update|Read|Write|SnapShot| Time Travel |incremental Query on Apache Hudi datalake (S3)
![](https://hudi.apache.org/assets/images/video_blogs/2022-12-17-Migrate_Certain_Tables_from_ONPREM_DB_using_DMS_into_Apache_Hudi_Transaction_Datalake_with_GlueDemo.png)
Migrate Certain Tables from ONPREM DB using DMS into Apache Hudi Transaction Datalake with Glue|Demo
![](https://hudi.apache.org/assets/images/video_blogs/2022-12-17-Step_by_Step_Guide_on_Migrate_Certain_Tables_from_DB_using_DMS_into_Apache_Hudi_Transaction_Datalake.png)
Step by Step Guide on Migrate Certain Tables from DB using DMS into Apache Hudi Transaction Datalake
![](https://hudi.apache.org/assets/images/video_blogs/2022-12-15-Build_production_Ready_Real_Time_Transaction_Hudi_Datalake_from_DynamoDB_Streams_using_Glue_kinesis.png)
Build production Ready Real Time Transaction Hudi Datalake from DynamoDB Streams using Glue &kinesis
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Build Slowly Changing Dimensions Type 2 (SCD2) with Apache Spark and Apache Hudi | Hands on Labs
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Hands on Lab with using DynamoDB as lock table for Apache Hudi Data Lakes
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
How to convert Existing data in S3 into Apache Hudi Transaction Datalake with Glue | Hands on Lab
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Build Datalakes on S3 with Apache HUDI in an easy way for Beginners with hands on labs | Glue
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Simple 5 Steps Guide to get started with Apache Hudi and Glue 4.0 and query the data using Athena
![](https://hudi.apache.org/assets/images/video_blogs/2022-11-19-Build_a_Spark_pipeline_to_analyze_streaming_data_using_AWS_Glue_Apache_Hudi_S3_and_Athena.png)
Build a Spark pipeline to analyze streaming data using AWS Glue, Apache Hudi, S3 and Athena
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)