![](https://hudi.apache.org/assets/images/video_blogs/2024-06-18-learn-how-to-ingest-xml-files-with-aws-glue-into-hudi-datalakes.png)
61 posts tagged with "aws glue"
View All Tags![](https://hudi.apache.org/assets/images/video_blogs/2024-06-18-learn-how-to-ingest-xml-files-with-aws-glue-into-hudi-datalakes.png)
![](https://hudi.apache.org/assets/images/video_blogs/2024-05-18-Learn-How-to-use-Cloudwatch-metrics-with-Hudi-AWS-Glue-Jobs.png)
Learn How to use Cloudwatch metrics with Hudi AWS Glue Jobs
![](https://hudi.apache.org/assets/images/video_blogs/2024-05-08-How-to-read-Hudi-Dataset-Using-AWS-Glue-Ray-and-Glue-Notebooks-without-Spark.png)
How to read Hudi Dataset Using AWS Glue Ray and Glue Notebooks (withouth Spark)
![](https://hudi.apache.org/assets/images/video_blogs/2024-03-12-Managing-Updates-&-Deletes-in-Glue-Hudi-Spark-Jobs-with-CDC-Data:-Using-_hoodie_is_deleted-Flag.png)
Managing Updates & Deletes in Glue Hudi Spark Jobs with CDC Data
![](https://hudi.apache.org/assets/images/video_blogs/2024-03-01-How-to-Query-Apache-Hudi-tables-from-Glue-Interactive-Notebook-for-AdHoc-Analysis.png)
How to Query Apache Hudi tables from Glue Interactive Notebook for AdHoc Analysis
![](https://hudi.apache.org/assets/images/video_blogs/2024-02-27-Learn-How-you-can-run-DeltaStreamer-Running-on-AWS-Glue-with-Hudi-0-14-Step-by-Step-Guide.png)
Learn How you can run DeltaStreamer Running on AWS Glue with Hudi 0.14 Step by Step Guide
![](https://hudi.apache.org/assets/images/video_blogs/2024-01-13-Setup-HUDI-with-AWS-Glue-and-MINIO-locally-using-Docker-Container-in-Minutes.png)
Setup HUDI with AWS Glue and MINIO locally using Docker Container in Minutes
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-25-Hudi-DBT-Spark-Glue-Hive-MetaStore-Join-two-hudi-tables-Labs-with-Exercise-Files.png)
Hudi + DBT + Spark + Glue Hive MetaStore | Join two hudi tables Labs with Exercise Files
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-24-Apache-Hudi-Spark-DBT-Glue-Hive-MetaStore-Setup-Locally-in-Minutes-Hands-On-Exercise.png)
Apache Hudi, Spark, DBT, Glue Hive MetaStore Setup | Locally | in Minutes – Hands-On Exercise!
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-19-How-to-Use-Apache-Hudi-0-14-and-RLI-on-AWS-Glue-Step-by-Step-Guide.png)
How to Use Apache Hudi 0.14 and RLI (record level index) on AWS Glue Step by Step Guide
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-17-Maximizing-Efficiency-by-Templating-Serverless-Architecture-in-Hudi-Data-Lakes.png)
Maximizing Efficiency by Templating Serverless Architecture in Hudi Data Lakes
![](https://hudi.apache.org/assets/images/video_blogs/2023-10-14-Accelerating-Data-Processing-Leveraging-Apache-Hudi-with-DynamoDB-for-Faster-Commit-Time-Retrieval.png)
Accelerating Data Processing: Leveraging Apache Hudi with DynamoDB for Faster Commit Time Retrieval
![](https://hudi.apache.org/assets/images/video_blogs/2023-08-29-From-Zero-to-Data-Hero-Building-Dynamic-Data-Platforms-Like-a-Pro-Final-Part-Demo.png)
From Zero to Data Hero: Building Dynamic Data Platforms Like a Pro 🚀📊 Final Part Demo
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Easy Step by Step Guide for Beginner Ingest CSV Files into Hudi with AWS GLue | Hands on Labs
![](https://hudi.apache.org/assets/images/video_blogs/2023-08-06-Easy_Step_by_Step_Guide_for_Beginner_Setup_AWS_Transfer_Family_SFTP_with_S3.png)
Easy Step by Step Guide for Beginner Setup AWS Transfer Family - SFTP with S3
![](https://hudi.apache.org/assets/images/video_blogs/2023-08-03-Powering_EventDriven_Workloads_with_Hudi_Read_Stream_AWS_Glue_Streaming_JOBS.png)
Powering Event-Driven Workloads with Hudi Read Stream & AWS Glue Streaming JOBS!
![](https://hudi.apache.org/assets/images/video_blogs/2023-08-01-Building_and_Automating_Hudi_Medallion_Architecture_with_AWS_Glue_Workflow_Hands_on_Labs_StepbyStep.png)
Building and Automating Hudi Medallion Architecture with AWS Glue Workflow Hands on Labs StepbyStep
![](https://hudi.apache.org/assets/images/video_blogs/2023-07-09-Develop_Incremental_ETL_Pipeline_From_Hudi_Tables_to_Redshift_Using_AWS_Glue_and_Spark.png)
Develop Incremental ETL Pipeline From Hudi Tables to Redshift Using AWS Glue and Spark
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Building Lakehouse using Hudi | Apache Hudi | Data Lakehouse | Hudi | Apache
![](https://hudi.apache.org/assets/images/video_blogs/2023-06-22-Full_Workshop_Recap_Build_a_rideshare_lakehouse_platform.png)
Full Workshop Recap: Build a ride-share lakehouse platform
![](https://hudi.apache.org/assets/images/video_blogs/2023-06-10-How_to_read_data_from_Multiple_Hudi_Tables_Join_them_and_insert_into_DynamoDB_with_AWS_Glue.png)
How to read data from Multiple Hudi Tables Join them and insert into DynamoDB with AWS Glue
![](https://hudi.apache.org/assets/images/video_blogs/2023-06-07-Learn_How_to_delete_Partition_in_Apache_Hudi_on_AWS_Glue_Hands_on.png)
Learn | How to delete Partition in Apache Hudi on AWS Glue | Hands on
![](https://hudi.apache.org/assets/images/video_blogs/2023-06-05-How_to_JOIN_Hudi_Tables_in_Incremental_fashion_with_DynamoDB_in_AWS_GLue_Hands_on_Lab_for_Begineer.png)
How to JOIN Hudi Tables in Incremental fashion with DynamoDB in AWS GLue | Hands on Lab for Begineer
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
How to Query Hudi Tables in Incremental Fashion and Get only New data on AWS Glue | Hands on Lab
![](https://hudi.apache.org/assets/images/video_blogs/2023-05-31-AWS_and_Apache_Hudi_Workshop_Overview_Build_a_ride_share_lakehouse_platform.png)
AWS and Apache Hudi Workshop Overview: Build a ride share lakehouse platform
![](https://hudi.apache.org/assets/images/video_blogs/2023-05-27-Automate_alerting_and_reporting_for_AWS_Glue_job_resource_usage.png)
Automate alerting and reporting for AWS Glue job resource usage
![](https://hudi.apache.org/assets/images/video_blogs/2023-05-21-How_to_Set_Up_AWS_Glue_Locally.png)
How to Set Up AWS Glue Locally with Docker: Accessing Glue Database & Table in Your LocalEnvironment
![](https://hudi.apache.org/assets/images/video_blogs/2023-05-07-Maximizing_Efficiency_DataLake_Hudi_Glue_ETL_Jobs_with_Templated_Approach_Serverless_Architecture.png)
Maximizing Efficiency DataLake(Hudi) Glue ETL Jobs with Templated Approach &Serverless Architecture
![](https://hudi.apache.org/assets/images/video_blogs/2023-05-06-How_to_Build_Your_Own_Version_of_AWS_Glue_Bookmark_to_get_Only_New_Incremental_Files.png)
How to Build Your Own Version of AWS Glue Bookmark to get Only New Incremental Files
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
How to use Apache Hudi with AWS Glue Studio Visual Editor | Hands on Lab
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-25-Build_CDC_Pipeline_from_Microsoft_SQL_Server_into_Apache_Hudi_with_AWS_DMS_PART_1.png)
Build CDC Pipeline from Microsoft SQL Server into Apache Hudi with AWS DMS | PART 1
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-25-Build_CDC_Pipeline_from_Microsoft_SQL_Server_into_Apache_Hudi_with_AWS_DMS_PART_1.png)
Build CDC Pipeline from Microsoft SQL Server into Apache Hudi with AWS DMS | PART 2
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-25-Build_CDC_Pipeline_from_Microsoft_SQL_Server_into_Apache_Hudi_with_AWS_DMS_PART_1.png)
Build CDC Pipeline from Microsoft SQL Server into Apache Hudi with AWS DMS | PART 3
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-25-Build_CDC_Pipeline_from_Microsoft_SQL_Server_into_Apache_Hudi_with_AWS_DMS_PART_1.png)
Build CDC Pipeline from Microsoft SQL Server into Apache Hudi with AWS DMS | PART 4
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-25-Build_CDC_Pipeline_from_Microsoft_SQL_Server_into_Apache_Hudi_with_AWS_DMS_PART_1.png)
Build CDC Pipeline from Microsoft SQL Server into Apache Hudi with AWS DMS | PART 5
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-25-Build_CDC_Pipeline_from_Microsoft_SQL_Server_into_Apache_Hudi_with_AWS_DMS_PART_1.png)
Weekend Project |Build CDC Pipeline from Microsoft SQL Server into Apache Hudi #1
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-11-Query_crossaccount_Hudi_Glue_Data_Catalogs_using_Amazon_Athena.png)
Query cross-account Hudi Glue Data Catalogs using Amazon Athena
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
How to Rollback to Previous Checkpoint during Disaster in Apache Hudi using Glue 4.0 Demo
![](https://hudi.apache.org/assets/images/video_blogs/2023-03-04-Develop_Incremental_Pipeline_with_CDC_from_Hudi_to_Aurora_Postgres_Demo_Video.png)
Develop Incremental Pipeline with CDC from Hudi to Aurora Postgres | Demo Video
![](https://hudi.apache.org/assets/images/video_blogs/2023-02-22-Use_Glue_40_to_take_regular_save_points_for_your_Hudi_tables_for_backup_or_disaster_Recovery.png)
Use Glue 4.0 to take regular save points for your Hudi tables for backup or disaster Recovery
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
How do I Ingest Extremely Small Files into Hudi Data lake with Glue Incremental data processing
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Writing data quality and validation scripts for a Hudi data lake with AWS Glue and pydeequ| Hands on Lab
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
How to detect and Mask PII data in Apache Hudi Data Lake | Hands on Lab
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
How do I identify Schema Changes in Hudi Tables and Send Email Alert when New Column added/removed
![](https://hudi.apache.org/assets/images/video_blogs/2023-01-17-Leverage_Apache_Hudi_incremental_query_to_process_new_updated_data_Hudi_Labs.png)
Leverage Apache Hudi incremental query to process new & updated data | Hudi Labs
![](https://hudi.apache.org/assets/images/video_blogs/2023-01-17-Leverage_Apache_Hudi_upsert_to_remove_duplicates_on_a_data_lake_Hudi_Labs.png)
Leverage Apache Hudi upsert to remove duplicates on a data lake | Hudi Labs
![](https://hudi.apache.org/assets/images/video_blogs/2023-01-01-Streaming_ETL_using_Apache_Flink_joining_multiple_Kinesis_streams_Demo.png)
Streaming ETL using Apache Flink joining multiple Kinesis streams | Demo
![](https://hudi.apache.org/assets/images/video_blogs/2023-01-01-Streaming_ETL_using_Apache_Flink_joining_multiple_Kinesis_streams_Demo.png)
Transaction Hudi Data Lake with Streaming ETL from Multiple Kinesis Streams & Joining using Flink
![](https://hudi.apache.org/assets/images/video_blogs/2022-12-27-Bring_Data_from_Source_using_Debezium_with_CDC_into_Kafka_S3Sink_Build_Hudi_Datalake_Hands_on_lab.png)
Bring Data from Source using Debezium with CDC into Kafka&S3Sink &Build Hudi Datalake | Hands on lab
![](https://hudi.apache.org/assets/images/video_blogs/2022-12-23-Apache_Hudi_with_DBT_Hands_on_LabTransform_Raw_Hudi_tables_with_DBT_and_Glue_Interactive_Session.png)
Apache Hudi with DBT Hands on Lab.Transform Raw Hudi tables with DBT and Glue Interactive Session
![](https://hudi.apache.org/assets/images/video_blogs/2022-12-20-Getting_started_with_Kafka_and_Glue_to_Build_Real_Time_Apache_Hudi_Transaction_Datalake.png)
Getting started with Kafka and Glue to Build Real Time Apache Hudi Transaction Datalake
![](https://hudi.apache.org/assets/images/video_blogs/2022-12-19-Build_Production_Ready_Alternative_Data_Pipeline_from_DynamoDB_to_Apache_Hudi_PROJECT_DEMO.png)
Build Production Ready Alternative Data Pipeline from DynamoDB to Apache Hudi | PROJECT DEMO
![](https://hudi.apache.org/assets/images/video_blogs/2022-12-19-Build_Production_Ready_Alternative_Data_Pipeline_from_DynamoDB_to_Apache_Hudi_Step_by_Step_Guide.png)
Build Production Ready Alternative Data Pipeline from DynamoDB to Apache Hudi | Step by Step Guide
![](https://hudi.apache.org/assets/images/video_blogs/2022-12-17-Migrate_Certain_Tables_from_ONPREM_DB_using_DMS_into_Apache_Hudi_Transaction_Datalake_with_GlueDemo.png)
Migrate Certain Tables from ONPREM DB using DMS into Apache Hudi Transaction Datalake with Glue|Demo
![](https://hudi.apache.org/assets/images/video_blogs/2022-12-17-Step_by_Step_Guide_on_Migrate_Certain_Tables_from_DB_using_DMS_into_Apache_Hudi_Transaction_Datalake.png)
Step by Step Guide on Migrate Certain Tables from DB using DMS into Apache Hudi Transaction Datalake
![](https://hudi.apache.org/assets/images/video_blogs/2022-12-15-Build_production_Ready_Real_Time_Transaction_Hudi_Datalake_from_DynamoDB_Streams_using_Glue_kinesis.png)
Build production Ready Real Time Transaction Hudi Datalake from DynamoDB Streams using Glue &kinesis
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
How to convert Existing data in S3 into Apache Hudi Transaction Datalake with Glue | Hands on Lab
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Build Datalakes on S3 with Apache HUDI in a easy way for Beginners with hands on labs | Glue
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Simple 5 Steps Guide to get started with Apache Hudi and Glue 4.0 and query the data using Athena
![](https://hudi.apache.org/assets/images/video_blogs/2022-11-19-Build_a_Spark_pipeline_to_analyze_streaming_data_using_AWS_Glue_Apache_Hudi_S3_and_Athena.png)
Build a Spark pipeline to analyze streaming data using AWS Glue, Apache Hudi, S3 and Athena
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)