![](https://hudi.apache.org/assets/images/video_blogs/2024-06-21-Four-Different-Ways-to-fetch-Apache-Hudi-Commit-time-in-Python-and-PySpark.png)
85 posts tagged with "beginner"
View All Tags![](https://hudi.apache.org/assets/images/video_blogs/2024-06-21-Four-Different-Ways-to-fetch-Apache-Hudi-Commit-time-in-Python-and-PySpark.png)
![](https://hudi.apache.org/assets/images/video_blogs/2024-06-18-learn-how-to-ingest-xml-files-with-aws-glue-into-hudi-datalakes.png)
Learn How to Ingest XML files with AWS Glue into Hudi Datalakes | Step by Step guide
![](https://hudi.apache.org/assets/images/video_blogs/2024-06-16-hudi-with-spark-sql-for-beginners-insert-updates-delete-incremental-query-stored-procedures.png)
Hudi with Spark SQL for Beginners | Insert| Updates | Delete | incremental Query | Stored procedures
![](https://hudi.apache.org/assets/images/video_blogs/2024-06-15-how-we-utilized-hudis-time-travel-query-to-investigate-bid-and-spend.png)
How we Utilized Hudi's Time Travel Query to Investigate Bid and Spend | Going Back in Time with Hudi
![](https://hudi.apache.org/assets/images/video_blogs/2024-06-12-hudi-cleaning-process-hoodie.keep.min.commits-and-hoodie.keep.max.commits-explained.png)
Hudi Cleaning Process | hoodie.keep.min.commits and hoodie.keep.max.commits Explained
![](https://hudi.apache.org/assets/images/video_blogs/2024-06-05-multiple-spark-writers-to-hudi-tables.png)
Multiple Spark Writers to Hudi tables | Hands on Labs
![](https://hudi.apache.org/assets/images/video_blogs/2024-05-25-learn-how-to-ingest-data-from-pulsar-topic-into-hudi-with-deltastreamer.png)
Learn How to Ingest data from pulsar Topic into Hudi with DeltaStreamer | Hands on Labs
![](https://hudi.apache.org/assets/images/video_blogs/2024-05-23-build-hudi-date-dimension-in-minutes-with-spark-sql-minio-and-query-with-trino.png)
Build Hudi Date Dimension in Minutes with Spark SQL Minio and Query with Trino
![](https://hudi.apache.org/assets/images/video_blogs/2024-05-22-hudi-delta-streamer-implementing-slowly-changing-dimension-and-query-that-using-trino.png)
Demo Video : Hudi Delta Streamer Implementing Slowly Changing Dimension and Query that using Trino
![](https://hudi.apache.org/assets/images/video_blogs/2024-05-22-hudi-streamer-implementing-slowly-changing-dimension-type-2-and-query-real-time-trino.png)
Hudi Streamer implementing Slowly Changing Dimension Type 2 and Query Real Time Trino | Hands on
![](https://hudi.apache.org/assets/images/video_blogs/2024-05-20-deltastreamer-with-incremental-etl-and-broadcast-joins-for-faster-etl.png)
DeltaStreamer with incremental ETL and Broadcast Joins for Faster ETL
![](https://hudi.apache.org/assets/images/video_blogs/2024-05-18-Learn-How-to-use-Cloudwatch-metrics-with-Hudi-AWS-Glue-Jobs.png)
Learn How to use Cloudwatch metrics with Hudi AWS Glue Jobs
![](https://hudi.apache.org/assets/images/video_blogs/2024-05-12-Unleashing-the-Power-of-Serverless-Serving-Gold-Hudi-Tables-with-AWS-Lambda.png)
Unleashing the Power of Serverless: Serving Gold Hudi Tables with AWS Lambda
![](https://hudi.apache.org/assets/images/video_blogs/2024-05-08-How-to-read-Hudi-Dataset-Using-AWS-Glue-Ray-and-Glue-Notebooks-without-Spark.png)
How to read Hudi Dataset Using AWS Glue Ray and Glue Notebooks (withouth Spark)
![](https://hudi.apache.org/assets/images/video_blogs/2024-05-04-Learn-How-to-Display-Data-From-Hudi-Tables-to-your-Frontend-with-Flask-and-Daft-NO-SPARK-NEEDED.png)
Learn How to Display Data From Hudi Tables to your Frontend with Flask and Daft (NO SPARK NEEDED)
![](https://hudi.apache.org/assets/images/video_blogs/2024-04-22-Hudi-with-Kyuubi-a-distributed-and-multi-tenant-gateway-to-provide-serverless-SQL-on-lakehouses.png)
Hudi with Kyuubi, a distributed & multi-tenant gateway, to provide serverless SQL on lakehouses
![](https://hudi.apache.org/assets/images/video_blogs/2024-04-10-Build-Universal-Data-lake-with-MySQL-+-Debezium+Kafka+DeltaSTreamer-+-Minio+HiveMetastore+Trino.png)
Build Universal Data lake with MySQL + Debezium+Kafka+DeltaSTreamer + Minio+HiveMetastore+Trino
![](https://hudi.apache.org/assets/images/video_blogs/2024-04-06-Build-Universal-Data-lake-with-Posgres-+-Debezium+Kafka+DeltaSTreamer-+-Minio+HiveMetastore+Trino.png)
Build Universal Data lake with Posgres + Debezium+Kafka+DeltaSTreamer + Minio+HiveMetastore+Trino
![](https://hudi.apache.org/assets/images/video_blogs/2024-04-03-Reading-Data-from-Hudi-INC-and-Joining-with-Delta-Tables-using-HudiStreamer-and-SQL-Based-Transformer.png)
Reading Data from Hudi INC & Joining with Delta Tables using HudiStreamer & SQL-Based Transformer
![](https://hudi.apache.org/assets/images/video_blogs/2024-03-20-How-to-perform-Backfilling-jobs-with-Hudi-DeltaStreamer-and-Spark-SQL-using-SqlSource-Class.png)
How to perform Backfilling jobs with Hudi DeltaStreamer and Spark SQL using SqlSource Class
![](https://hudi.apache.org/assets/images/video_blogs/2024-03-18-Mastering-Incremental-ETL-with-DeltaStreamer-and-SQL-Based-Transformer.png)
Mastering Incremental ETL with DeltaStreamer and SQL-Based Transformer
![](https://hudi.apache.org/assets/images/video_blogs/2024-03-12-Managing-Updates-&-Deletes-in-Glue-Hudi-Spark-Jobs-with-CDC-Data:-Using-_hoodie_is_deleted-Flag.png)
Managing Updates & Deletes in Glue Hudi Spark Jobs with CDC Data
![](https://hudi.apache.org/assets/images/video_blogs/2024-03-11-Getting-Started-Tutorial-Building-a-Data-Lakehouse-With-StarRocks-Apache-Hudi-and-MinIO.png)
Getting Started Tutorial: Building a Data Lakehouse With StarRocks, Apache Hudi, and MinIO
![](https://hudi.apache.org/assets/images/video_blogs/2024-03-01-How-to-Query-Apache-Hudi-tables-from-Glue-Interactive-Notebook-for-AdHoc-Analysis.png)
How to Query Apache Hudi tables from Glue Interactive Notebook for AdHoc Analysis
![](https://hudi.apache.org/assets/images/video_blogs/2024-02-27-Learn-How-you-can-run-DeltaStreamer-Running-on-AWS-Glue-with-Hudi-0-14-Step-by-Step-Guide.png)
Learn How you can run DeltaStreamer Running on AWS Glue with Hudi 0.14 Step by Step Guide
![](https://hudi.apache.org/assets/images/video_blogs/2024-02-23-Getting-Started-with-Open-Data-lineage-Marquez-Project-Apache-Hudi-Spark-jobs.png)
Getting Started with Open Data lineage | Marquez Project | Apache Hudi Spark jobs
![](https://hudi.apache.org/assets/images/video_blogs/2024-02-18-Build-Incremental-ETL-pipeline-with-Hudi-and-Airflow-and-MinIO.png)
Build Incremental ETL pipeline with Hudi and Airflow and MinIO
![](https://hudi.apache.org/assets/images/video_blogs/2024-02-17-Learn-How-to-Integerate-Hudi-Spark-job-with-Airflow-and-MinIO-Hands-on-Labs.png)
Learn How to Integerate Hudi Spark job with Airflow and MinIO | Hands on Labs
![](https://hudi.apache.org/assets/images/video_blogs/2024-02-10-Data-Ingestion-to-Visualization-Hudi-MinIO-StarRocks-HiveMetaStore-Apache-SuperSet-Hands-on-Guide.png)
Data Ingestion to Visualization: Hudi + MinIO + StarRocks + HiveMetaStore + Apache SuperSet Hands on Guide
![](https://hudi.apache.org/assets/images/video_blogs/2024-02-07-Building-an-Open-Source-Data-Lake-House-with-Hudip-Postgres-Hive-Metastore-Minio-and-StarRocks.png)
Building an Open Source Data Lake House with Hudi, Postgres Hive Metastore, Minio, and StarRocks
![](https://hudi.apache.org/assets/images/video_blogs/2024-02-03-Apache-Hudi-Table-Services-Export-Services-HoodieSnapshotExporter-Hands-on-labs.png)
Apache Hudi Table Services | Export Services | HoodieSnapshotExporter | Hands on labs
![](https://hudi.apache.org/assets/images/video_blogs/2024-02-03-Apache-Hudi-Table-Services-Offline-Compaction-HoodieCompactor-Hands-on-labs.png)
Apache Hudi Table Services | Offline Compaction | HoodieCompactor | Hands on labs
![](https://hudi.apache.org/assets/images/video_blogs/2024-01-21-Learn-How-to-Move-Data-From-MongoDB-to-Apache-Hudi-Using-PySpark.png)
Learn How to Move Data From MongoDB to Apache Hudi Using PySpark
![](https://hudi.apache.org/assets/images/video_blogs/2024-01-17-How-to-Delete-Items-from-Hudi-using-Delta-Streamer-operating-in-UPSERT-Mode-with-Kafka-Avro-MSG-12.png)
How to Delete Items from Hudi using Delta Streamer operating in UPSERT Mode with Kafka Avro MSG #12
![](https://hudi.apache.org/assets/images/video_blogs/2024-01-13-Setup-HUDI-with-AWS-Glue-and-MINIO-locally-using-Docker-Container-in-Minutes.png)
Setup HUDI with AWS Glue and MINIO locally using Docker Container in Minutes
![](https://hudi.apache.org/assets/images/video_blogs/2024-01-06-Dynamic-Delta-Streamer-Jobs-with-JDBC-Puller-for-Postgres-Bring-all-Tables-from-particular-Schema-full-video.png)
Dynamic Delta Streamer Jobs with JDBC Puller for Postgres | Bring all Tables from particular Schema- Full Video
![](https://hudi.apache.org/assets/images/video_blogs/2024-01-06-Dynamic-Delta-Streamer-Jobs-with-JDBC-Puller-for-Postgres-Bring-all-Tables-from-particular-Schema.png)
Dynamic Delta Streamer Jobs with JDBC Puller for Postgres | Bring all Tables from particular Schema
![](https://hudi.apache.org/assets/images/video_blogs/2024-01-01-Data-Lake-to-Microservices-Apache-Hudi-Record-Index-FastAPI-Spark-Connect-with-Swagger-UI.png)
Data Lake to Microservices: Apache Hudi's Record Index, FastAPI, Spark Connect with Swagger UI
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-31-What-is-Spark-Connect-and-Getting-started-Spark-Connect-Hello-World.png)
What is Spark Connect and Getting started Spark Connect Hello World
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-30-Step-by-step-guide-on-How-to-Migrate-legacy-COW-Table-on-S3-to-MOR-Table-using-Hudi-CLI.png)
Step by step guide on How to Migrate legacy COW Table on S3 to MOR Table using Hudi CLI
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-29-Get-Started-with-Hudi-CLI-Locally-Using-Docker-in-Minutes-and-Connect-to-Your-S3-Data.png)
Get Started with Hudi CLI Locally Using Docker in Minutes and Connect to Your S3 Data
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-25-Hudi-DBT-Spark-Glue-Hive-MetaStore-Join-two-hudi-tables-Labs-with-Exercise-Files.png)
Hudi + DBT + Spark + Glue Hive MetaStore | Join two hudi tables Labs with Exercise Files
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-24-Apache-Hudi-Spark-DBT-Glue-Hive-MetaStore-Setup-Locally-in-Minutes-Hands-On-Exercise.png)
Apache Hudi, Spark, DBT, Glue Hive MetaStore Setup | Locally | in Minutes – Hands-On Exercise!
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-19-How-to-Use-Apache-Hudi-0-14-and-RLI-on-AWS-Glue-Step-by-Step-Guide.png)
How to Use Apache Hudi 0.14 and RLI (record level index) on AWS Glue Step by Step Guide
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-16-Learn-How-to-Setup-Hudi-on-EMR-with-Hive-and-Query-Data-using-Hue-and-Presto-CLI-Hands-on-Labs.png)
Learn How to Setup Hudi on EMR with Hive and Query Data using Hue and Presto CLI Hands on Labs
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-12-Apache-Hudi-DeltaStreamer-in-Action-Python-Publishing-and-AvroKafkaSource-Consumption-11-Guide.png)
Apache Hudi Delta Streamer in Action: Python Publishing and AvroKafkaSource Consumption (#11 Guide)
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-11-Simplifying-Big-Data-Setting-Up-SparkSQL-Hive-Thrift-Server-and-Hudi-with-Beeline-in-Minutes.png)
Simplifying Big Data: Setting Up Spark SQL, Hive Thrift Server, and Hudi with Beeline in Minutes
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-09-Learn-How-to-use-DBT-with-Spark-and-Thrift-Server-on-Local-Machine-for-Begineers-Easy-Setup.png)
Learn How to use DBT with Spark and Thrift Server on Local Machine for Begineers Easy Setup
![](https://hudi.apache.org/assets/images/video_blogs/2023-12-08-How-to-use-DeltaStreamer-to-Read-Data-From-Hudi-Source-in-Incremental-Fashion-Bronze-to-Silver-10.png)
How to use DeltaStreamer to Read Data From Hudi Source in Incremental Fashion (Bronze to Silver) #10
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-30-Learn-How-to-use-MinIO-and-Apache-Hudi-DeltaStreamer-with-Hands-on-Lab-9.png)
Learn How to use MinIO and Apache Hudi Delta Streamer with Hands on Lab #9
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-27-Hudi-Metadata-table-Record-Level-Index-HBase-Index.png)
Hudi Metadata table, Record Level Index, HBase Index
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-27-Learn-How-to-Run-Clustering-in-Async-Mode-with-DeltaStreamer-in-Continuous-Mode-Hands-on-Labs-8.png)
Learn How to Run Clustering in Async Mode with Delta Streamer in Continuous Mode | Hands on Labs |#8
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-26-real-time-data-postgres-debezium-kafka-schema-registry-deltastreamer-7a.png)
Real-Time Data: Postgres, Debezium, Kafka, Schema Registry, Delta Streamer #7A
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-26-real-time-data-postgres-debezium-kafka-schema-registry-deltastreamer-7b.png)
Real-Time Data: Postgres, Debezium, Kafka, Schema Registry, DeltaStreamer #7B
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-24-Learn-How-to-use-DeltaStreamer-and-ingest-data-from-Kafka-Topic-Hands-on-Labs-6.png)
Learn How to use DeltaStreamer and ingest data from Kafka Topic Hands on Labs #6
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-24-hudi-table-types.png)
Hudi Table Types
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-23-Learn-How-to-Ingest-Data-Into-Hudi-Table-using-DeltaStreamer-in-continous-Mode-and-SQL-transformer-5.png)
Learn How to Ingest Data Into Hudi Table using Delta Streamer in continous Mode & SQL transformer#5
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-21-RFC-14-Step-by-Step-Guide-for-Incremental-Data-Pull-from-Postgres-to-Hudi-using-deltastreamer.png)
RFC-14: Step-by-Step Guide for Incremental Data Pull from Postgres to Hudi using DeltaStreamer (#4)
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-20-Hudi-Streamer-Hands-On-Guide-Local-Ingestion-from-CSV-Source-2.png)
Hudi Streamer Delta Streamer Hands On Guide: Local Ingestion from CSV Source #2
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-20-Learn-How-to-Ingest-Multiple-Tables-using-Hudi-MultiTable-Delta-Streamer-3.png)
Learn How to Ingest Multiple Tables using Hudi MultiTable Delta Streamer #3
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-19-Hudi-Streamer-Hands-On-Guide-Local-Ingestion-from-Parquet-Source-1.png)
Hudi Streamer (Delta Streamer) Hands-On Guide: Local Ingestion from Parquet Source #1
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-17-Maximizing-Efficiency-by-Templating-Serverless-Architecture-in-Hudi-Data-Lakes.png)
Maximizing Efficiency by Templating Serverless Architecture in Hudi Data Lakes
![](https://hudi.apache.org/assets/images/video_blogs/2023-11-08-A-Glide-Skip-or-a-Jump-Efficiently-Stream-Data-into-Your-Medallion-Architecture-with-Apache-Hudi.png)
A Glide, Skip or a Jump: Efficiently Stream Data into Your Medallion Architecture with Apache Hudi
![](https://hudi.apache.org/assets/images/video_blogs/2023-10-28-How-to-Unlock-Data-Insights-from-Hudi-Metrics-for-Your-Data-Lake-using-Elastic-Search-and-Kibana.png)
How to Unlock Data Insights from Hudi Metrics for Your Data Lake using Elastic Search and Kibana
![](https://hudi.apache.org/assets/images/video_blogs/2023-10-21-Full-Apache-Hudi-Course-for-beginner-Operations-Type-Part-5.png)
Full Apache Hudi Course for beginners | Operations Type | Part 5
![](https://hudi.apache.org/assets/images/video_blogs/2023-10-14-Accelerating-Data-Processing-Leveraging-Apache-Hudi-with-DynamoDB-for-Faster-Commit-Time-Retrieval.png)
Accelerating Data Processing: Leveraging Apache Hudi with DynamoDB for Faster Commit Time Retrieval
![](https://hudi.apache.org/assets/images/video_blogs/2023-10-07-Hudi-Latest-Feature-Auto-Generating-Primary-Keys-for-Modern-Data-Lakes.png)
Hudi's Latest Feature: Auto-Generating Primary Keys for Modern Data Lakes
![](https://hudi.apache.org/assets/images/video_blogs/2023-09-27-Learn-How-to-Use-Apache-Flink-with-Kafka-Build-Transactional-Datalakes-on-S3-using-PyFLink-Locally.png)
Learn How to Use Apache Flink with Kafka & Build Transactional Datalakes on S3 using PyFLink Locally
![](https://hudi.apache.org/assets/images/video_blogs/2023-09-26-How-to-Ingest-Data-from-PostgreSQL-into-Hudi-Tables-on-S3-with-Apache-Flink-CDC-Connector-Python.png)
How to Ingest Data from PostgreSQL into Hudi Tables on S3 with Apache Flink CDC Connector & Python
![](https://hudi.apache.org/assets/images/video_blogs/2023-09-25-How-to-Use-Apache-Hudi-with-Flink-1-15-on-AWS-Managed-Apache-Flink-Hands-on-Guide-for-Beginners.png)
How to Use Apache Hudi with Flink 1.15 on AWS Managed Apache Flink | Hands on Guide for Beginners
![](https://hudi.apache.org/assets/images/video_blogs/2023-09-23-Flink-with-POSTGRES-RealTime-Stream-Data-Processing-with-Python-Hands-on-Labs.png)
Flink (CDC) with POSTGRES RealTime Stream Data Processing with Python Hands on Labs
![](https://hudi.apache.org/assets/images/video_blogs/2023-08-29-From-Zero-to-Data-Hero-Building-Dynamic-Data-Platforms-Like-a-Pro-Final-Part-Demo.png)
From Zero to Data Hero: Building Dynamic Data Platforms Like a Pro 🚀📊 Final Part Demo
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Easy Step by Step Guide for Beginner Ingest CSV Files into Hudi with AWS GLue | Hands on Labs
![](https://hudi.apache.org/assets/images/video_blogs/2023-08-06-Easy_Step_by_Step_Guide_for_Beginner_Setup_AWS_Transfer_Family_SFTP_with_S3.png)
Easy Step by Step Guide for Beginner Setup AWS Transfer Family - SFTP with S3
![](https://hudi.apache.org/assets/images/video_blogs/2023-08-03-Powering_EventDriven_Workloads_with_Hudi_Read_Stream_AWS_Glue_Streaming_JOBS.png)
Powering Event-Driven Workloads with Hudi Read Stream & AWS Glue Streaming JOBS!
![](https://hudi.apache.org/assets/images/video_blogs/2023-08-01-Building_and_Automating_Hudi_Medallion_Architecture_with_AWS_Glue_Workflow_Hands_on_Labs_StepbyStep.png)
Building and Automating Hudi Medallion Architecture with AWS Glue Workflow Hands on Labs StepbyStep
![](https://hudi.apache.org/assets/images/video_blogs/2023-07-28-Removing_Duplicates_in_Hudi_Partitions_with_InsertOverwrite_API_and_Spark_SQL.png)
Removing Duplicates in Hudi Partitions with Insert_Overwrite API and Spark SQL
![](https://hudi.apache.org/assets/images/video_blogs/2023-07-22-learn_How_to_use_AWS_Glue_Crawler_with_Hudi_Tables_to_Catlog_the_Data.png)
learn How to use AWS Glue Crawler with Hudi Tables to Catlog the Data
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Hudi Best Practices: Handling Failed Inserts/Upserts with Error Tables
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Building Lakehouse using Hudi | Apache Hudi | Data Lakehouse | Hudi | Apache
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
SNS + Lambda: How to Trigger Lambda Functions from SNS using Message Filtering
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Create Your Hudi Transaction Datalake on S3 with EMR Serverless for Beginners in fun and easy way
![](https://hudi.apache.org/assets/images/video_blogs/2022-12-30-Step_by_Step_guide_how_to_setup_VPC_Subnet_Get_Started_with_HUDI_on_EMR_Installation_Guide.png)
Step by Step guide how to setup VPC & Subnet & Get Started with HUDI on EMR | Installation Guide |
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)
Apache Hudi on Windows Machine Spark 3.3 and hadoop2.7 Step by Step guide and Installation Process
![](https://hudi.apache.org/assets/images/hudi-video-page-default.png)