Blogs List Page | Apache Hudi

Skip to main content

What, Why and How : Apache Hudi’s Bloom Index

October 8, 2022 by Sivabalan Narayanan

indexing

Ingest streaming data to Apache Hudi tables using AWS Glue and Apache Hudi DeltaStreamer

October 6, 2022 by Vishal Pathak, Anand Prakash and Noritaka Sekiyama

Data processing with Spark: time traveling

September 28, 2022 by Petrica Leuca

querying

Building Streaming Data Lakes with Hudi and MinIO

September 20, 2022 by Matt Sarrel

Data Lake / Lakehouse Guide: Powered by Data Lake Table Formats (Delta Lake, Iceberg, Hudi)

August 25, 2022 by Simon Späti

Implementation of SCD-2 (Slowly Changing Dimension) with Apache Hudi & Spark

August 24, 2022 by Jayasheel Kalgal, Esha Dhing and Prashant Mishra

Use Flink Hudi to Build a Streaming Data Lake Platform

August 12, 2022 by Chen Yuzhao and Liu Dalong

How NerdWallet uses AWS and Apache Hudi to build a serverless, real-time analytics platform

August 9, 2022 by Kevin Chun and Dylan Qu

Build Open Lakehouse using Apache Hudi & dbt

July 11, 2022 by Vinoth Govindarajan

Apache Hudi vs Delta Lake - Transparent TPC-DS Lakehouse Performance Benchmarks

June 29, 2022 by Alexey Kudinkin

Hudi’s Column Stats Index and Data Skipping feature help speed up queries by an orders of magnitude!

June 9, 2022 by Alexey Kudinkin

Asynchronous Indexing using Hudi

June 4, 2022 by Sagar Sumit