Skip to main content

Apache Hudi

Hudi brings
 
 
to data lakes!
Hudi banner

What is Hudi

Apache Hudi is an open data lakehouse platform, built on a high-performance open table format to bring database functionality to your data lakes.
Hudi reimagines slow old-school batch data processing with a powerful new incremental processing framework for low latency minute-level analytics.
Hudi Data Lake

Hudi Features

Why Hudi

The most innovative and completely open data lakehouse platform in the industry!

Trusted Platform

Battle tested and proven in production in some of the largest data lakes on the planet.

Open Source

Hudi is a thriving & growing community that is built with contributions from people around the globe.

High Performance

Hudi's storage format is purpose-built to continuously deliver performance as data scales.

Data streams

Take advantage of built-in CDC sources and tools for streaming ingestion.

Join our Community

Get technical help, influence the product roadmap & see what’s new with Hudi!

GitHub

Join community

Slack

Join community

Linkedin

Join community

Twitter

Join community

Youtube

Subscribe

Mailing

Subscribe