Apache Hudi
Hudi brings database functionality to data lakes!
What is Hudi
Apache Hudi is an open data lakehouse platform, built on a high-performance open table format to bring database functionality to your data lakes.
Hudi reimagines slow, old-school batch data processing with a powerful new incremental processing framework for low-latency, minute-level analytics.
Hudi Features
Why Hudi
The most innovative and completely open data lakehouse platform in the industry!
Trusted Platform
Battle tested and proven in production in some of the largest data lakes on the planet.
Open Source
Hudi has a thriving, growing community built on contributions from people around the globe.
High Performance
Hudi's storage format is purpose-built to continuously deliver performance as data scales.
Data Streams
Take advantage of built-in CDC sources and tools for streaming ingestion.
Join our Community
Get technical help, influence the product roadmap & see what’s new with Hudi!
GitHub
Join community
Slack
Join community
YouTube
Subscribe
Mailing List
Subscribe