Apache Hudi: Managing Partition on a petabyte-scale tableFebruary 4, 2024 by Krishna Prasadawsapache spark
Leverage Partition Paths of your data lake tables to Optimize Data Retrieval Costs on the cloudJanuary 30, 2024 by Krishna Prasadawsperformanceapache spark
Data Engineering: Bootstrapping Data lake with Apache HudiJanuary 20, 2024 by Krishna Prasadbeginneretlawsapache spark
Learn How to Move Data From MongoDB to Apache Hudi Using PySparkJanuary 20, 2024 by Soumil Shahbeginnermongodbapache spark
In-House Data Lake with CDC Processing, Hudi, DockerJanuary 11, 2024 by Rahuldockercdcapache kafkadebeziumapache sparkaws
From Data lake to Microservices: Unleashing the Power of Apache Hudi's Record Level Index with FastAPI and Spark ConnectJanuary 1, 2024 by Soumil Shahbeginnerapache sparkindexingdmlfastapi