Empowering data-driven excellence: How the Bluestone Data Platform embraced data mesh for successFebruary 27, 2024 byToney Thomas, Ben Vengerovsky and Rada Stanicblogapache hudiuse-casedata meshamazon
Enabling near real-time data analytics on the data lakeFebruary 23, 2024 byShi Kai Ng and Shuguang Xiangblogapache hudinear real-time analyticsmorgrab
How a POC became a production-ready Hudi data lakehouse through close team collaborationFebruary 12, 2024 byXiaoxiao Rey and Hussein Awalause-caseapache hudileboncoin-tech-blogbeginnerdeletegdpr deletionupsert
Building an Open Source Data Lake House with Hudi, Postgres Hive Metastore, Minio, and StarRocksFebruary 6, 2024 bySoumil Shahblogapache hudilinkedinbeginnerapache sparkapache hivehive metastoreminiostarrocksdockerpythonpostgrespostgresql
Combine Transactional Integrity and Data Lake Operations with YugabyteDB and Apache HudiFebruary 6, 2024 byBalachandar Seetharamanblogapache hudiACIDtransactionsreal-time datalakecdcetlyugabyte
Apache Hudi: Managing Partition on a petabyte-scale tableFebruary 4, 2024 byKrishna Prasadblogapache hudimediumintermediatepartitionaws glueapache sparkaws s3
Leverage Partition Paths of your data lake tables to Optimize Data Retrieval Costs on the cloudJanuary 30, 2024 byKrishna Prasadblogapache hudimediumintermediateaws gluecostapache sparkpartition
Use Amazon Athena with Spark SQL for your open-source transactional table formatsJanuary 24, 2024 byPathik Shah, Raj Devnathblogapache hudiawsbeginneraws glueaws athenatime travel queryclusteringcompactionaws s3apache icebergdelta lake
Data Engineering: Bootstrapping Data lake with Apache HudiJanuary 20, 2024 byKrishna Prasadblogapache hudimediumbeginnerETLaws glueapache sparkaws s3
Learn How to Move Data From MongoDB to Apache Hudi Using PySparkJanuary 20, 2024 bySoumil Shahblogapache hudilinkedinbeginnermongodbapache sparkpyspark
Deleting Items from Apache Hudi using Delta Streamer in UPSERT Mode with Kafka Avro MessagesJanuary 18, 2024 bySoumil Shahblogapache hudilinkedinbeginnerhudi streamerdeltastreamerapache kafkaapache avroupsertdelete
Enforce fine-grained access control on Open Table Formats via Amazon EMR integrated with AWS Lake FormationJanuary 17, 2024 byRaymond Lai, Aditya Shah, Bin Wang, and Melody Yangblogapache hudiawsintermediateamazon emraws lake formationaws glueaws s3amazon sagemakeraws cloud9amazon athenaaccess control