Streaming Ingestion from MongoDB into Hudi with Glue, kinesis&Event bridge&MongoStream Hands on labsFebruary 18, 2023 bySoumil Shahguidestreaming ingestionnear real-time analyticsmongodb atlasmerge on readMORamazon kinesisevent busapache hudi
Create Your Hudi Transaction Datalake on S3 with EMR Serverless for Beginners in fun and easy wayFebruary 11, 2023 bySoumil Shahguideamazon emr serverlessamazon s3apache hudibeginner
How do I Ingest Extremely Small Files into Hudi Data lake with Glue Incremental data processingFebruary 7, 2023 bySoumil Shahguidesmall filesincremental-processingpysparkaws glueamazon s3apache hudi
Learn How to restrict Intern from accessing Certain Column in Hudi Datalake with lake FormationJanuary 28, 2023 bySoumil Shahguideaccess restrictioncomplianceaws lake formationapache hudiamazon athena
Writing data quality and validation scripts for a Hudi data lake with AWS Glue and pydeequ| Hands on LabJanuary 23, 2023 bySoumil Shahguidedata qualityvalidationpydeequpythonaws glueapache hudi
How to detect and Mask PII data in Apache Hudi Data Lake | Hands on LabJanuary 21, 2023 bySoumil Shahguidemask piihipaagdprmaskingcomplianceamazon s3aws glueapache hudiamazon athena
How do I identify Schema Changes in Hudi Tables and Send Email Alert when New Column added/removedJanuary 20, 2023 bySoumil Shahguideschema changesschema evolutionalertingamazon s3aws glueapache hudiamazon athena
Cleaner Service: Save up to 40% on data lake storage costs | Hudi LabsJanuary 17, 2023 bySoumil Shahguidecleaner servicestorage costapache hudi
Global Bloom Index: Remove duplicates & guarantee uniquness | Hudi LabsJanuary 17, 2023 bySoumil Shahguideduplicatesde-duplicateindexingglobal indexbloomuniquenessapache hudi
How businesses use Hudi Soft delete features to do soft delete instead of hard delete on DatalakeJanuary 17, 2023 bySoumil Shahguidedeletesoft deleteapache hudi