演讲 & 报告
“Hoodie: Incremental processing on Hadoop at Uber” - By Vinoth Chandar & Prasanna Rajaperumal Mar 2017, Strata + Hadoop World, San Jose, CA
“Hoodie: An Open Source Incremental Processing Framework From Uber” - By Vinoth Chandar. Apr 2017, DataEngConf, San Francisco, CA Slides Video
“Incremental Processing on Large Analytical Datasets” - By Prasanna Rajaperumal June 2017, Spark Summit 2017, San Francisco, CA. Slides Video
“Hudi: Unifying storage and serving for batch and near-real-time analytics” - By Nishith Agarwal & Balaji Vardarajan September 2018, Strata Data Conference, New York, NY
“Hudi: Large-Scale, Near Real-Time Pipelines at Uber” - By Vinoth Chandar & Nishith Agarwal October 2018, Spark+AI Summit Europe, London, UK
“Powering Uber’s global network analytics pipelines in real-time with Apache Hudi” - By Ethan Guo & Nishith Agarwal, April 2019, Data Council SF19, San Francisco, CA.
“Building highly efficient data lakes using Apache Hudi (Incubating)” - By Vinoth Chandar June 2019, SF Big Analytics Meetup, San Mateo, CA
“Apache Hudi (Incubating) - The Past, Present and Future Of Efficient Data Lake Architectures” - By Vinoth Chandar & Balaji Varadarajan September 2019, ApacheCon NA 19, Las Vegas, NV, USA
“Insert, upsert, and delete data in Amazon S3 using Amazon EMR” - By Paul Codding & Vinoth Chandar December 2019, AWS re:Invent 2019, Las Vegas, NV, USA
“Building Robust CDC Pipeline With Apache Hudi And Debezium” - By Pratyaksh, Purushotham, Syed and Shaik December 2019, Hadoop Summit Bangalore, India
“Using Apache Hudi to build the next-generation data lake and its application in medical big data” - By JingHuang & Leesf March 2020, Apache Hudi & Apache Kylin Online Meetup, China
“Building a near real-time, high-performance data warehouse based on Apache Hudi and Apache Kylin” - By ShaoFeng Shi March 2020, Apache Hudi & Apache Kylin Online Meetup, China
“Building large scale, transactional data lakes using Apache Hudi” - By Nishith Agarwal, June 2020, Berlin Buzzwords 2020.
“Apache Hudi - Design/Code Walkthrough Session for Contributors” - By Vinoth Chandar, July 2020, Hudi community.
“PrestoDB and Apache Hudi” - By Bhavani Sudha Saktheeswaran and Brandon Scheller, Aug 2020, PrestoDB Community Meetup.
“Landing practice of Apache Hudi in T3go” - By VinoYang and XianghuWang, November 2020, Qcon.
You can check out our blog pages for content written by our committers/contributors.
- “The Case for incremental processing on Hadoop” - O’reilly Ideas article by Vinoth Chandar
- “Hoodie: Uber Engineering’s Incremental Processing Framework on Hadoop” - Engineering Blog By Prasanna Rajaperumal
- “New – Insert, Update, Delete Data on S3 with Amazon EMR and Apache Hudi” - AWS Blog by Danilo Poccia
- “The Apache Software Foundation Announces Apache® Hudi™ as a Top-Level Project” - ASF Graduation announcement
- “Apache Hudi grows cloud data lake maturity”
- “Building a Large-scale Transactional Data Lake at Uber Using Apache Hudi” - Uber eng blog by Nishith Agarwal
- “Hudi On Hops” - By NETSANET GEBRETSADKAN KIDANE
- “开源数据湖存储框架 Apache Hudi 如何玩转增量处理” - InfoQ CN article by Yanghua
- “Origins of Data Lake at Grofers” - by Akshay Agarwal
- “Data Lake Change Capture using Apache Hudi & Amazon AMS/EMR” - Towards DataScience article, Oct 20
- “How nClouds Helps Accelerate Data Delivery with Apache Hudi on Amazon EMR” - published by nClouds in partnership with AWS
- “Apply record level changes from relational databases to Amazon S3 data lake using Apache Hudi on Amazon EMR and AWS Database Migration Service” - AWS blog