63 posts tagged with "aws"

Use Apache Hudi tables in Athena for Spark

September 9, 2024 by Amazon

Developer Guide: How to Submit Hudi PySpark(Python) Jobs to EMR Serverless (7.1.0) with AWS Glue Hive MetaStore

September 4, 2024 by Soumil Shah

Use AWS Data Exchange to seamlessly share Apache Hudi datasets

May 22, 2024 by Saurabh Bhutyani, Ankith Ede, and Chandra Krishnan

Apache Hudi on AWS Glue

May 19, 2024 by Sagar Lakshmipathy

Learn how to read Hudi data with AWS Glue Ray using Daft (No Spark)

May 7, 2024 by Soumil Shah

Build Real Time Streaming Pipeline with Kinesis, Apache Flink and Apache Hudi with Hands-on

April 21, 2024 by Md Shahid Afridi P

Cost Optimization Strategies for scalable Data Lakehouse

March 22, 2024 by Suresh Hasundi

Building Data Lakes on AWS with Kafka Connect, Debezium, Apicurio Registry, and Apache Hudi

February 27, 2024 by Gary A. Stafford

Empowering data-driven excellence: How the Bluestone Data Platform embraced data mesh for success

February 27, 2024 by Toney Thomas, Ben Vengerovsky and Rada Stanic

Apache Hudi: Managing Partition on a petabyte-scale table

February 4, 2024 by Krishna Prasad

Leverage Partition Paths of your data lake tables to Optimize Data Retrieval Costs on the cloud

January 30, 2024 by Krishna Prasad

Use Amazon Athena with Spark SQL for your open-source transactional table formats

January 24, 2024 by Pathik Shah, Raj Devnath