Skip to main content
Version: 0.9.0

Baidu Cloud

In this page, we explain how to get your Hudi job to store into Baidu BOS.

Baidu BOS configs#

There are two configurations required for Hudi-BOS compatibility:

  • Adding Baidu BOS Credentials for Hudi
  • Adding required Jars to classpath

Baidu BOS Credentials#

Add the required configs in your core-site.xml from where Hudi can fetch them. Replace the fs.defaultFS with your BOS bucket name, replace fs.bos.endpoint with your bos endpoint, replace fs.bos.access.key with your bos key, replace fs.bos.secret.access.key with your bos secret key. Hudi should be able to read/write from the bucket.

<property>  <name>fs.defaultFS</name>  <value>bos://bucketname/</value></property>
<property>  <name>fs.bos.endpoint</name>  <value>bos-endpoint-address</value>  <description>Baidu bos endpoint to connect to,for example : http://bj.bcebos.com</description></property>
<property>  <name>fs.bos.access.key</name>  <value>bos-key</value>  <description>Baidu access key</description></property>
<property>  <name>fs.bos.secret.access.key</name>  <value>bos-secret-key</value>  <description>Baidu secret key.</description></property>
<property>  <name>fs.bos.impl</name>  <value>org.apache.hadoop.fs.bos.BaiduBosFileSystem</value></property>

Baidu bos Libs#

Baidu hadoop libraries jars to add to our classpath

  • com.baidubce:bce-java-sdk:0.10.165
  • bos-hdfs-sdk-1.0.2-community.jar

You can download the bos-hdfs-sdk jar from here , and then unzip it.