git.net

Re: Multiple hdfs


Hi Kien,

Thanks for your reply.

My goal is to store the checkpoints in one HDFS cluster and the data in another HDFS cluster.

So Flink should be able to connect to two different HDFS clusters.

Thanks

2018-05-22 15:00 GMT+01:00 Kien Truong <duckientruong@xxxxxxxxx>:
Hi,

If your clusters are not high-availability (HA) clusters, then just use the full path, including the NameNode address, for each cluster.

For example, to refer to directory /checkpoint on cluster1, use hdfs://namenode1_ip:port/checkpoint

Likewise, /data on cluster2 will be hdfs://namenode2_ip:port/data
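
As a rough illustration of the non-HA case, the sketch below builds the two fully qualified paths from a NameNode address and a directory. The host names (namenode1.example.com, namenode2.example.com) and port 8020 are placeholders assumed for the example, and the class and method names are hypothetical; substitute the values for your own clusters.

```java
import java.net.URI;

// Sketch only: host names and the port are assumed placeholders.
final class HdfsPaths {

    // Builds a fully qualified HDFS URI so the path is resolved
    // against the given NameNode rather than the default file system.
    static URI fullyQualified(String nameNodeHost, int port, String path) {
        return URI.create("hdfs://" + nameNodeHost + ":" + port + path);
    }

    public static void main(String[] args) {
        // Checkpoints go to cluster1, data goes to cluster2.
        URI checkpoints = fullyQualified("namenode1.example.com", 8020, "/checkpoint");
        URI data = fullyQualified("namenode2.example.com", 8020, "/data");

        System.out.println(checkpoints);
        System.out.println(data);
    }
}
```

The first string would then be used as the checkpoint directory and the second as the output path of the job, so each one is resolved against its own cluster.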


If your clusters are HA clusters, then you need to modify hdfs-site.xml as described in section 1 of this guide:

https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.4/bk_administration/content/distcp_between_ha_clusters.html

Then refer to each cluster by its nameservice name in the path: hdfs://cluster1ha/checkpoint and hdfs://cluster2ha/data
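
For the HA case, a client-side hdfs-site.xml along these lines is roughly what is needed. This is a sketch, not a copy of the linked guide: the NameNode IDs (nn1, nn2), host names, and port 8020 are placeholders assumed for illustration, and only cluster1ha is spelled out; cluster2ha needs the same set of properties with its own addresses.

```xml
<!-- Sketch only: NameNode IDs, host names and ports are assumed placeholders. -->
<configuration>
  <!-- Both nameservices must be known to the client. -->
  <property>
    <name>dfs.nameservices</name>
    <value>cluster1ha,cluster2ha</value>
  </property>

  <!-- NameNodes that make up cluster1ha. -->
  <property>
    <name>dfs.ha.namenodes.cluster1ha</name>
    <value>nn1,nn2</value>
  </property>
  <property>
    <name>dfs.namenode.rpc-address.cluster1ha.nn1</name>
    <value>c1-namenode1.example.com:8020</value>
  </property>
  <property>
    <name>dfs.namenode.rpc-address.cluster1ha.nn2</name>
    <value>c1-namenode2.example.com:8020</value>
  </property>

  <!-- Lets the client fail over between the NameNodes of cluster1ha. -->
  <property>
    <name>dfs.client.failover.proxy.provider.cluster1ha</name>
    <value>org.apache.hadoop.hdfs.server.namenode.ha.ConfiguredFailoverProxyProvider</value>
  </property>

  <!-- Repeat the dfs.ha.namenodes.*, dfs.namenode.rpc-address.* and
       dfs.client.failover.proxy.provider.* properties for cluster2ha. -->
</configuration>
```

The file has to be on the Hadoop configuration path that Flink picks up, so that both nameservice names can be resolved.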

Regards,
Kien


On 5/21/2018 9:19 PM, Raul Valdoleiros wrote:
Hi,

I want to store my data in one HDFS cluster and the Flink checkpoints in another. I didn't find a way to do it. Can anyone point me in the right direction?

Thanks in advance,
Raul


