Re: Old job resurrected during HA failover


Till,

Thoughts?

On Wed, Aug 1, 2018 at 7:34 PM vino yang <yanghua1127@xxxxxxxxx> wrote:
Your analysis is correct. In theory the old JobGraph should be deleted, but Flink currently locks the path and deletes it asynchronously, so it cannot give you an acknowledgment that the deletion has completed. This is a risk point.

cc Till: other users have run into this problem before. I personally think asynchronous deletion is risky, because after a failover the JM may revive a job that was already cancelled.
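
To make the risk concrete, here is a minimal sketch of the race described above. It is not Flink's code: JobGraphStoreSketch, removeAsync, removeWithAck and recoverJobIds are hypothetical names, and an in-memory map stands in for the HA store. The latch only simulates a deletion that has not run yet at the moment a new leader recovers jobs.

```java
import java.util.HashSet;
import java.util.Map;
import java.util.Set;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.CountDownLatch;

// Hypothetical in-memory stand-in for an HA job graph store; not Flink's API.
class JobGraphStoreSketch {
    private final Map<String, String> store = new ConcurrentHashMap<>();

    void put(String jobId, String graph) {
        store.put(jobId, graph);
    }

    // Fire-and-forget delete: the caller gets no signal about when it ran.
    // The gate simulates a deletion that is still pending.
    void removeAsync(String jobId, CountDownLatch gate) {
        CompletableFuture.runAsync(() -> {
            try {
                gate.await();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                return;
            }
            store.remove(jobId);
        });
    }

    // Acknowledged delete: the future completes only after the entry is gone.
    CompletableFuture<Void> removeWithAck(String jobId) {
        return CompletableFuture.runAsync(() -> { store.remove(jobId); });
    }

    // What a new leader would do after failover: recover whatever is still stored.
    Set<String> recoverJobIds() {
        return new HashSet<>(store.keySet());
    }

    public static void main(String[] args) {
        JobGraphStoreSketch s = new JobGraphStoreSketch();

        // Case 1: cancel, then failover before the async delete has run.
        CountDownLatch gate = new CountDownLatch(1);
        s.put("job-1", "graph");
        s.removeAsync("job-1", gate);
        System.out.println("job-1 recovered after cancel: "
                + s.recoverJobIds().contains("job-1"));
        gate.countDown(); // let the pending delete finish

        // Case 2: cancel, wait for the acknowledgment, then failover.
        s.put("job-2", "graph");
        s.removeWithAck("job-2").join();
        System.out.println("job-2 recovered after cancel: "
                + s.recoverJobIds().contains("job-2"));
    }
}
```

In the first case the cancelled job is still visible to recovery because nothing tells the caller that the delete is pending. In the second case the caller blocks on the returned future, so recovery can only start after the entry is gone.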