Stream to Stream Join Memory Management


Hello!

I'm trying to do a simple DataStream-to-DataStream join. I have two Kafka topics that share a common field, and I'm trying to join them on that field via the keyBy-join-where-equalTo-TumblingWindow API in Flink 1.4.1.

My tumbling window size is 1 day, so a single window will hold more data than the machine has memory. I know that Flink uses RocksDB to store the state of the window. Will Flink also use RocksDB for the join within a window, rather than a HashMap for the merge operation?
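
To make the question concrete, here is a minimal plain-Java sketch of what a one-day tumbling-window equi-join computes. It is a simplified model under stated assumptions, not Flink's implementation: the Event class, the windowStart and join helpers, and the string-concatenated output are all hypothetical names invented for illustration, and timestamps are assumed to be non-negative epoch milliseconds with no window offset. The HashMap named "buffered" plays the role of the per-window buffer whose size is the concern above.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class TumblingJoinSketch {
    static final long ONE_DAY_MS = 24L * 60 * 60 * 1000;

    /** Hypothetical event: the common join field, an event timestamp, and a payload. */
    static final class Event {
        final String key;
        final long timestampMs;
        final String payload;

        Event(String key, long timestampMs, String payload) {
            this.key = key;
            this.timestampMs = timestampMs;
            this.payload = payload;
        }
    }

    /** Start of the tumbling window containing the timestamp (assumes no offset). */
    static long windowStart(long timestampMs, long windowSizeMs) {
        return timestampMs - Math.floorMod(timestampMs, windowSizeMs);
    }

    /** Identifies one (window, key) slot; two events join only if they share a slot. */
    static String slot(Event e, long windowSizeMs) {
        return windowStart(e.timestampMs, windowSizeMs) + "|" + e.key;
    }

    /** Joins left and right events that have the same key and fall in the same window. */
    static List<String> join(List<Event> left, List<Event> right, long windowSizeMs) {
        // This in-memory buffer holds every left-side event of every window.
        // With one-day windows, this is the structure that outgrows memory.
        Map<String, List<Event>> buffered = new HashMap<>();
        for (Event l : left) {
            buffered.computeIfAbsent(slot(l, windowSizeMs), k -> new ArrayList<>()).add(l);
        }

        List<String> joined = new ArrayList<>();
        for (Event r : right) {
            List<Event> matches = buffered.get(slot(r, windowSizeMs));
            if (matches == null) {
                continue; // no left-side event with this key in this window
            }
            for (Event l : matches) {
                joined.add(l.payload + "+" + r.payload);
            }
        }
        return joined;
    }
}
```

In this sketch, two events with the same key pair up only if their timestamps fall within the same calendar-day window; an event just after the day boundary lands in the next window and does not match.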

Best,
Sayat

