git.net

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: UTF-16 support for TextInputFormat


Hi David,

Did you try to set the encoding on the TextInputFormat with

TextInputFormat tif = ...
tif.setCharsetName("UTF-16");

Best, Fabian

2018-08-08 17:45 GMT+02:00 David Dreyfus <dddreyfus@xxxxxxxxx>:
Hello -

It does not appear that Flink supports a charset encoding of "UTF-16". It particular, it doesn't appear that Flink consumes the Byte Order Mark (BOM) to establish whether a UTF-16 file is UTF-16LE or UTF-16BE. Are there any plans to enhance Flink to handle UTF-16 with BOM?

Thank you,
David