git.net

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RecordBatch with different-length Arrays


Hi,

The docs suggest that a RecordBatch is a collection of equal-length array
instances. It appears that this is not enforced and one could build a
RecordBatch from arrays of different length. Is this intentional?

Here is an example:

>>> b = pyarrow.RecordBatch.from_arrays(
   [pyarrow.array([1, 2, 3]),
    pyarrow.array([1, 2]),
    pyarrow.array([1])],
   ['a', 'b'.'c'])

>>>[len(b[i]) for i in range(3)]
[3, 2, 1]

Cheers,
Rares


( ! ) Warning: include(msgfooter.php): failed to open stream: No such file or directory in /var/www/git/apache-arrow-development/msg05287.html on line 87
Call Stack
#TimeMemoryFunctionLocation
10.0007358376{main}( ).../msg05287.html:0

( ! ) Warning: include(): Failed opening 'msgfooter.php' for inclusion (include_path='.:/var/www/git') in /var/www/git/apache-arrow-development/msg05287.html on line 87
Call Stack
#TimeMemoryFunctionLocation
10.0007358376{main}( ).../msg05287.html:0