git.net

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RecordBatch with different-length Arrays


Hi,

The docs suggest that a RecordBatch is a collection of equal-length array
instances. It appears that this is not enforced and one could build a
RecordBatch from arrays of different length. Is this intentional?

Here is an example:

>>> b = pyarrow.RecordBatch.from_arrays(
   [pyarrow.array([1, 2, 3]),
    pyarrow.array([1, 2]),
    pyarrow.array([1])],
   ['a', 'b'.'c'])

>>>[len(b[i]) for i in range(3)]
[3, 2, 1]

Cheers,
Rares