[jira] [Created] (ARROW-3002) Inconsistent DataType Hashing


Sam Oluwalana created ARROW-3002:
------------------------------------

             Summary: Inconsistent DataType Hashing
                 Key: ARROW-3002
                 URL: https://issues.apache.org/jira/browse/ARROW-3002
             Project: Apache Arrow
          Issue Type: Bug
            Reporter: Sam Oluwalana



{code:python}
>>> x = pa.field('record', pa.struct([pa.field('x', pa.int32(), nullable=False)]))
>>> y = pa.field('record', pa.struct([pa.field('x', pa.int32(), nullable=True)]))
>>> x.__hash__()
-9223372036569171727
>>> y.__hash__()
285604054
>>> x.type
StructType(struct<x: int32>)
>>> x.type.__hash__()
429437081997812647
>>> y.type.__hash__()
429437081997812647
>>> x
pyarrow.Field<record: struct<x: int32>>
>>> y
pyarrow.Field<record: struct<x: int32>>
{code}

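The two fields differ only in the nullability of the child field 'x'. Their Field hashes differ, but their struct types produce the same hash and the same repr. StructType should take the nullability of its child fields into account when generating the hash.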
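
To illustrate the expected behavior, here is a minimal sketch in plain Python. It does not use pyarrow and is not how Arrow implements hashing; ToyField and ToyStruct are hypothetical stand-ins invented for this example, and the string "int32" is a placeholder for a primitive type. The idea is only that a struct's hash and equality should be derived from its child fields including their nullable flag.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class ToyField:
    """Hypothetical stand-in for a field: name, type, and nullability."""
    name: str
    type: object
    nullable: bool = True


@dataclass(frozen=True)
class ToyStruct:
    """Hypothetical stand-in for a struct type holding child fields."""
    fields: Tuple[ToyField, ...]


int32 = "int32"  # placeholder for a primitive type, not a pyarrow object

x = ToyField("record", ToyStruct((ToyField("x", int32, nullable=False),)))
y = ToyField("record", ToyStruct((ToyField("x", int32, nullable=True),)))

# Frozen dataclasses derive __eq__ and __hash__ from every attribute,
# so the child's nullable flag feeds into the struct's equality and hash.
print(x.type == y.type)
print(hash(x.type) == hash(y.type))
```

With this model the two struct types compare unequal, so a set or dict keyed on the type keeps them apart, which is the behavior the report asks for from StructType.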



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)