[jira] [Created] (ARROW-3401) [C++] Pluggable statistics collector API for unconvertible CSV values


Wes McKinney created ARROW-3401:
-----------------------------------

             Summary: [C++] Pluggable statistics collector API for unconvertible CSV values
                 Key: ARROW-3401
                 URL: https://issues.apache.org/jira/browse/ARROW-3401
             Project: Apache Arrow
          Issue Type: New Feature
          Components: C++
            Reporter: Wes McKinney
             Fix For: 0.12.0


It would be useful to be able to collect statistics (e.g. distinct value counts) about the values in a CSV column that cannot be converted to the desired data type.

When conversion fails, the converters can call into an abstract API like

{code}
statistics_->CannotConvert(token, size);
{code}

or something similar.
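
To make the idea concrete, here is a rough sketch of what such a pluggable collector might look like. Only the `statistics_->CannotConvert(token, size)` call comes from the proposal above; the names `ConversionStatistics`, `DistinctValueCounter`, `Int64Converter` and `ParseInt64` are hypothetical and are not part of the Arrow codebase. The toy converter stands in for the real CSV converters only to show where the hook would be called.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <unordered_map>

// Hypothetical abstract API: converters report each token they cannot convert.
class ConversionStatistics {
 public:
  virtual ~ConversionStatistics() = default;
  // token points at `size` bytes of raw CSV field data (not null-terminated).
  virtual void CannotConvert(const uint8_t* token, uint32_t size) = 0;
};

// One possible plug-in: counts occurrences of each distinct unconvertible value.
class DistinctValueCounter : public ConversionStatistics {
 public:
  void CannotConvert(const uint8_t* token, uint32_t size) override {
    ++counts_[std::string(reinterpret_cast<const char*>(token), size)];
  }
  const std::unordered_map<std::string, int64_t>& counts() const { return counts_; }

 private:
  std::unordered_map<std::string, int64_t> counts_;
};

// Toy parser for illustration only: optional '-' followed by digits.
// Overflow is deliberately not handled here.
bool ParseInt64(const std::string& field, int64_t* out) {
  size_t i = (!field.empty() && field[0] == '-') ? 1 : 0;
  if (i == field.size()) return false;  // empty field or a lone '-'
  int64_t value = 0;
  for (; i < field.size(); ++i) {
    if (field[i] < '0' || field[i] > '9') return false;
    value = value * 10 + (field[i] - '0');
  }
  *out = (field[0] == '-') ? -value : value;
  return true;
}

// Toy converter (an assumption, not Arrow's converter) showing the call site.
class Int64Converter {
 public:
  explicit Int64Converter(ConversionStatistics* statistics) : statistics_(statistics) {}

  // Returns true on success; on failure, reports the raw token to the collector.
  bool Convert(const std::string& field, int64_t* out) {
    if (ParseInt64(field, out)) return true;
    if (statistics_ != nullptr) {
      const uint8_t* token = reinterpret_cast<const uint8_t*>(field.data());
      uint32_t size = static_cast<uint32_t>(field.size());
      statistics_->CannotConvert(token, size);
    }
    return false;
  }

 private:
  ConversionStatistics* statistics_;  // not owned; may be null to disable collection
};
```

With this sketch, passing the fields "12", "N/A", "7", "N/A" through an `Int64Converter` that holds a `DistinctValueCounter` would leave the counter with a single entry, "N/A", counted twice. Keeping the collector behind an abstract class means other statistics could be plugged in without changing the converters.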



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)