This is similar to https://github.com/rapidsai/cudf/issues/17091, but not the same. Currently, when the input JSON data has a column with the same name as in the input schema, it will be output without checking whether that column has the correct data type. For example, with the following input:
This is similar to https://github.com/rapidsai/cudf/issues/17091, but not the same. Currently, when the input JSON data has a column with the same name as in the input schema, it will be output without checking whether that column has the correct data type. For example, with the following input:
Then
read_json
will output aLIST<INT8>
column. The correct output should be an all-null column instead.