Open jorisvandenbossche opened 1 month ago
Good point! Maybe include the output of schema.fields
by default (perhaps truncated to a reasonable number of lines?)
import nanoarrow as na
url = "https://github.com/apache/arrow-experiments/raw/main/data/arrow-commits/arrow-commits.arrows"
schema = na.ArrayStream.from_url(url).schema
schema.fields
#> [<Schema> 'commit': string,
#> <Schema> 'time': timestamp('us', 'UTC'),
#> <Schema> 'files': int32,
#> <Schema> 'merge': bool,
#> <Schema> 'message': string]
Looking at this example, I think it might be reasonable to truncate the schema repr when it is embedded in the Array repr, but then when inspecting the schema itself I would expected to see the full repr (or at least the truncation can be more relaxed, and show more by default):