Open Ben-Hodgkiss opened 1 month ago
@CarlosCoelhoSL - when picking this up, please liaise with @cpcundill about the work he has already done on #102, as it may already cover part of this ticket. It is also worth speaking with @eveleighoj to see whether the new "Expectations" work on this ticket can be linked into Parquet and Datasette at the same time.
Overview
Expose issues in Parquet format via Datasette.
Background
Following the design proposal for an internal API, we would like to prove some technology choices, including the use of FastAPI with DuckDB accessing Parquet on S3.
This work was identified during the spike on API design.
By exposing the issues in Parquet format via Datasette, we will learn whether Parquet is an appropriate format for consumption via Datasette as well as by a new internal API.
A spike on using Parquet in Datasette was done by @ssadhu-sl here; it contains a fork of the Datasette Parquet plugin that we can start with.
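For illustration only, the DuckDB-over-Parquet part of the stack mentioned in the background might look roughly like the sketch below. It is not taken from the spike or from #102. The function names, the `issue_type` column, and the example URL are all hypothetical, and S3 credentials are assumed to be configured separately.

```python
def parquet_issue_query(parquet_url: str) -> str:
    """Build SQL that selects up to 100 rows of one issue type from a Parquet file.

    The `issue_type` column is a hypothetical name used for illustration.
    """
    # Escape single quotes so the URL is safe inside a SQL string literal.
    safe_url = parquet_url.replace("'", "''")
    return (
        f"SELECT * FROM read_parquet('{safe_url}') "
        "WHERE issue_type = ? LIMIT 100"
    )


def fetch_issues(parquet_url: str, issue_type: str) -> list:
    """Run the query with DuckDB and return the matching rows as tuples."""
    import duckdb  # third-party dependency, imported lazily

    con = duckdb.connect()  # in-memory database; nothing is persisted
    try:
        if parquet_url.startswith("s3://"):
            # The httpfs extension provides S3 access. Credentials are
            # assumed to be configured elsewhere (not shown here).
            con.execute("INSTALL httpfs")
            con.execute("LOAD httpfs")
        # The filter value is bound as a parameter rather than interpolated.
        return con.execute(parquet_issue_query(parquet_url), [issue_type]).fetchall()
    finally:
        con.close()
```

A call such as `fetch_issues("s3://some-bucket/issue.parquet", "invalid date")` (bucket and value made up) would return the matching rows. A FastAPI endpoint or the Datasette plugin could wrap the same query.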
Tech Approach
Acceptance Criteria/Tests
Ticket Management - DELETE this section once completed