lanl / dsi

LANL Data Science Infrastructure Project
https://lanl.github.io/dsi
5 stars 3 forks source link

Bueno change: Handle multiple file inputs for Plugins #63

Open qwofford opened 1 year ago

qwofford commented 1 year ago

Currently the Bueno plugin handles the case where a single file is responsible for determining the schema and sampling metadata.

We need to support a list of files. The Plugin should build up the schema from the union of input files, and append one or more rows for each file parsed. If any row values are missing, we should fill them with null values.

At the end of a transload operation, we should report the following to the user: