echemdb / unitpackage

A Python library to interact with a collection of frictionless datapackages
https://echemdb.github.io/unitpackage/
GNU General Public License v3.0

Support CSV files with multiple header lines and different delimiter #23

Open DunklesArchipel opened 1 year ago

DunklesArchipel commented 1 year ago

CSV files in the scientific community often contain more than one header line. The extra lines usually hold additional metadata, for example information written by the instrument.

Both the header lines and the delimiter can be described in the data package's dialect: https://framework.frictionlessdata.io/docs/guides/describing-data.html

When a unit package is created, these settings could be read from the data package's dialect and used to create the pandas resource.
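
To illustrate the idea, here is a minimal sketch that uses only the Python standard library. It is not unitpackage or frictionless code: the `dialect` dictionary, its key names (`header_row`, `delimiter`), the helper `read_with_dialect` and the sample data are all hypothetical and only stand in for whatever the data package's dialect would provide.

```python
import csv

# Hypothetical instrument export: two metadata lines, then the column
# header, with ';' as the delimiter. The values are made up.
RAW = """\
# instrument: example potentiostat
# comment: made-up sample data
t / s;E / V;j / A m-2
0.0;0.10;1.5
0.1;0.11;1.7
"""

# Assumed dialect-like settings. The key names are illustrative only and
# are not taken from the frictionless specification.
dialect = {"header_row": 3, "delimiter": ";"}


def read_with_dialect(text, dialect):
    """Split text into metadata lines, column names and data rows."""
    lines = text.splitlines()
    header_index = dialect["header_row"] - 1  # header_row is 1-based
    metadata = lines[:header_index]  # everything above the column header
    reader = csv.reader(lines[header_index:], delimiter=dialect["delimiter"])
    header = next(reader)  # first remaining line holds the column names
    rows = [dict(zip(header, row)) for row in reader]
    return metadata, header, rows


metadata, header, rows = read_with_dialect(RAW, dialect)
```

With pandas, the same two settings would presumably map onto `pandas.read_csv(path, sep=";", skiprows=2)`, where `skiprows` is the number of metadata lines above the column header.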