Closed lwjohnst86 closed 1 month ago
frictionless
can definitely be used to extract an initial schema (for csv files*). We just need to find a way to save it to a db instead of a file.
In #544, I have added frictionless
with sql to our dependencies. From the documentation, it seems that frictionless
supports both read and write from sql (using sqlalchemy
).
*Frictionless doesn't support extraction from e.g., .txt
files with data (at least as far as I understand). But they support other file types such as json and parquet
We've already started doing this!
There is a Python package for extracting the metadata, https://v4.framework.frictionlessdata.io/docs/guides/describing-data
So it might look like:
If we follow the frictionless approach, this would make it easier to integrate with them.