datatractor / schema

Schemas for Metadata Extractors
https://datatractor.github.io/schema/
MIT License

Towards `Extractor` output format #4

Open PeterKraus opened 2 weeks ago

PeterKraus commented 2 weeks ago
We should think about how extractors can report their desired formats at some point (e.g., in case we run into issues with pandas 3).

Originally posted by @ml-evs in https://github.com/datatractor/beam/pull/3#pullrequestreview-2294132022

PeterKraus commented 2 weeks ago

What we have now:

Of course, this deals with FileTypes only, not with any returned in-memory objects. For `usage.method == python`, it might be sensible to also provide something like `required_formats`, indicating which package(s) (e.g. pandas, numpy < 2.0) need to be installed in the parent environment to handle the returned object.
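To make the idea concrete, here is a rough sketch of how a caller might check such a field before invoking a Python extractor. Everything here is an assumption for illustration: `required_formats` is not part of the schema, the requirement-string syntax (`"numpy<2.0"`) is just one possible encoding, and `missing_requirements` is a hypothetical helper. Only the standard library is used, so the version comparison is deliberately simplistic.

```python
import re
from importlib import metadata

# Hypothetical requirement syntax: a package name, optionally followed by
# one comparison operator and a dotted numeric version, e.g. "numpy<2.0".
_REQ = re.compile(
    r"^\s*([A-Za-z0-9_.\-]+)\s*(?:(<=|>=|==|<|>)\s*([0-9][0-9.]*))?\s*$"
)


def _as_tuple(version):
    """Keep only the digit runs of a version string: '1.26.4' -> (1, 26, 4).

    This is a simplification; pre-release tags are not handled properly.
    """
    return tuple(int(part) for part in re.findall(r"\d+", version))


def _compare(installed, op, wanted):
    """Compare two version strings after zero-padding them to equal length."""
    a, b = _as_tuple(installed), _as_tuple(wanted)
    n = max(len(a), len(b))
    a += (0,) * (n - len(a))
    b += (0,) * (n - len(b))
    return {"<": a < b, "<=": a <= b, "==": a == b, ">=": a >= b, ">": a > b}[op]


def missing_requirements(required_formats, get_version=metadata.version):
    """Return the requirement strings that the parent environment does not satisfy."""
    unmet = []
    for req in required_formats:
        match = _REQ.match(req)
        if match is None:
            unmet.append(req)  # unparseable entries are treated as unmet
            continue
        name, op, wanted = match.groups()
        try:
            installed = get_version(name)
        except metadata.PackageNotFoundError:
            unmet.append(req)  # package is not installed at all
            continue
        if op and not _compare(installed, op, wanted):
            unmet.append(req)  # installed, but the version is out of range
    return unmet


if __name__ == "__main__":
    # Assumed content of a hypothetical `required_formats` field.
    print(missing_requirements(["pandas", "numpy<2.0"]))
```

An empty list would mean the parent env can handle the returned object. A real implementation would probably be better off using the `packaging` library to parse and evaluate the specifiers rather than the hand-rolled comparison above.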