BelgianBiodiversityPlatform / python-dwca-reader

🐍 A Python package to read Darwin Core Archive (DwC-A) files.
BSD 3-Clause "New" or "Revised" License
45 stars 21 forks source link

Support for very basic archives #47

Closed niconoe closed 8 years ago

niconoe commented 8 years ago

I.e a simple CSV with column headers.

This was mentioned by Peter and Stijn in the context of their dwca validator. This looks definitely doable. Next question is: is that in the scope of python-dwca-reader.

If not and we don't want to clutter this package with such code, it may be a good idea to implement a higher-level wrapper to abstract things, in a way similar to:

if given_file_type == dwca:
    dispatch to python-dwca-reader()
elif given_file_type==csv:
    analyze_headers()
    parse_as_csv()
niconoe commented 8 years ago

Would the description of simple darwincore (http://rs.tdwg.org/dwc/terms/simple/index.htm) match this?

peterdesmet commented 8 years ago

Haven't read the definition of Simple DwC in depth again, but from what I recall, that would indeed be what we want to support.

niconoe commented 8 years ago

Cool! There are also interesting explanations about "darwin core archives without metafile" on page 6 of http://www.gbif.org/resource/80639.

That looks also similar, so this issue looks like a duplicate of #7.

niconoe commented 8 years ago

@peterdesmet I'm working on this, so that should be there for next release!

niconoe commented 8 years ago

Fixed in current!