niconoe / DarwinSQL

DarwinCore Archives expressed as SQLite
BSD 2-Clause "Simplified" License
1 stars 0 forks source link
biodiversity biodiversity-informatics biodiversity-standards gbif tdwg

DarwinSQL

Like DarwinCore Archive, but using SQLite

Rationale

The Darwin Core Archive format is widely for biodiversity informatics (within the GBIF community for example), and has largely shown its effectiveness as an exchange format.

Due to its nature (basically, a bunch of CSV files zipped together) it is however a poor candidate for data use and analysis: many users get data in Darwin Core Archive, but then immediately extract data from the Archive and transfer it to some custom/non-standard format that's easier to manipulate, such as a spreadsheet or a relational database.

DarwinSQL: the concept

The aim of DarwinSQL is to propose an alternative, standardized file format that can be used for exchanging data, but also for simple data use and analysis.

There are two main milestones to this project:

The DarwinSQL format

Work in progress, subject to change at any time, comments are welcome!