frictionlessdata / frictionless-py

Data management framework for Python that provides functionality to describe, extract, validate, and transform tabular data
https://framework.frictionlessdata.io
MIT License
688 stars 142 forks source link

Support raster geoinformation #536

Open gperonato opened 3 years ago

gperonato commented 3 years ago

Using raster georeferenced files is a popular way of storing geoinformation in many disciplines. Many open data repositories (e.g. Copernicus Climate Data Store, NASA Earth Data) provide raster files. GeoTiff, NetCDF are some of the most popular formats. GDAL subprograms allow for easy conversion between formats and extraction of relevant metadata (e.g., with gdalinfo). I know this has been discussed before, but still, I think raster support would be a nice feature and could increase the popularity of frictionless in some fields. Is there any interest in this? Is someone already working on it?


Please preserve this line to notify @roll (lead of this repository)

fallahnejad commented 3 years ago

I can confirm what @gperonato has mentioned. During our H2020 project Hotmaps we were searching for guidelines to create metadata for raster data, which later on could help us in integrating the datasets in the database. We encountered with the frictionless guidelines; however, no guidelines were provided for raster data. As a result, we tried to define and create own guidelines for the raster data (for details please refer to page 11-13 of this document). A sample of datapackage that was developed for raster file can be found HERE. As mentioned in the previous comment by @gperonato , it is pretty easy to obtain encapsulated metadata of raster file using GDAL library and subsequently, create a datapackage file for it.

I would be happy to collaborate on this topic, if required.

lwinfree commented 3 years ago

this is super cool, thanks for sharing! I'm going to post this on the discord channel so other users know about it. @rufuspollock do you have thoughts here?

gperonato commented 3 years ago

I have started working on a raster dataflow here: https://github.com/enermaps/dataflows/tree/raster See README-raster.md for more information

roll commented 3 years ago

@gperonato Great news!

cc @lwinfree

loleg commented 1 year ago

How is it going, any chance there will be a PR on your dataflows branch @gperonato ?

gperonato commented 1 year ago

I am very sorry I missed this message. From my side I am not able to continue working on that branch/idea :(