ropensci / fishbaseapi

Fishbase API
https://fishbaseapi.readme.io/
MIT License
Reference source for most updated version of Parquet files #159

Closed by alessandro-russo 9 months ago

alessandro-russo commented 10 months ago

@cboettig,

could you please clarify where to find the most updated version of Parquet files for FishBase and SeaLifeBase? Concretely, should we look at the files reachable from https://fishbase.ropensci.org/fishbase (or /sealifebase) or those in your rfishbase_board repo?

Should we assume that the files in https://fishbase.ropensci.org/fishbase are the same as the latest version in rfishbase_board (2023-05, as of today)? That doesn't seem to be the case right now: the files in https://fishbase.ropensci.org/fishbase have a LastModified date of 2023-12-31, which suggests they are more recent than the 2023-05 version.

Thanks!!!

cboettig commented 10 months ago

Please consult the schema.org metadata, https://github.com/ropensci/rfishbase/blob/master/inst/prov/fb.prov, which is also used by rfishbase. The metadata includes sha checksums, and applications should rely on those checksums to determine whether a file is the same version as one from another source. (Some tables remain unchanged between releases and thus have the same checksum. rfishbase relies on checksums to access files and to avoid re-downloading files that are identical to those previously downloaded.)
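For readers who want to apply the checksum approach outside of rfishbase, here is a minimal Python sketch. It is not taken from rfishbase and does not parse fb.prov; it assumes you have already read the expected checksum for a file out of the metadata, and it assumes SHA-256 as the hash (the thread only says "sha", so check which algorithm the metadata actually records). The function names `file_checksum` and `needs_download` are hypothetical.

```python
import hashlib
from pathlib import Path


def file_checksum(path, algorithm="sha256", chunk_size=1 << 20):
    """Return the hex digest of a file, reading it in chunks.

    The default algorithm is an assumption; pass the one named in the metadata.
    """
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large Parquet files are not loaded at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def needs_download(local_path, expected_hex, algorithm="sha256"):
    """Return True if the local copy is missing or differs from the expected checksum."""
    local = Path(local_path)
    if not local.exists():
        return True
    return file_checksum(local, algorithm) != expected_hex.lower()
```

Two files obtained from different locations are the same version exactly when `file_checksum` returns the same digest for both, regardless of their LastModified dates.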

and yeah, I think we're due for a new release, I'll ping the FishBase team and see if we can get that in the queue!

alessandro-russo commented 9 months ago

Many thanks for the clarification and for pointing to the reference metadata!