VertNet / gulo

Shredding Darwin Core Archives with ferocity, strength, and Cascalog.
7 stars 5 forks source link

Accommodate harvest from hosted IPT with metadata from original #118

Closed tucotuco closed 10 years ago

tucotuco commented 10 years ago

Right now the CartoDB resource_staging table holds the URL for the source data resource. To facilitate hosting migrated versions without having to re-do and maintain metadata, add a metadataURL column pointing to the original IPT's resource url and harvest the metadata from that while harvest the data from the hosted, migrated version.

tucotuco commented 10 years ago

Added source_url field to resource_staging. Need to make sure this gets propagated to resource, and that the publisher detail goes to the source_url instead or the page in the field url in VertNet/webapp. Also make sure detail page source url for data rights shows the source_url field contents in VertNet/webapp. Make sure the download also contains that value in the datasource_and_rights field. This may need to happen in harvest to get the right data into the right field to be indexed.

tucotuco commented 10 years ago

Addressed with 28213adf1755e62fe6154bcee8cec32cdce5d5b8