IQSS / dataverse-pm

Project management issue tracker for the Dataverse Project. Note: Related links and documents may not be public.
https://dataverse.org
0 stars 0 forks source link

GREI 3: HDV Task - Improve OAI-PMH Harvesting #171

Open cmbz opened 7 months ago

cmbz commented 7 months ago

Overview

"Our proposed project will significantly improve the widely-used Harvard Dataverse repository to better support NIH-funded research. A critical measure of the GREI program’s success is to standardize the discoverability across generalist repositories.

To help with this, we propose to improve the existing harvesting functionality in the Dataverse software based on the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) standard, and coordinate with other repository packaging standards to share or move metadata and data. Dataverse already supports the Bags as defined by the Research Data Alliance (RDA) Research Data Repository Interoperability Working Group.

Here we proposed to improve the support for Bags, test it for NIH-funded datasets, and explore and define the appropriate standard to use to move the metadata and data across generalist repositories This will help with a sustainable and succession plan - if one repository cannot support anymore a specific dataset, it will allow to easily move the dataset to another repository without losing any information about the dataset." (Source)

Issues

Spikes

Harvesting Issues (Year 3)

In Progress

Complete

Pending

These issues will be sized and prioritized once the In Progress issues are closed.

Additional Harvesting Issues (Year 4)

These issues will be prioritized during GREI Year 4 Planning.

Related

Resources

cmbz commented 7 months ago

Status: January 2024

Several issues relating to harvesting other repositories' metadata were resolved for Harvard Dataverse. A number of Dataverse harvesting-related issues and bugs were closed, see list:

Completed

In progress

cmbz commented 6 months ago

Status: February 2024

The following OAI-PMH harvesting improvements shown below were made this month.

Completed

cmbz commented 5 months ago

Status: March 2024

Meetings

A meeting was held on 2024/03/13 to:

Other Work Towards Improving Harvesting

Completed

cmbz commented 5 months ago

Status: April 2024

cmbz commented 5 months ago

Status: June 2024

jggautier commented 2 months ago

@landreev, @cmbz and @scolapasta, today or sometime this week I'm thinking of following up with contacts of the repositories who've emailed our support email addresses about harvesting, to let them know that we're continuing to work on improvements related to how Dataverse indexes records, and some of those improvements may affect harvesting. Most of the emails sitting in my RT queue are harvesting related:

342774511-51b92ddc-676c-42bc-af33-24d7a64d0c5d

cmbz commented 2 months ago

@jggautier Great thanks! When you do, please update the June status comment, too https://github.com/IQSS/dataverse-pm/issues/171#issuecomment-2009648663

cmbz commented 1 month ago

Status: July 2024

cmbz commented 2 weeks ago

Status: August 2024

Updates

Harvesting Issues