JMSLab / LaroplanOCR

Swedish primary school curricula (Läroplaner för grundskolan) in digital format.
MIT License

Prepare release for Dataverse #9

Closed jmshapir closed 2 years ago

jmshapir commented 2 years ago

The goal of this issue is to prepare a release for posting to the Harvard Dataverse.

We'd like to structure this so as to keep maintenance costs low going forward, i.e., so that a new release on GitHub can easily be pushed as an update to the Dataverse.

Once we have a plan ready I think we can:

santiagohermo commented 2 years ago

Thanks @jmshapir! I like the idea of keeping maintenance costs low.

I envision creating PDF documentation for the Dataverse that resembles the README. A question: would you like the README to be identical to the PDF uploaded to the Dataverse? Some thoughts:

I imagine there are some things we probably don't want to add to the Dataverse, and thus the second option seems like the way to go. What do you think?

jmshapir commented 2 years ago

@santiagohermo does the Dataverse require PDF documentation or will they accept MD or TXT?

If the Dataverse requires a PDF, what if we have:

santiagohermo commented 2 years ago

Thanks @jmshapir! I have been looking around and it doesn't look like the Dataverse asks specifically for a PDF, so MD should be fine.

I like your idea of a CODEBOOK.md and a release.py though. I'll start with that structure in mind.

Required fields for data submission at Harvard Dataverse

![image](https://user-images.githubusercontent.com/45404755/160300171-a9430045-1ac5-4e16-b62b-9f8df6687e21.png) ![image](https://user-images.githubusercontent.com/45404755/160300212-32b7ec72-48da-4a60-8f79-46746246381f.png)

jmshapir commented 2 years ago

Thanks @santiagohermo and sounds good!

Maybe we should include in CODEBOOK.md our text for any non-obvious metadata fields required by the Dataverse. That way, whenever we push to the Dataverse, we'll have a point of reference inside the repository for those fields.

santiagohermo commented 2 years ago

Continues in #10

santiagohermo commented 2 years ago

Summary: In this issue we prepared code to build a release for the Dataverse. After that, we

  1. Added a first release to the repo
  2. Added the data to the Harvard Dataverse: https://github.com/JMSLab/LaroplanOCR/releases/tag/v1.0.0
  3. Set the repository to public

Changes merged to main in https://github.com/JMSLab/LaroplanOCR/commit/a2c1884065bb96fc17444d6c34f183fa9a3ac702.
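
For readers who want a concrete picture of what "code to build a release" could look like, here is a minimal sketch. It is not the repository's actual `release.py`. The function name `build_release`, the `*.txt` file pattern, and the directory layout are all assumptions made for illustration. Only the file names `release.py` and `CODEBOOK.md` come from the discussion above.

```python
import shutil
from pathlib import Path


def build_release(source_dir, release_dir, patterns=("*.txt",), codebook="CODEBOOK.md"):
    """Copy data files and the codebook into release_dir, then zip it.

    Hypothetical helper: the patterns and layout are illustrative assumptions.
    Returns the sorted list of copied file names and the path to the archive.
    """
    source_dir = Path(source_dir)
    release_dir = Path(release_dir)

    # Start from a clean directory so stale files never leak into a release.
    if release_dir.exists():
        shutil.rmtree(release_dir)
    release_dir.mkdir(parents=True)

    # Collect the full file list before copying anything.
    to_copy = [p for pattern in patterns for p in sorted(source_dir.glob(pattern))]
    codebook_path = source_dir / codebook
    if codebook_path.is_file():
        to_copy.append(codebook_path)

    for path in to_copy:
        shutil.copy2(path, release_dir / path.name)

    # make_archive appends ".zip" to the base name and returns the archive path.
    archive = shutil.make_archive(str(release_dir), "zip", root_dir=str(release_dir))
    return sorted(p.name for p in to_copy), Path(archive)
```

Under these assumptions, calling `build_release("output", "release")` would leave a `release/` folder and a `release.zip` next to it, either of which could be uploaded by hand. Keeping the codebook inside the bundle means the documentation and the metadata notes travel with every release.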