grwells / TickBase

Data storage for web crawler results from TickBase project, summer 2021.
0 stars 1 forks source link

Author Format Standardization for DSpace 7.x #6

Closed grwells closed 2 years ago

grwells commented 2 years ago

DSpace 7.x uses a standardized item creation format that is more strict around author name formatting than DSpace 6.x. Currently all CSVs store authors in first-last name format with authors separated by commas.

DSpace 7.x separates authors with semicolons, allowing for author names to be universally formatted in last-first name format.

grwells commented 2 years ago

Modified Interfaces (untested)

  1. Mendeley
  2. Mendeley Data
  3. Data Dryad
  4. Springer

Unknown/Unmodified Interfaces

  1. Neon
  2. PubMed
  3. Figshare (has no author metadata field)
  4. KNB
  5. LTER
grwells commented 2 years ago

Testing

Tested and debugged Mendeley and Mendeley Data author output with the new standards. Decided that for sources that do not specify first/last names no changes should be made to the order when formatting for DSpace.

Tested (working)
  1. Mendeley Data
  2. Mendeley
  3. LTER
  4. Figshare (no author fields)
  5. Neon (no author fields)
  6. KNB
  7. PubMed
  8. Data Dryad
  9. Springer Nature