CUB-Libraries-CTA / counter-data-loader

Loads COUNTER database from JR1 report spreadsheets
1 stars 2 forks source link

Review Folio Spreadsheet Export (5) #74

Closed ericnienhouse closed 1 year ago

ericnienhouse commented 1 year ago

Estimate: 5

Tests have been done with the Folio service and a sample spreadsheet. These sheets look similar, however, columns typically with value of '0' are empty. These fields can assume a default of '0' for the preprocessor and loader.

Acceptance Criteria:

Review and compare Folio export spreadsheet to current COUNTER sheets. Identify list of concerns and possible changes needed to code.

Consider: Add to error reporting when missing data found.

Timebox to 3-4 day.

ericnienhouse commented 1 year ago

Note: In the future we may use SUSHI or Folio APIs to access the Folio spreadsheet data. Note: 2021 J3 HiWire spreadheet also has "a few" missing fields (request from VS) Note: 2022 J3 HiWire spreadheet also has missing fields (request from VS)

bonnland commented 1 year ago

After adding some checks to address missing values, the 2021 J3 HiWire spreadsheet was loaded successfully into my local mySQL database.

bonnland commented 1 year ago

The Oxford Academic spreadsheet downloaded from Folio was also processed without error.

Therefore, there is evidence for minimal or no differences between Folio spreadsheets and those obtained directly from publishers.