OregonDigital / OD2-migration

Project board for migration tracking

Texts and Data (texts-data) #122

Open lsat12357 opened 2 years ago

lsat12357 commented 2 years ago

Item Count: 162 items

Item Types: Document/Audio/Generic

Access Restrictions: 161 items

Complex Objects: 24

pid List: https://github.com/OregonDigital/OD2-migration/blob/master/texts-data lists generated 20220624

lsat12357 commented 2 years ago

preflight clean; exported 20220712; migrated ch 20220715

A LOT of fails in the children batch (94 out of 134). It looks like postgres possibly got overwhelmed partway through the batch. The assets exist but will need the usual fixes.

Oh wow. It looks like most of the damage got fixed by deleting one bad fileset that was throwing an LDP error; that bad fileset had in turn caused indexing to fail for a lot of assets during nested indexing. The deleted fileset, for df661w41j, is an audio file (mpeg), which currently doesn't ingest.

Verification: fx71bs65g is missing pages, rerunning derivatives. Fixed.

There are a handful of assets that don't have thumbnails. It's unclear to me atm what the expected behavior is for some of these filetypes. Once the mpeg fail is resolved, I'm inclined to let QA decide if the remaining assets are ok. The assets that are missing thumbnails appear to be spreadsheets. Going to pass on these: 2r36vc13c 2r36vc27q 2r36vc718 2r36vc79g 2r36vd04m 2r36vc97d 2r36vc75c 2r36vc99z 2r36vc921

df661w41j: retrying attach file (audio). mpeg, required manual fix for duration, seems to be working now.

So, more to this saga: many of these came in as generic assets, not because they are cpds, but apparently because they were file types that OD1 did not know what else to do with. Verification does not flag missing filesets (i.e. it's ok for cpds to come in with no fileset). However, because of the way we temporarily have indexing set up, these generic assets also refused to index URIs until there was a child attached (normally either the fileset or, in the case of cpds, the child assets). So this is how I finally realized which assets were missing their filesets: they were missing all their labels.
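
For anyone repeating this check on another collection, the "missing labels means missing fileset" heuristic above could be sketched roughly as below. This is only an illustration: the `Asset` class and all of its field names (`model`, `is_cpd`, `child_ids`, `labels`) are hypothetical stand-ins rather than OD2's actual schema, and the sample pids are placeholders.

```python
from dataclasses import dataclass, field


@dataclass
class Asset:
    # Hypothetical record shape; these are not OD2's real field names.
    pid: str
    model: str
    is_cpd: bool = False
    child_ids: list = field(default_factory=list)
    labels: list = field(default_factory=list)


def likely_missing_fileset(asset):
    """True for a non-cpd generic asset that has no indexed labels.

    Assumption from the comment above: a generic asset does not index
    its URI labels until a child (fileset or child asset) is attached.
    """
    return asset.model == "Generic" and not asset.is_cpd and not asset.labels


def flag_assets(assets):
    """Return the pids of assets that look like they lost their fileset."""
    return [a.pid for a in assets if likely_missing_fileset(a)]


# Placeholder data, not real pids from this collection.
assets = [
    Asset("example-1", "Generic"),
    Asset("example-2", "Generic", child_ids=["fs-1"], labels=["Dataset"]),
    Asset("example-3", "Generic", is_cpd=True, child_ids=["c-1"], labels=["Report"]),
    Asset("example-4", "Document", child_ids=["fs-2"], labels=["Text"]),
]
print(flag_assets(assets))
```

Cpds are skipped on purpose, since it is ok for them to come in with no fileset.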

verification ok, as far as I can tell.

sseymore commented 2 years ago

textsanddata_pids.txt

@lsat12357 pids to review for QA attached.

sarahfish07 commented 1 year ago

QA FAIL

QA NOTE: QA COMPLETE Overall:

Search/indexing/faceting check:

Facets:

Work pages:

Collection pages:

sarahfish07 commented 1 year ago

NOTE: I double-checked the links above that are missing viewers. UPDATE: Works are still missing viewers. Update 11/7; 11/14: viewer still missing; 11/21: same errors.

sseymore commented 1 year ago

We need to restructure the datasets whose viewers are not working. QA pass; move forward for bulk review.

sarahfish07 commented 1 year ago

No URI errors.

sarahfish07 commented 1 year ago

Fixed descriptions, access restrictions. QA'ed. No other metadata errors found. 113/162 works are missing viewers:

  1. 2r36vd14v
  2. 2r36vd29g
  3. 2r36vd27x
  4. 2r36vd36w
  5. 2r36vd23t
  6. 2r36vd25c
  7. 2r36vd31h
  8. 2r36vd218
  9. 2r36vd34b
  10. 2r36vd307
  11. 2r36vd286
  12. 2r36vd13k
  13. 2r36vd22j
  14. 2r36vd17p
  15. 2r36vd200
  16. 2r36vd197
  17. 2r36vd25c
  18. 2r36vd26n
  19. 2r36vd200
  20. 2r36vd154
  21. 2r36vd16d
  22. 2r36vc98p
  23. 2r36vc629
  24. 2r36vc45n
  25. 2r36vc387
  26. 2r36vd03b
  27. 2r36vd065
  28. 2r36vc11t
  29. 2r36vc700
  30. 2r36vd32s
  31. 2r36vd18z
  32. 2r36vd35m
  33. 2r36vc22b
  34. 2r36vc06z
  35. 2r36vc939
  36. 2r36vc17g
  37. 2r36vc82s
  38. 2r36vc590
  39. 2r36vc64v
  40. 2r36vc964
  41. 2r36vc611
  42. df661w743
  43. 2r36vc123
  44. 2r36vc94k
  45. 2r36vc565
  46. 2r36vc832
  47. 2r36vc57f
  48. 2r36vc88f
  49. 2r36vd01s
  50. 2r36vc53b
  51. 2r36vc95v
  52. 2r36vc034
  53. 2r36vc077
  54. 2r36vc255
  55. df661w36p
  56. 2r36vc51s
  57. 2r36vc697
  58. 2r36vc476
  59. 2r36vc37z
  60. 2r36vc15x
  61. 2r36vc18r
  62. 2r36vc743
  63. 2r36vc41j
  64. 2r36vc66d
  65. 2r36vc09s
  66. 2r36vc67p
  67. 2r36vc301
  68. fx71bc11p
  69. 2r36vd10r
  70. 2r36vc08h
  71. 2r36vc319
  72. 2r36vc58q
  73. 2r36vc35d
  74. 2r36vc89q
  75. 2r36vc39h
  76. 2r36vc04d
  77. df661w75c
  78. 2r36vc786
  79. 2r36vc875
  80. 2r36vc344
  81. 2r36vc32k
  82. 2r36vc73t
  83. 2r36vc85m
  84. 2r36vc23m
  85. 2r36vc91r
  86. 2r36vc48g
  87. 2r36vc433
  88. 2r36vc921
  89. 2r36vc26f
  90. 2r36vc99z
  91. 2r36vd022
  92. 2r36vd08q
  93. 2r36vd090
  94. 2r36vc212
  95. 2r36vc654
  96. 2r36vc46x
  97. 2r36vc75c
  98. 2r36vc54m
  99. 2r36vc97d
  100. 2r36vc86w
  101. 2r36vd04m
  102. 2r36vc191
  103. 2r36vc79g
  104. 2r36vc166
  105. 2r36vc42t
  106. 2r36vd111
  107. 2r36vc77x
  108. 2r36vc27q
  109. 2r36vc718
  110. 2r36vc13c
  111. 2r36vc298
  112. 2r36vc50h
  113. 2r36vd07f
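
The list above, as posted, repeats two pids (2r36vd25c and 2r36vd200). A quick duplicate check before handing a pid list on might look like the sketch below; `find_duplicate_pids` is a hypothetical helper, and the sample is only items 6 and 15 to 19 of the list, not all 113 entries.

```python
from collections import Counter


def find_duplicate_pids(pids):
    """Return pids that occur more than once, in first-seen order."""
    counts = Counter(pids)
    # dict.fromkeys keeps one copy of each pid while preserving order.
    return [pid for pid in dict.fromkeys(pids) if counts[pid] > 1]


# Excerpt from the list above (items 6 and 15-19).
sample = [
    "2r36vd25c",
    "2r36vd200",
    "2r36vd197",
    "2r36vd25c",
    "2r36vd26n",
    "2r36vd200",
]
print(find_duplicate_pids(sample))
```
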
sarahfish07 commented 11 months ago

Checked for copyright issues: no dates fall between 1924 and 1927.

sarahfish07 commented 8 months ago

Checked copyright for 1928. No updates needed.