uclibs / scholar_uc_legacy

Source code for Scholar@UC up to version 3.x. Replaced by ucrate
Other
5 stars 1 forks source link

FileSet missing depositor_ssim #1917

Closed jamesvanmil closed 6 years ago

jamesvanmil commented 6 years ago

File sets are missing this expected value in the Solr document. We need to dig in to figure out why this is happening, and how we should/could manage occurrences.

This is impacting the stats import.

scherztc commented 6 years ago

scholar@libschpwl1:curate_uc$ bundle exec rake manifest:files RAILS_ENV=production

scherztc commented 6 years ago

There are 44 FileSets in production that don't have an owner or depositor:

05741v149 | woodmanc_phoenecia_thumb.png member_of = https://scholar.uc.edu/concern/media/05741s34c bc386k33m | woodmanc_phoenecia_2015.mov member_of = https://scholar.uc.edu/concern/media/05741s34c bc388c32f | Aniba_ITS.nex member_of = https://scholar.uc.edu/concern/datasets/bc388c315 bc388c340 | Aniba_psbD-trnT.nex member_of = https://scholar.uc.edu/concern/datasets/bc388c315 bc388c358 | Aniba_trnC-rpoB.nex member_of = https://scholar.uc.edu/concern/datasets/bc388c315 bc388c33q | Aniba_psbA-trnH.nex member_of = https://scholar.uc.edu/concern/datasets/bc388c315 bc388c36j | Aniba_trnS-trnG.nex member_of = https://scholar.uc.edu/concern/datasets/bc388c315 bc386j766 | woodmanc_dharmapops_2014.mov member_of = https://scholar.uc.edu/concern/generic_works/1r66j159n bc386j61k | L1_HSV_movie.avi member_of = https://scholar.uc.edu/concern/generic_works/1r66j159n wm117p50x | readme.docx member_of = https://scholar.uc.edu/concern/datasets/wm117p495 wm117p53r | L1_velocity_fields_images.zip member_of = https://scholar.uc.edu/concern/datasets/wm117p495 wm117p52g | L1_velocity_fields_data.zip member_of = https://scholar.uc.edu/concern/datasets/wm117p495 wm117p516 | L1_data.dat member_of = https://scholar.uc.edu/concern/datasets/wm117p495 wm117r802 | L1_HSV_movie-1.mpg member_of = https://scholar.uc.edu/concern/datasets/wm117p495 bc386p052 | woodmanc_stvrainswoods_2013.mp4 member_of = https://scholar.uc.edu/concern/media/wm117q351 wm117q369 | woodmanc_stvrainswoods_thumb.png member_of = https://scholar.uc.edu/concern/media/wm117q351 000000086 | woodmanc_tableofelementsinstalled_2012.mov member_of = https://scholar.uc.edu/concern/media/wm117q49c 057420017 | johnson1891.pdf member_of = https://scholar.uc.edu/concern/documents/05742000z 057420131 | kerl_s1869.pdf member_of = https://scholar.uc.edu/concern/documents/05742012r 057420174 | kirkham1857.pdf member_of = https://scholar.uc.edu/concern/documents/05742016v 057420352 | knox1809.pdf member_of = https://scholar.uc.edu/concern/documents/05742034s 057420416 | knox1809.pdf member_of = https://scholar.uc.edu/concern/documents/05742040x 057420441 | knox1809.pdf member_of = https://scholar.uc.edu/concern/documents/05742043r 057420484 | lewis1899.pdf member_of = https://scholar.uc.edu/concern/documents/05742047v 057420505 | lewis1900.pdf member_of = https://scholar.uc.edu/concern/documents/05742049d 057420662 | macleod1891.pdf member_of = https://scholar.uc.edu/concern/documents/05742065s 057420840 | picket1818.pdf member_of = https://scholar.uc.edu/concern/documents/05742083q 057420904 | powell1882.pdf member_of = https://scholar.uc.edu/concern/documents/05742089c 057420947 | quakenbos1851.pdf member_of = https://scholar.uc.edu/concern/documents/05742093z 057421005 | quakenbos1864.pdf member_of = https://scholar.uc.edu/concern/documents/05742099m 057421162 | roux1847.pdf member_of = https://scholar.uc.edu/concern/documents/05742115s 057421226 | scot1897.pdf member_of = https://scholar.uc.edu/concern/documents/05742121x 057421340 | smith1830.pdf member_of = https://scholar.uc.edu/concern/documents/05742133q 057421383 | spalding1906.pdf member_of = https://scholar.uc.edu/concern/documents/05742137t 057421404 | spalding1896.pdf member_of = https://scholar.uc.edu/concern/documents/05742139c 057421447 | sweet1886.pdf member_of = https://scholar.uc.edu/concern/documents/05742143z 057421561 | thompson1858.pdf member_of = https://scholar.uc.edu/concern/documents/05742155r 057421625 | tower1855.pdf member_of = https://scholar.uc.edu/concern/documents/05742161w 057421782 | wells1846.pdf member_of = https://scholar.uc.edu/concern/documents/05742177s 057421803 | welsh1896.pdf member_of = https://scholar.uc.edu/concern/documents/05742179b 057421846 | welsh1896.pdf member_of = https://scholar.uc.edu/concern/documents/05742183x 057421960 | woodbridge1899.pdf member_of = https://scholar.uc.edu/concern/documents/05742195q 057422061 | zander1869.pdf member_of = https://scholar.uc.edu/concern/documents/05742205r 057422125 | Experiment_3__Hetergeneous_Networks_in_Decline.csv member_of = https://scholar.uc.edu/concern/datasets/zp38wc927

hortongn commented 6 years ago

That's interesting. Looks like not having a depositor breaks them in the GUI as well. If I try to view those file sets in the browser I get a "something went wrong" message.

We need to figure out if these are orphan file sets that just need to be deleted or if we need to fix them. So maybe the next step is to see if they are attached to a work. Do a file.member_of on each file set.

scherztc commented 6 years ago

Next step is to compare with migration manifest to determine proper depositor on File Set.

If we can't then use work depositor and work deposit date.

Loop through and set depositor and date_uploaded

scherztc commented 6 years ago

Missing Depositor Table.xlsx

scherztc commented 6 years ago

@jamesvanmil @hortongn : I set the depositors on all 41 of these FileSets. Let's watch the analytics job and see if everything passes on Monday.

hortongn commented 6 years ago

Closing this since it looks like the changes resolved the stats rake task problems.