iftechfoundation / ifarchive-admintool

Admin script for IF Archive work
1 stars 1 forks source link

Import web-upload.log into the uploads database #10

Closed erkyrath closed 10 months ago

erkyrath commented 11 months ago

...so that the "info" button works for old files.

A bit of a nuisance, since we'll have to match by filename to find the MD5.

erkyrath commented 10 months ago

Started messing with this, but it's going to be imperfect at best. We have a lot of cases where filenames were changed after uploading.

We can match by file size, too (within a kilobyte). That will help.

erkyrath commented 10 months ago

Done, as much as possible.

We implemented the upload form in June 2017. Since then, we've gotten 1883 uploads through it. I was able to match 1428 of those upload record against existing Archive files -- although this is a fuzzy match and not guaranteed accurate! I then generated DB records from that data.

(I relied on a fuzzy-match of the filename or zipped filename, combinred with a file size check.)