Closed typophyllum closed 9 months ago
Just in case can you share which sources have been fixed?
id=186378, id=173960, id=173961
Any idea how this could have happened? Found by chance fixing source without serial (copying title into search source task and then searching reference by author and year in old OSF, and surprised that first author was missing in TW).
I strongly suggest a check routine to compare old vs migrated database. This is trivial for bibliography: Just export two flat tables from each version, dump them into a new database, and compare the tables using SQL. They should be identical!
I have no clue how this happened but according to import logs surprisingly this is the only instance and looks you found all cases:
hernan@sf12:~/onedb2tw/logs/FINAL$ cat *create_source_roles* | grep -i PersonID | grep -i ERROR
[ERROR]2023-08-26 05:24:07.266: Missing person ERROR: SF.PersonID = 31577 (person_nil_counter = 1)
[ERROR]2023-08-26 05:24:07.583: Missing person ERROR: SF.PersonID = 31577 (person_nil_counter = 2)
[ERROR]2023-08-26 06:41:55.315: Missing person ERROR: SF.PersonID = 31577 (person_nil_counter = 3)
hernan@sf12:~/onedb2tw/working$ cat tblPeople.txt | grep 31577
31577 1 31577 0 Dorjsuren Altanchimeg A. "" 513 0 2023-08-10 18:47:00 110 2023-05-08 16:00:00 148
Logs from both production and an isolated machine (sf12) that ran the same import both have exactly the same set of missing authors (i.e. background activity is not the reason for this problem).
The create_people stage, although it reports some errors when creating alternate spellings because them are duplicates, there is not a single case of a person failing to be created.
I strongly suggest a check routine to compare old vs migrated database.
@klausriede you can help with this general class of problems! The filtering and API endpoints let you run powerful queries, download the results in numerous different formats, and do some sleuthing (perhaps using OpenRefine) to track down data issues?
Dorjsuren, Altanchimeg She is author of three sources which migrated with only the other authors (two papers have altogether 3 authors, one has 4) . Fixed.