Closed xhero closed 8 years ago
For the very log 240 $m: if it is needed as-is we can increase the dimension of std_title to a TEXT field
I would pretty truncate std_title. It does not make sense to have it that long.
Ok for me - maybe we can pretty truncate the single pieces that make it up? Like
Masses (v4); 2vl 3vc etc etc etc etc...; other stuff etc...
instead of Masses (4v); impressively long list of instruments...
Yes, this would be better
Truncate is ok, these records should been corrected by hand later
It seems @HirschSt comment was deleted when I clicked the ckeckbox, sorry!
@HirschSt Is the new data for June 01 already available? I would like to run an import
@xhero I am working on it, but latest fixes from today would be available tomorrow at the earliest (dataset > 20160601)
Could you also drop the offending 700s without $0? they still create problems when loading/saving/reindexing
Ok, I will drop 700 and 710 from sources if $0 missing
@HirschSt could you also add the titles with ""no-publishing"? thanks!
@xhero "non-publishing" records are included in dataset >= 20160601
There are still records with the DE-588a IDS in 700, which still created problems. I added a hard-coded fix in to_internal, also for 852 $x
852$x will be fixed in data and in the export
Data import does not show any errors anymore - most of these seem to be resolved
Well done. They are a few remaining checkboxes...
I am not sure about dropping 856 ... will it be restored in the export, and where are the images hosted? The remaining three sources still are needing more investigation and probably should be fixed in the application
Indexing now is almost perfect! only two records resist:
=710 1#$aMusikverlag N. Simrock Berlin, Leipzig (Berlin$00$4asn
=710 1#$aMusikverlag N. Simrock Berlin, Leipzig (Berlin$00$4fmo
They both have 710 $0 = 0
@xhero SH: will be fixed with the export >=20160608
Oops, two more:
Tag 240: missing master (expected in $0), tag contents: =240 10$mB
Tag 240: missing master (expected in $0), tag contents: =240 10$mS; bc
Not bad, out of 1532188 total reindexed items!
Last check: Import signals these tags as missing:
Subfield 031 $2 missing in the marc configuration
Subfield 594 $a missing in the marc configuration
Subfield 240 $n missing in the marc configuration
Subfield 710 $g missing in the marc configuration
Subfield 772 $t missing in the marc configuration
I checked all of them and they are either dropped or are removed in to_internal, so for me it is OK. I just need this to be double-checked so we have no surprises.
They will all need to be added back in to_external (together with control fields), so keep the list in the appropriate ticket.
I was pretty sure I had added 710$g to sources-conf, so I will add it back (with 54f960a). Other missing tags ok with me too.
130 comes from moving 240 to 130 because of the collection record type, so there is no $0; 950000002 needs updating by hand
950000002 is corrected in dataset >= 20160609