gbv / cmo

Corpus Musicae Ottomanicae
GNU General Public License v3.0

some print sources are not indexed #189

Closed. kkrebs closed this issue 4 years ago

kkrebs commented 4 years ago

https://corpus-musicae-ottomanicae.de/receive/cmo_source_00000056 is not indexed because it yields more than one "title.type.alt" search field. This leads to:

Caused by: org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error from server at http://localhost:8983/solr/cmo: Exception writing document id cmo_source_00000056 to the index; possible analysis error: cannot change DocValues type from SORTED to SORTED_SET for field "title.type.alt"
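The message is consistent with a field that was first indexed with a single value per document (SORTED doc values) and then received several values in one document (SORTED_SET). A minimal sketch of a pre-indexing check, assuming the document is available as a list of (field name, value) pairs; `find_repeated_fields` and the sample values are hypothetical and not taken from the CMO code base:

```python
from collections import Counter


def find_repeated_fields(fields):
    """Return {field name: count} for every field that occurs more than once."""
    counts = Counter(name for name, _ in fields)
    return {name: n for name, n in counts.items() if n > 1}


# Hypothetical field list for one document; the title values are made up.
doc_fields = [
    ("id", "cmo_source_00000056"),
    ("title.type.alt", "first alternative title"),
    ("title.type.alt", "second alternative title"),
]

repeated = find_repeated_fields(doc_fields)
print(repeated)
```

Running such a check before sending a document to Solr would point at the offending field name without having to read the server-side stack trace.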

kkrebs commented 4 years ago

I fixed this search error. Title search fields are now created only for titles that are not part of a componentGrp. Searching for TMKlii now returns only one result. If you want to find the expressions too, you have to use a wildcard search: *TMKlii*.

You can test this solution on our test application: https://corpus-musicae-ottomanicae.de/cmo-test/
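The rule described above (title fields only for titles outside a componentGrp) could look roughly like this. It is only an illustration in Python, not the actual index stylesheet; the XML snippet is a simplified, namespace-free assumption and `top_level_titles` is a hypothetical helper:

```python
import xml.etree.ElementTree as ET

# Simplified, hypothetical source record; real records use namespaces.
SAMPLE = """
<source>
  <titleStmt><title type="alt">TMKlii</title></titleStmt>
  <componentGrp>
    <source><titleStmt><title type="alt">TMKlii 1</title></titleStmt></source>
    <source><titleStmt><title type="alt">TMKlii 2</title></titleStmt></source>
  </componentGrp>
</source>
"""


def top_level_titles(element):
    """Collect <title> texts, skipping everything inside <componentGrp>."""
    titles = []
    for child in element:
        if child.tag == "componentGrp":
            continue  # titles of components get no title search field
        if child.tag == "title":
            titles.append((child.text or "").strip())
        titles.extend(top_level_titles(child))
    return titles


root = ET.fromstring(SAMPLE.strip())
titles = top_level_titles(root)
print(titles)
```

With this rule each document contributes at most the titles of the source itself, so the component titles no longer pile up in the same field.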

annplaksin commented 4 years ago

How does the wildcard search work? I can't find the expressions now.

kkrebs commented 4 years ago

Using *TMKlii*, 258 objects are found.

Sorry, I didn't see the formatting adjustment. I fixed the examples.
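To illustrate the difference between the exact and the wildcard query, here is a small sketch. It uses Python's glob-style matching only to mimic the idea and does not reproduce Solr's query parser; the sample values and the `search` helper are made up:

```python
from fnmatch import fnmatchcase

# Hypothetical indexed values; not real catalogue entries.
values = ["TMKlii", "TMKlii, p. 12", "Edition of TMKlii", "other"]


def search(pattern, candidates):
    """Return all candidates that match the glob-style pattern (case-sensitive)."""
    return [v for v in candidates if fnmatchcase(v, pattern)]


exact_hits = search("TMKlii", values)       # only the value that equals the term
wildcard_hits = search("*TMKlii*", values)  # every value that contains the term
print(exact_hits)
print(wildcard_hits)
```

The leading and trailing asterisks are what let the query reach values that merely contain TMKlii.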

annplaksin commented 4 years ago

Oh yes, thanks!

annplaksin commented 4 years ago

By the way, the other "broken source" I mentioned is again the infamous source 1: https://corpus-musicae-ottomanicae.de/cmo-test/receive/cmo_source_00000001 Also, in test we still have the old runtime error caused by iso15924:fra-latn (which can be deleted).

kkrebs commented 4 years ago

I fixed all broken objects. In cmo_source_00000073, "John Masefield" was added as editor, but no link to a cmo_person was set. I fixed this in our index stylesheet, but I'm not sure whether this is the right solution. Is it OK to allow editors without a reference here? https://corpus-musicae-ottomanicae.de/cmo/receive/cmo_source_00000073
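As a rough picture of what "allowing editors without reference" means for indexing, here is a sketch in Python rather than in the stylesheet itself. The field names `editor` and `editor.ref`, the helper `editor_fields`, and the second editor with its ID are all hypothetical:

```python
def editor_fields(editors):
    """Build (field, value) pairs; the reference field is skipped if no ID is set."""
    fields = []
    for editor in editors:
        fields.append(("editor", editor["name"]))
        ref = editor.get("ref")
        if ref:  # only emit a reference when a person link exists
            fields.append(("editor.ref", ref))
    return fields


editors = [
    {"name": "John Masefield"},  # no cmo_person link set
    {"name": "Example Editor", "ref": "cmo_person_00000000"},  # hypothetical ID
]

fields = editor_fields(editors)
print(fields)
```

An editor without a link still becomes searchable by name; only the reference field is left out instead of failing the whole document.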

annplaksin commented 4 years ago

That source was actually added in the test version as a workshop example, which is why no person was added for the editor. Sorry for messing around with that. From my side, it is okay to have editors without a reference, but I'll check with the colleagues whether they want to forbid it.

And thanks for fixing the broken objects.

kkrebs commented 4 years ago

Now also fixed in production. 6967 objects are found in the source catalogue.