ukwa / ukwa-services

Deployment configuration for all UKWA services stacks.
Apache License 2.0
4 stars 5 forks source link

Small number of topics and themes not available in search #91

Open jasonwebber-bl opened 2 years ago

jasonwebber-bl commented 2 years ago

A number of live topics and theme collections are not available to browse or search though the T&T page.

Two are currently identified: Gaelic life on the web - https://www.webarchive.org.uk/en/ukwa/collection/2898 New media writing prize - https://www.webarchive.org.uk/en/ukwa/collection/2912

I suspect there might be some more but it is difficult to check.

jasonwebber-bl commented 2 years ago

Example of collection that is published but not appearing on the live page: image Once in the edit, nicola noticed that they have tried to add it to catgories but these changes haven't gone through yet. This seems to be the case for the missing collections.

jasonwebber-bl commented 2 years ago

Is this still being worked on? The number of collections available has gone down from 111 on live to 101 on dev? I think there should be 125-126 in total.

anjackson commented 2 years ago

Not sure if it's related, but I'm seeing some weird errors:

access_website_ukwa-ui.1.hwgzsgujtg78@prod1    | org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error from server at http://solr.api.wa.bl.uk: Bad Request
access_website_ukwa-ui.1.hwgzsgujtg78@prod1    |
access_website_ukwa-ui.1.hwgzsgujtg78@prod1    | request: http://solr.api.wa.bl.uk/solr/all/select?q=collection: &wt=json&version=2.2
access_website_ukwa-ui.1.hwgzsgujtg78@prod1    |        at org.apache.solr.client.solrj.impl.HttpSolrClient.executeMethod(HttpSolrClient.java:610) ~[solr-solrj-6.4.1.jar!/:6.4.1 72f75b2503fa0aa4f0aff76d439874feb923bb0e - jpountz
- 2017-02-01 14:49:08]
access_website_ukwa-ui.1.hwgzsgujtg78@prod1    |        at org.apache.solr.client.solrj.impl.HttpSolrClient.request(HttpSolrClient.java:279) ~[solr-solrj-6.4.1.jar!/:6.4.1 72f75b2503fa0aa4f0aff76d439874feb923bb0e - jpountz - 2017
-02-01 14:49:08]
access_website_ukwa-ui.1.hwgzsgujtg78@prod1    |        at org.apache.solr.client.solrj.impl.HttpSolrClient.request(HttpSolrClient.java:268) ~[solr-solrj-6.4.1.jar!/:6.4.1 72f75b2503fa0aa4f0aff76d439874feb923bb0e - jpountz - 2017
-02-01 14:49:08]
access_website_ukwa-ui.1.hwgzsgujtg78@prod1    |        at org.apache.solr.client.solrj.SolrClient.request(SolrClient.java:1219) ~[solr-solrj-6.4.1.jar!/:6.4.1 72f75b2503fa0aa4f0aff76d439874feb923bb0e - jpountz - 2017-02-01 14:49
:08]
access_website_ukwa-ui.1.hwgzsgujtg78@prod1    |        at com.marsspiders.ukwa.solr.SolrCommunicator.sendRequest(SolrCommunicator.java:98) ~[classes!/:1.4.7.RELEASE]
access_website_ukwa-ui.1.hwgzsgujtg78@prod1    |        at com.marsspiders.ukwa.solr.SolrSearchService.sendRequestCheckCollection(SolrSearchService.java:213) [classes!/:1.4.7.RELEASE]
access_website_ukwa-ui.1.hwgzsgujtg78@prod1    |        at com.marsspiders.ukwa.solr.SolrSearchService.searchContent(SolrSearchService.java:200) [classes!/:1.4.7.RELEASE]
access_website_ukwa-ui.1.hwgzsgujtg78@prod1    |        at com.marsspiders.ukwa.controllers.CollectionController.searchAnyFullTextIndex(CollectionController.java:295) [classes!/:1.4.7.RELEASE]
access_website_ukwa-ui.1.hwgzsgujtg78@prod1    |        at com.marsspiders.ukwa.controllers.CollectionController.collectionOverviewPage(CollectionController.java:122) [classes!/:1.4.7.RELEASE]
access_website_ukwa-ui.1.hwgzsgujtg78@prod1    |        at sun.reflect.GeneratedMethodAccessor244.invoke(Unknown Source) ~[na:na]

Fascinatingly, the BETA site has different, more up to date categories: https://beta.webarchive.org.uk/en/ukwa/category

anjackson commented 2 years ago

Hah, seems it was much more basic. The Solr service that the production website uses had not been updated properly since moving off the 'access' server. Attempting to update now.

anjackson commented 2 years ago

Okay, service is now updated to latest W3ACT data. There do seem to be 102 collections. Where does the number 125-126 come from?

nicolabingham commented 2 years ago

Hi @jasonwebber-bl you can check the number of collections that should be available by looking in ACT at the Collections (Taxonomies) list. It means looking into each one to see which are marked 'published', which is a bit manual, but that will give you the up to date figure.