Closed syphax-bouazzouni closed 2 years ago
Friday Fev 4th : bin/ncbo_ontology_process -o THVIANDE,THESAE,GECO,GAO,DSW,CO_360,CO_358, CO_350 => It seems a bug has generated "ERROR_RDF" issues for almost all these ontologies (except THVIANDE) (maybe the issue was cause by not running the script with ontoportal user)
Monday Fev 7th : bin/ncbo_ontology_process -o THESAE bin/ncbo_ontology_process -o GECO,GAO,DSW,CO_360,CO_358,CO_350,CO_324,CO_327,CO_330,CO_337,CO_338,CO_339,CO_340 => Last command failed after CO_358
(while monitoring the server)
Feb 11th: bin/ncbo_ontology_process -o CO_324,CO_327,CO_330,CO_337,CO_338,CO_339
Feb 14th:
Feb 14th: bin/ncbo_ontology_process -o CO_341,CO_343,CO_346,CO_348,CO_350
New ones from Admin UI:
I confirm that parsing GECO in StagePortal fix the indexing issue.
However the problem still appear for the Annotator that certainly adopts the same behaviour (ignoring labels in French) when building its dictionary independently of the language parameters of the portal:
However, for the Annotator, this is a desired behaviour. With AgroPortal not fully multulingual, we have defined the Annotator to handle only labels in english or without language tags.
Process to resolve parsing issues for up-listed ontologies with mostly French labels:
After discussion we decided that we will not change AgroPorta settings to avoid creating a special case with French language. See #219 for details.
Fixing the the up-listed issues in admin requires to update the indexation test that is not in synch with what's actually implemented. @syphax-bouazzouni to look into this.
To avoid having multilingual labels included in the search index by mistake we will be fixing ontologies => #221
Our problems with indexing were related to our handling of multilingual values, where for the case of Agroportal we take only attributes values that are tagged with the English language or that have no language tag.
So for the case of resources (ontologies) that are only in French (or any other language), we don't get any values from the triple store.
Our solution was to update this behavior like detailed here: https://github.com/ontoportal-lirmm/goo/issues/6 and https://github.com/ontoportal-lirmm/goo/issues/8
Ontologies with RDF data, without SOLR data: