aist-oceanworks / mudrod

Mining and Utilizing Dataset Relevancy from Oceanographic Datasets to Improve Data Discovery and Access, online demo: https://mudrod.jpl.nasa.gov/#/
https://mudrod.github.io/
15 stars 15 forks source link

Keyword 'Altimetry' returns nothing.'Altimeter' does. #152

Open Yongyao opened 7 years ago

Yongyao commented 7 years ago

@fgreg I think this has something to do with stemming. If this is the only keyword that does not work. We can just hard code this.

@lewismc Do you think your current ontology results can fix this problem, through link query expansion?

lewismc commented 7 years ago

@Yongyao yes this is exactly the type of thing that any implementation of the Ontology Synonyms API would address. This could be accessed internally, or externally through the OntologyResource. Assigned it to myself.

lewismc commented 7 years ago

Hi @Yongyao, now that I review the process of ontology term expansion, unless there is an equialentClass attribute in the ontology(ies) resource(s), ontology will not help here. This has to do with stemming in Lucene, so we need to look at what kind of pre-processing is going on prior to the index construction. We should iterate on this.

SWEET for example, maintains the concept of Altimetry but as you can see, it is not linked to any other concept... this is to say, that within the SWEET world, Altimetry and Altimeter are two different things. The former is a representation of a methodology and the latter is an instrument which acquires physical measurements of data described by the methodology.