metaphacts / semopenalex


Difficulty in Retrieving Institutions Using Acronyms in SPARQL Queries #105

Open yileitu opened 1 week ago

yileitu commented 1 week ago

Description:

I am working on a project that searches SemOpenAlex based on user input. For example, when a user wants to find information about CMU (Carnegie Mellon University), my program executes the following SPARQL query:

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>
PREFIX soa: <https://semopenalex.org/ontology/>
PREFIX terms: <http://purl.org/dc/terms/>
PREFIX org: <http://www.w3.org/ns/org#>
PREFIX Service: <http://www.metaphacts.com/ontologies/platform/service/>
PREFIX entitylookup: <http://www.metaphacts.com/ontologies/platform/service/entitylookup/>

SELECT ?uri ?citedByCount WHERE {
    SERVICE Service:entityLookup {
        ?uri entitylookup:entityName "cmu";
            entitylookup:candidateType soa:Institution.
    }
    ?uri soa:citedByCount ?citedByCount.
}
ORDER BY DESC(?citedByCount)

However, the only result returned is the SYSU-CMU International Joint Research Institute (https://semopenalex.org/institution/I4210129003), which is clearly not the CMU (Carnegie Mellon University) the user is looking for. Other acronyms, such as MIT, JHU, and UMich, likewise fail to retrieve the intended institutions.

I understand that replacing the acronym with the full name (e.g., "Carnegie Mellon University" instead of "CMU") returns the correct results. However, users tend to enter acronyms rather than full names.
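
For illustration, here is a minimal client-side sketch of that full-name workaround in Python. It assumes a hand-maintained acronym table; `ACRONYMS`, `expand_acronym`, `escape_literal`, and `build_query` are hypothetical names of my own and not part of SemOpenAlex or the metaphacts platform. The prefixes and query shape are copied from the query above, and the sketch only builds the query string without sending it to an endpoint.

```python
# Hypothetical, hand-maintained acronym table (an assumption, not a SemOpenAlex feature).
ACRONYMS = {
    "cmu": "Carnegie Mellon University",
    "mit": "Massachusetts Institute of Technology",
    "jhu": "Johns Hopkins University",
    "umich": "University of Michigan",
}

# Same query shape as above; doubled braces are literal braces for str.format.
QUERY_TEMPLATE = """\
PREFIX soa: <https://semopenalex.org/ontology/>
PREFIX Service: <http://www.metaphacts.com/ontologies/platform/service/>
PREFIX entitylookup: <http://www.metaphacts.com/ontologies/platform/service/entitylookup/>

SELECT ?uri ?citedByCount WHERE {{
    SERVICE Service:entityLookup {{
        ?uri entitylookup:entityName "{name}";
            entitylookup:candidateType soa:Institution.
    }}
    ?uri soa:citedByCount ?citedByCount.
}}
ORDER BY DESC(?citedByCount)
"""


def expand_acronym(user_input: str) -> str:
    """Return the full name for a known acronym, else the trimmed input unchanged."""
    cleaned = user_input.strip()
    return ACRONYMS.get(cleaned.lower(), cleaned)


def escape_literal(text: str) -> str:
    """Escape backslashes, double quotes, and newlines for a SPARQL string literal."""
    return (
        text.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
    )


def build_query(user_input: str) -> str:
    """Expand the acronym (if known) and fill it into the lookup query."""
    name = escape_literal(expand_acronym(user_input))
    return QUERY_TEMPLATE.format(name=name)


if __name__ == "__main__":
    print(build_query("cmu"))
```

The table covers only acronyms listed by hand, so it does not scale to arbitrary user input.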

Question: Do you have any suggestions or solutions for handling such cases where users input acronyms instead of full institution names? How can we improve the accuracy of our queries to match user expectations?

Thank you for your assistance.

yileitu commented 3 days ago

Hi SemOpenAlex Team, any updates?