semantic-systems / nfdi-search-engine

A lightweight, KG-driven search engine over different endpoints and APIs
https://nfdi-search.nliwod.org/
MIT License

Integrate Research Funding Databases #47

Closed: Najeeb-Shams closed this issue 1 year ago

RicardoUsbeck commented 1 year ago

Important databases are GEPRIS (German) and CORDIS (EU).

Najeeb-Shams commented 1 year ago

Well, should we integrate one or both of them, or should we first explore alternatives and then decide which ones to integrate into NFDI SE?

RicardoUsbeck commented 1 year ago

Well, I would integrate exactly these two since they are most important.

In general, issues should be very concrete so that we can assess whether a ticket can be done in a given time frame.

Najeeb-Shams commented 1 year ago

Hi Ricardo, CORDIS is integrated, but GEPRIS currently only provides HTML data. What do you suggest?

RicardoUsbeck commented 1 year ago

It looks like the GEPRIS HTML is parseable. Have you tried Beautiful Soup plus XQuery/XPath?

Najeeb-Shams commented 1 year ago

I haven't tried Beautiful Soup with XQuery or XPath yet; I will give it a try.
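
For illustration, here is a minimal sketch of pulling project titles and links out of an HTML result list. It uses Python's standard-library `html.parser` instead of Beautiful Soup so that it runs without extra dependencies. The sample markup, the `/project/…` links, and the names `ProjectLinkParser` and `extract_projects` are all hypothetical; the real GEPRIS pages may be structured quite differently. The parser collects roughly what the XPath `//h2//a` would select.

```python
from html.parser import HTMLParser

# Hypothetical markup for illustration only; real GEPRIS pages may differ.
SAMPLE_HTML = """
<div class="results">
  <div class="result"><h2><a href="/project/1">Knowledge Graphs for Research Data</a></h2></div>
  <div class="result"><h2><a href="/project/2">Semantic Search over Funding Data</a></h2></div>
</div>
"""


class ProjectLinkParser(HTMLParser):
    """Collect (title, href) pairs from <a> tags nested inside <h2> elements."""

    def __init__(self):
        super().__init__()
        self.projects = []   # accumulated (title, href) tuples
        self._in_h2 = False  # True while inside an <h2> element
        self._href = None    # href of the link currently being read
        self._text = []      # text fragments of the current link

    def handle_starttag(self, tag, attrs):
        if tag == "h2":
            self._in_h2 = True
        elif tag == "a" and self._in_h2:
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only keep text that belongs to a link we are tracking.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            title = "".join(self._text).strip()
            self.projects.append((title, self._href))
            self._href = None
        elif tag == "h2":
            self._in_h2 = False


def extract_projects(html):
    """Return a list of (title, href) tuples found in the given HTML string."""
    parser = ProjectLinkParser()
    parser.feed(html)
    parser.close()
    return parser.projects


projects = extract_projects(SAMPLE_HTML)
for title, href in projects:
    print(title, href)
```

With Beautiful Soup the same selection would typically be written with `find_all` or a CSS selector via `select`; Beautiful Soup itself does not evaluate XPath, which is available through `lxml` instead.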

huntila commented 1 year ago

Both GEPRIS (German) and CORDIS (EU) are integrated.