WolfgangFahl / ProceedingsTitleParser

Shallow Semantic Parser to extract metadata from scientific proceedings titles
Apache License 2.0
3 stars 1 forks source link

Add extract "scrape" mode #14

Closed WolfgangFahl closed 4 years ago

WolfgangFahl commented 4 years ago

Given a list of urls on "known" source sites like:

the search should visit the given urls and extract the meta data according to the API or structure of the target pages.

Acceptance Criterion: Situation The list of URLs

is given Action search and extract Expected Result

  1. DL4KG2020: Proceedings of the Workshop on Deep Learning for Knowledge Graphs (DL4KG2020) (http://ceur-ws.org/Vol-2635/)
  2. BlockSW-CKG 2019: Proceedings of the Blockchain enabled Semantic Web Workshop (BlockSW) and Contextualized Knowledge Graphs (CKG) Workshop (http://ceur-ws.org/Vol-2599/)
  3. SemTab 2019: Proceedings of the Semantic Web Challenge on Tabular Data to Knowledge Graph Matching (http://ceur-ws.org/Vol-2553/)
  4. DI2KG2019: Proceedings of the 1st International Workshop on Challenges and Experiences from Data Integration to Knowledge Graphs (http://ceur-ws.org/Vol-2512/)
  5. KGB-LASCAR 2019: Joint Proceedings of the 1st International Workshop on Knowledge Graph Building and 1st International Workshop on Large Scale RDF Analytics (http://ceur-ws.org/Vol-2489/)