dbpedia / GSoC

Google Summer of Code organization
37 stars 27 forks source link

Extend the Extraction Framework for your language #9

Open mgns opened 6 years ago

mgns commented 6 years ago

Effort

1-2 days

Skills

basic maven, scala

Description

The DBpedia extraction framework has a default configuration that is language agnostic. However, language specific configuration can boost the coverage and precision of the extracted data for that particular language. We keep all language specific configurations here. Browse through the code and try to see how you can improve existing languages of provide configuration for a new one.

Impact

Improvements in the data quality & quantity for a particular language

AnubhavUjjawal commented 5 years ago

I created a pull request in reference to this warm-up task.

JJ-Author commented 5 years ago

see e.g. also https://github.com/dbpedia/extraction-framework/commit/f60edd4002081ef7350f2a1111bac47b52f1541c which is not yet merged to master but shows some language specific configuration for the number parser

AnubhavUjjawal commented 5 years ago

Ok? Should I check the language specific configurations made in https://github.com/dbpedia/extraction-framework/commit/f60edd4002081ef7350f2a1111bac47b52f1541c if they are correct or not?

JJ-Author commented 5 years ago

no it was just an additional note. in case other people do use this as warmup task as well that they do not duplicate the work already done ;-). luckily in your case that did not happen