FHNW-IVGI / Geoharvester

NDGI Project Geoharvester
10 stars 1 forks source link

[NLP] Training data for classification model #31

Open eliaferrari opened 1 year ago

eliaferrari commented 1 year ago

Description

The classification of the datasets into INSPIRE or eCH categories requires more training data to work properly. In order to create the training dataset, the classified datasets form Swisstopo can be used. This will require a data pipeline to retrieve the corresponding metadata/classes from geocat. Once completed the training datasets can be used to train a DL-model in the preprocessing phase using Spacy and extending the class NLP_Spacy.