NationalLibraryOfNorway / meteor

A python module and REST API for automatic extraction of metadata from PDF files
Apache License 2.0
11 stars 2 forks source link

Feature request: Language detection for Finnish and Swedish #4

Closed osma closed 1 year ago

osma commented 1 year ago

The current language detection works only on a few languages used in Norway.

Could you please add models for detecting Finnish and Swedish as well? I understood that these exist elsewhere and just need to be copied to this project.

Thanks!

pierrebeauguitte commented 1 year ago

Now that we have replaced our previous language detection module with langdetect, Finnish and Swedish languages should be recognized properly. Once we find a proper way to reintegrate the previous models, that are optimized for Nordic languages, we will make sure to keep supporting Finnish and Swedish (and more).

osma commented 1 year ago

Perfect, thanks a lot @pierrebeauguitte !