boudinfl / pke

Python Keyphrase Extraction module
GNU General Public License v3.0
1.57k stars 291 forks source link

Langcode "ge" does not work with spacy #201

Closed cbartz closed 2 years ago

cbartz commented 2 years ago

German langcode ge https://github.com/boudinfl/pke/blob/master/pke/lang.py#L22 is not recognized by spacy, which leads to a missing stopwords assignment in https://github.com/boudinfl/pke/blob/master/pke/lang.py#L34-L40 .

Observed with pke 2.0.0 and spacy 3.3.1.

shyambhu-mukherjee commented 2 years ago

isn't that supposed to be de?

cbartz commented 2 years ago

Yes.