makcedward / nlpaug

Data augmentation for NLP
https://makcedward.github.io/
MIT License
4.46k stars 463 forks source link

allow to use word embeddings models with different formats #305

Open IgorMunizS opened 2 years ago

IgorMunizS commented 2 years ago

Hi,

I had some problems when using word embeddings models from other languages due to the file format. Currently, we have only three options of models with fixed format parameters (binary and no_header). I propose to create a "custom" model where the user can input those parameters and avoid unnecessary complications since gensim library already allows it.

Best,

makcedward commented 2 years ago

Can you add some test cases for that?