facebookresearch / fastText

Library for fast text representation and classification.
https://fasttext.cc/
MIT License
25.87k stars 4.71k forks

Add documentation about Hugging Face integration #1335

Closed (sheonhan closed 1 year ago)

sheonhan commented 1 year ago

Word vectors for 157 languages, as well as the language identification model, are now hosted on the Hugging Face Hub. (cc @ajoulin)

A newer language model referred to in the NLLB project is not mentioned on the official website, so I updated the docs accordingly.

facebook-github-bot commented 1 year ago

Hi @sheonhan!

Thank you for your pull request and welcome to our community.

Action Required

In order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you.

Process

In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (e.g., your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with CLA signed. The tagging process may take up to 1 hour after signing. Please give it that time before contacting us about it.

If you have received this in error or have any questions, please contact us at cla@meta.com. Thanks!

sheonhan commented 1 year ago

Hi @dmitryvinn, I was hoping to get this documentation update merged (I've spoken with Juan Pino and @ajoulin as part of this project) so that we can officially announce fastText's integration on the Hugging Face Hub. Let me know if you need anything on my end or if there's anyone else I should ping!

facebook-github-bot commented 1 year ago

@jmp84 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot commented 1 year ago

@jmp84 merged this pull request in facebookresearch/fastText@48171ac6470f7b3638acc2699e0a370793d33fdd.

sheonhan commented 1 year ago

Hi @jmp84, thanks for merging the PR. It doesn't seem like this change has been reflected in prod. Could you help us with that?

Celebio commented 1 year ago

Hi @sheonhan, could you clarify what you mean by "prod"? It looks like the changes are reflected in the main branch: https://github.com/facebookresearch/fastText/commit/48171ac6470f7b3638acc2699e0a370793d33fdd

Best regards, Onur

sheonhan commented 1 year ago

Hi @Celebio, yes, it looks like the changes aren't reflected on the website: https://fasttext.cc/docs/en/crawl-vectors.html

Thanks for taking a look!