JohnSnowLabs / spark-nlp

State of the Art Natural Language Processing
https://sparknlp.org/
Apache License 2.0

It's too slow to download a model #14251

Closed xueyuan1990 closed 4 months ago

xueyuan1990 commented 4 months ago

Link to the documentation pages (if available)

It takes too much time to download the model at this link: https://sparknlp.org/2022/04/11/bert_embeddings_chinese_roberta_wwm_ext_zh_3_0.html

How could the documentation be improved?

Please don't limit the download speed.

maziyarpanahi commented 4 months ago

All the models and pipelines are hosted on AWS S3. It has the highest bandwidth, and it's impossible for us to control it. Your location or network must be throttling access to the AWS S3 service.

You can test it on Google Colab to see how fast it downloads.
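
To compare the two environments, something like the following could be run once locally and once on Colab. It is only a rough sketch using the Python standard library: the helper names `measure_throughput` and `measure_url` are made up for illustration, and the URL you pass in is whatever direct download link you are testing (none is assumed here).

```python
import io
import time
from urllib.request import urlopen


def measure_throughput(stream, chunk_size=1 << 20, max_bytes=None):
    """Read a binary file-like object and return (bytes_read, seconds, MB_per_s).

    Hypothetical helper for illustration; not part of Spark NLP.
    """
    total = 0
    start = time.perf_counter()
    while max_bytes is None or total < max_bytes:
        # Never request more than the remaining byte budget.
        want = chunk_size if max_bytes is None else min(chunk_size, max_bytes - total)
        chunk = stream.read(want)
        if not chunk:  # end of stream
            break
        total += len(chunk)
    elapsed = time.perf_counter() - start
    rate = (total / 1e6) / elapsed if elapsed > 0 else float("inf")
    return total, elapsed, rate


def measure_url(url, max_bytes=50_000_000):
    """Download at most max_bytes from url and report the observed throughput."""
    with urlopen(url) as response:
        return measure_throughput(response, max_bytes=max_bytes)


if __name__ == "__main__":
    # Offline smoke check against an in-memory buffer; replace with
    # measure_url("<your download link>") to test a real connection.
    size, seconds, mb_per_s = measure_throughput(io.BytesIO(b"x" * 5_000_000))
    print(f"read {size} bytes in {seconds:.4f} s ({mb_per_s:.1f} MB/s)")
```

If the rate on Colab is much higher than the rate on your own machine for the same link, the bottleneck is most likely somewhere between your network and S3 rather than on the hosting side.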