-
Take the sentence:
Mr. Sherlock Holmes , who was usually very late in the mornings , save upon those not infrequent occasions when he was up all night , was seated at the breakfast table .
The c…
-
### System Info
huggingface_hub==0.19.4; all other configuration matches the official setup.
### Who can help?
@abmfy
### Information
- [X] The official example scripts
- [ ] My own modified scripts…
-
> Hugging Face Transformers is an open-source framework for deep learning created by Hugging Face. It provides APIs and tools to download state-of-the-art pre-trained models and further tune them to m…
-
Hi, I noticed that the transformer model downloaded with huggingface-cli from Tencent-Hunyuan/HunyuanDiT has a different structure than the transformer model in the Diffusers pipeline. As a result, I cannot appl…
-
### Model description
"Attention Is All You Need" is a landmark 2017 research paper authored by eight scientists working at Google, responsible for expanding 2014 attention mechanisms proposed by Bah…
-
https://github.com/EricLBuehler/mistral.rs was released recently. It is a really nice solution that, besides Mistral, adds support for a lot of Hugging Face models.
By implementing SimplePrompt, existing transfo…
-
After waiting 10 minutes, I get this message 🤷‍♂️
Due to a bug fix in https://github.com/huggingface/transformers/pull/28687, transcription using a multilingual Whisper model will default to language detec…
-
Hi,
Deformable DETR is now available in 🤗 Transformers: https://huggingface.co/docs/transformers/main/en/model_doc/deformable_detr.
All checkpoints are on the hub: https://huggingface.co/models?…
-
In transformers, as a rule, we always load models in `float32` for stability, even if the weights are stored in `bfloat16`. As a result, loading `llama-3-8B` can't be done lazily via mmap, since we have to …
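To make the cost concrete, here is a rough back-of-the-envelope sketch. The parameter count is an assumed round figure (8 billion) rather than the exact size of any checkpoint, and `weight_bytes` and `needs_copy` are hypothetical helpers for illustration, not part of transformers.

```python
# Bytes per element for the dtypes mentioned above.
BYTES_PER_DTYPE = {"float32": 4, "bfloat16": 2}


def weight_bytes(num_params: int, dtype: str) -> int:
    """Approximate size of the weights alone, ignoring buffers and overhead."""
    return num_params * BYTES_PER_DTYPE[dtype]


def needs_copy(stored_dtype: str, load_dtype: str) -> bool:
    """A dtype change means the bytes on disk cannot be used as-is,
    so a fresh buffer has to be allocated and filled."""
    return stored_dtype != load_dtype


# Assumption: a round 8e9 parameters, used only for illustration.
ASSUMED_PARAMS = 8_000_000_000

on_disk = weight_bytes(ASSUMED_PARAMS, "bfloat16")
in_memory = weight_bytes(ASSUMED_PARAMS, "float32")

print(f"stored as bfloat16: {on_disk / 1e9:.0f} GB")
print(f"loaded as float32:  {in_memory / 1e9:.0f} GB")
print(f"copy required:      {needs_copy('bfloat16', 'float32')}")
```

Under these assumptions the upcast doubles the footprint of the weights, and because the dtype changes, the loaded tensors cannot simply alias the file contents.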
-
### System Info
While working on https://github.com/huggingface/transformers/pull/31828 I realized that `GroundingDinoProcessor.__call__` passes the kwargs only to the `self.tokenizer` which is not i…
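A toy reproduction of the forwarding pattern described above, in plain Python. `ToyTokenizer`, `ToyImageProcessor`, and `ToyProcessor` are hypothetical stand-ins and not the real `GroundingDinoProcessor`; the sketch only illustrates what "kwargs go only to the tokenizer" means in practice.

```python
class ToyTokenizer:
    """Hypothetical stand-in: records whichever kwargs it receives."""

    def __call__(self, text, **kwargs):
        return {"text": text, "tokenizer_kwargs": kwargs}


class ToyImageProcessor:
    """Hypothetical stand-in: records whichever kwargs it receives."""

    def __call__(self, images, **kwargs):
        return {"images": images, "image_kwargs": kwargs}


class ToyProcessor:
    """Assumed simplification of a processor wrapping both components."""

    def __init__(self):
        self.tokenizer = ToyTokenizer()
        self.image_processor = ToyImageProcessor()

    def __call__(self, images, text, **kwargs):
        # All extra kwargs are forwarded to the tokenizer only.
        encoding = self.tokenizer(text, **kwargs)
        # The image processor is called without any of them.
        image_encoding = self.image_processor(images)
        return {**encoding, **image_encoding}


out = ToyProcessor()(images=["img"], text="a cat", padding=True)
print(out["tokenizer_kwargs"])  # kwargs seen by the tokenizer
print(out["image_kwargs"])      # kwargs seen by the image processor
```

In this toy version, any option passed to the processor reaches the tokenizer, while the image processor always runs with its defaults.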