-
### System Info
N/A for doc bug.
### Who can help?
@stevhliu
### Information
- [ ] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An officially supported task in the…
-
python get_semantic_embed.py --model_path ./Llama-2-7b-hf --dataset BookCrossing --pooling average --gpu_id 1
miniconda3/envs/rella/lib/python3.10/site-packages/transformers/configuration_utils.py:9…
-
CI on PRs occasionally fails with the following message:
```
FAILED tests/recipes/test_eleuther_eval.py::TestEleutherEval::test_torchtune_checkpoint_eval_results[truthfulqa_gen-0.1-1] - RuntimeEr…
-
### Describe your use-case.
Flux has layers named `single_transformer_blocks.*` and `transformer_blocks.*`.
If I want to train only the **transformer_blocks.*** layers but exclude **single_transformer…
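
To illustrate the kind of selection meant here, below is a minimal sketch in plain Python. It assumes parameter names are dotted strings; the example names and the helper `select_trainable` are hypothetical and not part of any library. The point is that a plain substring check for `transformer_blocks.` would also match `single_transformer_blocks.`, so the pattern is anchored to the start of the name or to a preceding dot.

```python
import re

# Hypothetical parameter names, for illustration only.
names = [
    "transformer_blocks.0.attn.to_q.weight",
    "single_transformer_blocks.0.attn.to_q.weight",
    "x_embedder.weight",
]

# Match "transformer_blocks." only at the start of the name or right after a
# dot, so "single_transformer_blocks." is not picked up as a substring match.
PATTERN = re.compile(r"(^|\.)transformer_blocks\.")


def select_trainable(param_names):
    """Return the names that belong to transformer_blocks.* only."""
    return [n for n in param_names if PATTERN.search(n)]


selected = select_trainable(names)
```

With a PyTorch module, the same filter could presumably be applied to the names yielded by `model.named_parameters()`, setting `requires_grad` accordingly; whether a given trainer exposes such a filter is a separate question.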
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
At the moment, importing llama-index-postprocessor-colbert-rerank requires torch and its nv…
-
## Summary
When calling `qkv_fuse_projections()` on an instance of `Flux2DTransformerModel` that was quantized with `torchao`'s `quantize_`, it fails with the following error:
```
File "/Users/s…
-
## Inspiration
There is a Gradio Space, [hf-audio/whisper-large-v3](https://huggingface.co/spaces/hf-audio/whisper-large-v3), that uses Whisper from the Hugging Face API:
```python
import spaces
import torch
…
-
Dear all,
It would be great to see an end-to-end practical example of LoTR. By "practical" I mean that one takes, for example, some existing LLM weights file, compresses it into a smaller weights fi…
-
### Describe the bug
I'd like to change the input layers of FLUX for img2img training, but I got:
`TypeError: expected str, bytes or os.PathLike object, not NoneType`
when loading `FluxTra…
-
### System Info
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 470.129.06 Driver Version: 470.129.06 CUDA Version: 12.4 |
|-------------…