-
### Right Case
Following the doc at https://github.com/pytorch/executorch/blob/main/examples/models/llama/README.md#enablement,
I export the Llama3.2-1B-Instruct:int4-spinquant-eo8 model to xnnpa…
-
### Feature request
🤗 Accelerate has a gradient accumulation wrapper, and the `no_trainer` scripts should be updated to include it!
An example can be seen [here](https://github.com/huggingface/…
-
### 🐛 Describe the bug
The `out` variant of `torch.aminmax` behaves inconsistently between eager mode and inductor. The same behavior occurs with the `aot_eager` backend.
```python
def run(op):…
-
## Bug Description
The PyTorch TensorRT example cannot generate images correctly. Version v2.3.0 had no issues, but version v2.4.0 produces incorrect images.
https://git…
-
### System Info
```
None of PyTorch, TensorFlow >= 2.0, or Flax have been found. Models won't be available and only tokenizers, configuration and file/data utilities can be used.
None of PyTorch,…
-
Platforms: linux
This test was disabled because it is failing on main branch ([recent examples](https://torch-ci.com/failure?failureCaptures=%5B%22'test%2Finductor%2Ftest_cudacodecache.py%3A%3ATest…
-
### 🐛 Describe the bug
Following [https://github.com/pytorch/executorch/blob/main/examples/models/llava/README.md](https://github.com/pytorch/executorch/blob/main/examples/models/llava/README.md) I e…
-
### System Info
transformers commit: 52ea4aa589324bae43dfb1b6db70335da7b68654 (main at time of writing)
The rest isn't relevant.
### Who can help?
trainer: @muellerzr @SunMarc
### Informa…
-
### System Info
GPU-A100,
TensorRT-LLM version = tensorrt_llm-0.13.0.dev2024090300
Ubuntu machine.
### Who can help?
Hi @ncomly-nvidia, @byshiue,
I want to set the 'no_repeat_ngram_size'=0…