issues
search
huggingface
/
optimum-nvidia
Apache License 2.0
893
stars
87
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Add more cli params
#163
mfuntowicz
opened
3 weeks ago
0
Update hub.py
#162
philschmid
opened
3 weeks ago
0
Speed up training?
#161
fzyzcjy
opened
1 month ago
0
Update to latest stable 0.13.0
#160
mfuntowicz
opened
1 month ago
0
chore: update README badges
#159
mfuntowicz
closed
2 months ago
0
Bump version to 0.1.0b8
#158
mfuntowicz
closed
2 months ago
0
chore: remove invalid examples
#157
mfuntowicz
closed
2 months ago
0
Fix test again
#156
mfuntowicz
closed
2 months ago
0
Fix license detection path
#155
mfuntowicz
closed
2 months ago
0
tests(cli): uncomment out tests for CLI
#154
mfuntowicz
closed
2 months ago
0
Add CLI quantization option
#153
mfuntowicz
closed
2 months ago
0
(misc) disable xQA kernels for now as they seem to hang
#152
mfuntowicz
closed
2 months ago
0
Disable xQA kernels for now
#151
mfuntowicz
closed
2 months ago
0
move to new cluster
#150
glegendre01
closed
2 months ago
0
How to use TensorRT model converter
#149
FortunaZhang
opened
2 months ago
0
Add no code export infrastructure through CLI
#148
mfuntowicz
closed
2 months ago
0
Bring back quantization with Nvidia ModelOpt
#147
mfuntowicz
closed
2 months ago
0
OutOfMemory - Not able to run the text-generation.py example on V100, and A10G cores.
#146
yahavb
opened
3 months ago
0
feat(tests) : Update CI to use new workflow and silicon.
#145
mfuntowicz
closed
4 months ago
0
Unable to install `optimum-nvidia` on Ubuntu
#144
QuantumStaticFR
opened
4 months ago
2
Error on Quickstart example
#143
laikhtewari
opened
4 months ago
1
Error for gated model access despite valid HF_TOKEN
#142
laikhtewari
opened
4 months ago
0
Expected all tensors to be on the same device, but found at least two devices, cpu and cuda:0!
#141
d142796
opened
4 months ago
1
Model of type falcon is not supported
#140
puneeshkhanna
opened
5 months ago
0
Fix namespace error of dtype args
#139
puneeshkhanna
opened
5 months ago
0
CI new l4 runners
#138
mfuntowicz
closed
4 months ago
0
Enable automatic build of container at each release
#137
mfuntowicz
closed
5 months ago
0
Enable trufflehog scanner CI on GA
#136
mfuntowicz
closed
5 months ago
0
Upgrade to 0.10.0
#135
mfuntowicz
closed
5 months ago
0
Model of type gpt_bigcode is not supported
#134
Aryabhattacharjee
opened
5 months ago
0
Refactor the overall Hugging Face -> TRTLLM export workflow
#133
mfuntowicz
closed
4 months ago
0
feat(package): make sure we dont have init as optimum level
#132
mfuntowicz
closed
5 months ago
0
Mixtral
#131
mfuntowicz
closed
6 months ago
0
Providing input_embeddings for generation instead of IDs
#129
verityw
opened
6 months ago
0
No engine file found for LLama 3 and Cuda API error with LLama 2 with use_fp8
#128
PhilSapiens
opened
6 months ago
1
Can't Run README Code
#127
hammoudhasan
opened
6 months ago
0
Load from local path?
#126
bdambrosio
opened
6 months ago
0
Use a percentage based matching rather than exact token match for tests
#125
mfuntowicz
closed
6 months ago
0
Update to TensorRT-LLM v0.9.0
#124
mfuntowicz
closed
6 months ago
0
Is there support for StoppingCriteria?
#123
RomanKoshkin
opened
7 months ago
0
Docker container fails on RTX A6000
#122
RomanKoshkin
opened
7 months ago
0
Failed to import optimum.nvidia
#121
abpani
opened
7 months ago
0
Add support for Phi family of models
#120
mfuntowicz
opened
7 months ago
0
ValueError: mutable default <class 'tensorrt_llm.lora_manager.LoraBuildConfig'> for field lora_config is not allowed: use default_factory
#119
manish-marwah
opened
7 months ago
5
Remove claim of Turing support
#118
laikhtewari
closed
7 months ago
0
Test batched causallm inference
#117
fxmarty
closed
7 months ago
1
FileNotFoundError: [Errno 2] No such file or directory: 'trtllm-build'
#116
Quang-elec44
opened
7 months ago
0
Fix checking output limits for #114
#115
zaycev
closed
7 months ago
1
Batching seems to be broken.
#114
zaycev
closed
7 months ago
1
Mention important additional parameters for engine config in README
#113
zaycev
closed
6 months ago
2
Next