issues
search
huggingface
/
optimum-neuron
Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.
Apache License 2.0
196
stars
59
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Update cache guide to including the caching for traced models
#557
JingyaHuang
opened
5 months ago
9
Add step closure instead of regular log
#556
michaelbenayoun
closed
4 months ago
3
Cleanup obsolete code
#555
michaelbenayoun
closed
5 months ago
1
Disable weights / neff separation of SDXL's UNET for neuron sdk 2.18
#554
JingyaHuang
closed
6 months ago
1
Cache utils related cleanup
#553
michaelbenayoun
closed
5 months ago
3
Remove print that should not be there
#552
michaelbenayoun
closed
6 months ago
1
- new notebook - TGI + SageMaker + Mistral
#551
samir-souza
opened
6 months ago
6
Audio models
#550
michaelbenayoun
opened
6 months ago
8
Adding CodeLlama-7B inference and compilation example notebook
#549
jimburtoft
closed
6 months ago
2
Upgrade Neuron SDK to 2.18.0 and TGI to 1.4.5 (fix)
#548
davidshtian
closed
6 months ago
2
Use AWS Neuron sdk 2.18
#547
dacorvo
closed
5 months ago
6
fix: bug in get_available_cores within container
#546
oOraph
closed
6 months ago
3
HF DL AMI v20240318 optimum-neuron install generates error on import
#545
jimburtoft
closed
5 months ago
4
optimum.neuron.modeling_decoder.get_available_cores returns a wrong result
#544
oOraph
closed
6 months ago
3
Add missing notebooks to doc
#543
JingyaHuang
closed
6 months ago
1
Fix TGI CI workflow
#542
dacorvo
closed
6 months ago
0
Add setup runtime step for K8S
#541
glegendre01
closed
6 months ago
0
sdxl compiled model load faild in sagemaker only (it works very well in EC2 instance)
#540
Suprhimp
closed
5 months ago
1
Disable logging during precompilation
#539
michaelbenayoun
closed
6 months ago
2
Fix style
#538
JingyaHuang
closed
6 months ago
0
Add tools for auto filling traced models cache
#537
JingyaHuang
closed
6 months ago
1
Do not use deprecated list_files_info
#536
Wauplin
closed
6 months ago
1
Add support to read different neuron cache from the local directory of the DLC based on ENVs
#535
Neo9061
opened
6 months ago
13
Bump optimum version
#534
JingyaHuang
closed
6 months ago
0
upgrade optimum and then install optimum-neuron
#533
shub-kris
closed
5 months ago
3
optimum-cli does not support exporting to Neuron by default in HF DL AMI v20240318
#532
mlopezr
opened
6 months ago
0
Fix GQA permutation computation and sequential weight initialization / loading when doing TP
#531
michaelbenayoun
closed
6 months ago
2
ADD stale bot
#530
philschmid
closed
6 months ago
1
Set up tgi environment values with the ones used to build the model
#529
oOraph
closed
5 months ago
6
Failing Casual Language Modeling with Expanded Mistral-7B Model ( 1.75B Trainable Parameters)
#528
shamanez
opened
6 months ago
1
Adding link to existing Fine-tuning example in Notebooks
#527
jimburtoft
closed
6 months ago
2
fixing format in getting-started.ipynb
#526
jimburtoft
closed
6 months ago
0
Removing colab links in notebooks.mdx
#525
jimburtoft
closed
6 months ago
1
Skip weight load during parallel compile
#524
michaelbenayoun
closed
6 months ago
1
Mixed-precision training with both `torch_xla` or `torch.autocast`
#523
michaelbenayoun
closed
6 months ago
1
TGI improvements
#522
dacorvo
closed
6 months ago
1
Init on the `xla` device
#521
michaelbenayoun
closed
6 months ago
1
RuntimeError: Failed to import optimum.neuron.modeling because of the following error (look up to see its traceback): No module named 'optimum.exporters.neuron'
#520
hyogrin
closed
6 months ago
0
Unable to deploy TinyLlama in Amazon SageMaker using Optimum Neuron 0.0.20 w/ Neuronx 2.*
#519
ari-vedant-jain
closed
2 weeks ago
9
Optimum Neuron v 0.0.20 w/ Neuron 2.x taking too long to fine-tune TinyLlama in Amazon SageMaker
#518
ari-vedant-jain
opened
6 months ago
2
Fix/ami authorized keys
#517
shub-kris
closed
6 months ago
0
TGI NeuronX DLC (Optimum-neuron) 0.0.20: SageMaker deployment failure with llama-2 7B
#516
Neo9061
opened
6 months ago
14
Request TGI NeuronX DLC to support Flan T5 models in SageMaker
#515
Neo9061
opened
6 months ago
7
Support Marian models inference
#514
JingyaHuang
opened
6 months ago
0
Support owlv2 models inference
#513
JingyaHuang
opened
6 months ago
0
Support layoutlm models inference
#512
JingyaHuang
opened
6 months ago
0
Muti Node CLM Mistral training tutorial
#511
shamanez
opened
6 months ago
9
[Inference] Neuron cache for traced torchscript models (encoders, stable diffusion)
#510
JingyaHuang
closed
6 months ago
1
Support phi model on feature-extraction, text-classification, token-classification tasks
#509
JingyaHuang
closed
6 months ago
2
Add support for phi-2
#508
5cp
closed
6 months ago
2
Previous
Next