aws-neuron/transformers-neuronx
Apache License 2.0
88 stars
24 forks
issues
Neuron model NEFFs are dependent on the python path
#91
dacorvo
opened
16 hours ago
1
Sync internal repo to external June 28 2024
#90
hannanjgaws
closed
2 days ago
0
Any plan to support Qwen-2 Model
#89
mynewstart
opened
1 week ago
0
llava support
#88
sonic182
opened
2 weeks ago
3
Add Gemma
#87
yisi-wang-slalom
opened
1 month ago
0
For Mistral 7B - Generate Text using Input Embeddings + Add no_repeat_ngram_size Support
#86
davidshtian
opened
2 months ago
0
Sync internal repo to external Apr 15 2024
#85
hannanjgaws
closed
2 months ago
0
Latest changes introduced for continuous batching break Mixtral model
#84
dacorvo
opened
2 months ago
4
Add support for Baichuan-13B model
#83
cszhz
opened
2 months ago
0
Add support for `gemma` models
#82
benglewis
opened
3 months ago
1
Sync internal repo to external Mar 29 2024
#81
hannanjgaws
closed
3 months ago
0
Improve Neuron model loading time
#80
dacorvo
opened
3 months ago
4
NaN outputs when masking llama model inputs
#79
dacorvo
opened
4 months ago
6
Backward compatibility with saved llama 2 compiled artifacts
#78
dacorvo
opened
5 months ago
1
Issue while compiling Mistral 7B 0.2 Instruct
#77
josete89
closed
3 months ago
5
User feedback when compiling and reloading a large model
#76
dacorvo
opened
5 months ago
1
`stopping_criteria_list(input_ids, probs)` does not check for the correct sequence.
#75
michaelfeil
closed
4 months ago
4
Support for MPT model
#74
klutzDrawers
opened
5 months ago
1
Inferring logits from `model.forward` for the entire batch instead of the last forward's output.
#73
michaelfeil
opened
5 months ago
5
Generate Llama 2 from Embeddings
#72
liechtym
opened
5 months ago
5
Mixtral config issue -- not handling null well
#71
jimburtoft
closed
2 months ago
8
How to use generate() with inputs_embeds
#70
liechtym
closed
6 months ago
2
Sync internal repo to external Dec 28 2023
#69
hannanjgaws
closed
6 months ago
0
Skipping generation for useless tokens, and modifying cacheids
#68
enochlev
closed
5 months ago
3
Inf2 Modified Llama 2 Loading Issue
#67
liechtym
closed
6 months ago
11
Vicuna13B model support
#66
petrovicu
opened
6 months ago
0
Mixtral Model support
#65
enochlev
closed
6 months ago
2
llama-2/codellama benchmark for inf2.xlarge
#64
zliendo
closed
6 months ago
4
Llama2 inference overhead time way too long
#63
enochlev
closed
6 months ago
6
Added safetensors support in from_pretrained()
#62
dennj
opened
7 months ago
0
LLaMA fails when the input token length is over 1790 tokens
#61
dennj
closed
3 months ago
6
from_pretrained is broken after transformers made safetensor serialization default
#60
dennj
closed
7 months ago
1
Compilation error on llama 7 B with batch size 8
#59
dacorvo
closed
1 month ago
4
Can't save/serialize any models except GPT2
#58
awskila
closed
3 days ago
4
Avoid splitting Hugging Face Hub checkpoint files on disk
#57
dacorvo
closed
1 month ago
7
Turn off safe_serialization from save_split so that save_function is called
#56
jitto
opened
7 months ago
0
save_split seems to be broken after transformers made safetensor serialization default
#55
jitto
closed
4 months ago
3
Sync internal repo to external Oct 27 2023
#54
hannanjgaws
closed
6 months ago
0
About loading and saving llama model of pretraining job
#53
etsurin
closed
7 months ago
2
Serving Throughput Optimizations (e.g. PagedAttention)
#52
vigneshv59
closed
2 days ago
3
Support for encoder-decoder models
#51
kwontaek-amazon
closed
6 months ago
2
Support for Mistral-7B model
#50
henghui-zhu-amazon
closed
4 months ago
4
Core dump during inference on llama2 model with batch size 4 and 1024 inputs
#49
dacorvo
closed
9 months ago
13
Very long compilation times for llama2 with batch size 4
#48
dacorvo
closed
7 months ago
4
Rebasing PR into main (Adding support and performance link)
#47
eshalakhotia
opened
9 months ago
0
Possible error in top-p filtering
#46
dacorvo
closed
4 months ago
5
Compilation errors for llama 2 models
#45
dacorvo
closed
9 months ago
8
Sync internal repo to external Oct 27 2023
#44
awshaichen
closed
8 months ago
0
reapply external contributions
#43
awshaichen
closed
9 months ago
0
sync internal repo to external
#42
awshaichen
closed
9 months ago
0