issues
search
awslabs
/
llm-hosting-container
Large Language Model Hosting Container
Apache License 2.0
75
stars
32
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
fix: Copy Cargo.lock from tgi to respect pinned versions
#57
jinyoung-lim
closed
8 months ago
1
Not able to get the files copied inside Dockerfile
#56
byash11
opened
8 months ago
4
chore: add optimum-neuron 0.0.18 image
#55
dacorvo
closed
8 months ago
2
TGI 1.4.0
#54
amzn-choeric
closed
8 months ago
1
ADD TGI 1.4.0
#53
philschmid
closed
8 months ago
1
TGI 1.4.0
#52
timelfrink
closed
8 months ago
1
Test
#51
amzn-choeric
closed
8 months ago
0
Old Versions Archiving
#50
amzn-choeric
closed
8 months ago
0
chore: add optimum version 0.0.17
#49
dacorvo
closed
8 months ago
1
chore: device_version to cuda_version
#48
jinyoung-lim
closed
8 months ago
0
feature: optimum-neuronx 0.0.16
#47
jinyoung-lim
closed
9 months ago
0
overwrite test branch with changes including neuronx dockerfile
#46
jinyoung-lim
closed
9 months ago
0
feature: optimum-neuronx 0.0.16
#45
jinyoung-lim
closed
9 months ago
0
TGI 1.3.3
#44
amzn-choeric
closed
10 months ago
1
TGI 1.3.1
#43
amzn-choeric
closed
10 months ago
1
Add 1.3.1 with Mixtral
#42
philschmid
closed
10 months ago
1
TGI 1.2.0
#41
amzn-choeric
closed
10 months ago
1
TGI 1.2.0
#40
philschmid
closed
10 months ago
1
Test
#39
amzn-choeric
closed
10 months ago
0
Test Update & 1.03 Files (Testing)
#38
amzn-choeric
closed
10 months ago
0
TGI Sanity Tests
#37
amzn-choeric
closed
11 months ago
0
Add Neuronx TGI
#36
philschmid
closed
9 months ago
1
ValueError: Unsupported model type falcon
#35
shenshaoyong
opened
1 year ago
0
V1.1.0 release
#34
haixiw
closed
1 year ago
1
ADD TGI 1.1.0
#33
philschmid
closed
11 months ago
1
Is there a way to get GenerateResponse full json when calling sagemaker endpoints
#32
nth-attempt
opened
1 year ago
1
Add python packages licenses
#31
haixiw
closed
1 year ago
0
V1.0.3
#30
philschmid
closed
1 year ago
0
TGI 1.0.2 Third Party Licenses File
#29
amzn-choeric
closed
1 year ago
0
adds huggingface tgi container tuning documentation
#28
sean-eich
closed
1 year ago
0
How can I delpoy a model with AWS S3 and without downloading model from hunggingface via TGI image on Sagemaker?
#27
weiZhenkun
opened
1 year ago
1
add TGI docs
#26
lanking520
closed
1 year ago
0
[fix] update pytorch for conda install in 0.6.0
#25
tosterberg
closed
1 year ago
0
Update build-huggingface.yml
#24
xyang16
closed
1 year ago
0
[fix] update mamba version
#23
tosterberg
closed
1 year ago
0
ADD TGI v1.0.2
#22
philschmid
closed
1 year ago
5
[huggingface] Update build workflow to 0.9.3
#21
xyang16
closed
1 year ago
0
Latency and Throughput Inquiry
#20
ctandrewtran
opened
1 year ago
4
Llama2
#19
philschmid
closed
1 year ago
9
AllTraffic did not pass the ping health check
#18
existeundelta
closed
1 year ago
1
Updating HF_MODEL_QUANTIZE
#17
rodalarcon
closed
1 year ago
0
'HF_MODEL_QUANTIZE' value error
#16
yanivg10
closed
1 year ago
1
[huggingface] Update sagemaker version in notebook
#15
xyang16
closed
1 year ago
0
[huggingface] Add falcon example notebooks
#14
xyang16
closed
1 year ago
0
[huggingface] Clean the src files for TGI 0.8.2
#13
xyang16
closed
1 year ago
0
Add TGI 0.8.2
#12
philschmid
closed
1 year ago
2
Please update to 0.8.2 to fix multiple bugs
#11
austinmw
closed
1 year ago
2
[huggingface] Update sagemaker dlc test
#10
xyang16
closed
1 year ago
0
[huggingface] Add get_huggingface_llm_image_uri() in example notebook
#9
xyang16
closed
1 year ago
0
[huggingface] Update instance type and input parameters in example notebook
#8
xyang16
closed
1 year ago
0
Previous
Next