issues
search
basetenlabs
/
truss-examples
Examples of models deployable with Truss
https://trussml.com
MIT License
102
stars
24
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Updating files for ComfyUI to use build commands by default
#319
htrivedi99
closed
12 hours ago
0
added gemma2 9b and 27b with streaming using local-gemma
#318
dsingal0
opened
2 days ago
1
Adding vllm speculative decoding example
#317
htrivedi99
opened
2 days ago
0
Fix poems to match docs.
#316
squidarth
opened
5 days ago
0
Add whisper chainlet to warmups.
#315
squidarth
closed
6 days ago
0
Update GPU count on README.md
#314
vshulman
opened
1 week ago
0
Add warm up chains github action.
#313
squidarth
closed
6 days ago
0
Added whisper model that takes base64 input.
#312
squidarth
closed
1 week ago
0
Updating ComfyUI readme to include build commands
#311
htrivedi99
closed
6 days ago
0
fixing llama engine to be tp2 as name suggests
#310
vshulman
closed
1 week ago
0
[Droid] Integrate Florence-2-Large Model
#309
factory-droid[bot]
opened
1 week ago
5
Het/sd3 fix
#308
htrivedi99
closed
3 weeks ago
0
Adding sd3 truss
#307
htrivedi99
closed
3 weeks ago
0
Make LoRA truss work in baseten development mode
#306
pankajroark
closed
3 weeks ago
0
[DO NOT MERGE] briton test prep
#305
pankajroark
opened
3 weeks ago
0
Lora example
#304
pankajroark
closed
3 weeks ago
0
LoRA hot swapping
#303
aspctu
closed
4 weeks ago
0
Constrained decoding
#302
kchatr
opened
1 month ago
0
truss push faulty on windows?
#301
TidorP
opened
1 month ago
3
Llama 3 70B - TRT and change directory for TRT
#300
vshulman
closed
1 week ago
0
"GET /v1/models/model/schema HTTP/1.1" 404
#299
ngsitrong26
opened
1 month ago
1
error no module name node_helpers, how to deploy ipadapter faceid with baseten
#298
ngsitrong26
opened
1 month ago
0
Fix llama tokenizer reference
#297
joostinyi
closed
1 month ago
2
Not getting timestamp information with text in Whisper Streaming
#296
usman61
opened
1 month ago
0
Update truss examples to not use py38.
#295
squidarth
closed
1 month ago
0
Fixed deploying whisper streaming to baseten by removing python executable path in config file
#294
usman61
closed
1 month ago
2
adding higher context llama 3 TRT engine
#293
vshulman
opened
1 month ago
0
Llama 3 ChatQA 1.5 in 8B and 70B
#292
philipkiely-baseten
opened
1 month ago
0
Remove llama/medusa example
#291
aspctu
closed
1 month ago
0
new mixtral needs new template
#290
vshulman
closed
1 month ago
0
Only use messages key when openai compatible
#289
jrochette
closed
1 month ago
0
Update config.yaml
#288
philipkiely-baseten
closed
2 months ago
0
fix issue with nomic model
#287
philipkiely-baseten
closed
2 months ago
0
updating mistral vllm implementation to 0.2 + adding support for hf token
#286
vshulman
opened
2 months ago
0
Medusa implementation
#285
aspctu
closed
2 months ago
0
add hf token reqs to larger mistral models
#284
philipkiely-baseten
closed
2 months ago
0
[Droid] DBRX Truss Implementation
#283
factory-droid[bot]
opened
2 months ago
4
Add truss example for Qwen1.5-110B with vllm & streaming support
#282
ImmarKarim
opened
2 months ago
0
Add Phi 3 (both context windows)
#281
philipkiely-baseten
closed
2 months ago
0
Llama 3 70B TRT-LLM
#280
aspctu
closed
2 months ago
0
reduce Llama 3 70B from 4 to 2 H100s
#279
philipkiely-baseten
closed
2 months ago
0
Fix mistral trt example.
#278
squidarth
closed
2 months ago
0
Fix truss ci examples
#277
squidarth
closed
2 months ago
0
Actually use the auth tokens when pulling hf repos
#276
squidarth
closed
2 months ago
0
Fix mistral models to have an hf_access_token.
#275
squidarth
closed
2 months ago
0
Adding some fixes for trt llama 3 8b instruct
#274
htrivedi99
closed
2 months ago
0
Add metadata to fix HF link bug
#273
philipkiely-baseten
closed
2 months ago
0
Adding files for mixtral 8x22b instruct trt int8 quantized
#272
htrivedi99
closed
2 months ago
0
fp8 llama3 8b
#271
joostinyi
closed
2 months ago
0
llama3 8b instruct fp8 on h100
#270
aspctu
closed
2 months ago
1
Next