InftyAI/llmaz
☸️ Easy, advanced inference platform for large language models on Kubernetes
Apache License 2.0
15 stars
6 forks
issues
Newest
Benchmark toolkit support
#66
kerthcet
opened
1 month ago
3
Support llama.cpp as alternative backend
#65
kerthcet
closed
1 month ago
3
Add new rules to golangci
#64
kerthcet
closed
1 month ago
2
Milestone v0.1.0
#63
kerthcet
opened
1 month ago
0
Support different GPU accelerators for fungibility
#62
kerthcet
opened
1 month ago
1
Add an example for multi-host inference with Service
#61
kerthcet
opened
1 month ago
1
Support ObjectStore as another datasource
#60
kerthcet
closed
1 month ago
1
Support speculative decoding
#59
kerthcet
closed
1 week ago
3
Model version management
#58
kerthcet
opened
1 month ago
2
Add baseline for tests
#57
kerthcet
closed
1 month ago
1
merge once tests passed
#56
kerthcet
closed
1 month ago
32
Add golang ci test
#55
kerthcet
closed
1 month ago
2
Bump github.com/onsi/gomega from 1.33.1 to 1.34.1
#54
dependabot[bot]
closed
1 month ago
3
Bump github.com/onsi/ginkgo/v2 from 2.19.0 to 2.19.1
#53
dependabot[bot]
closed
1 month ago
1
Bump the kubernetes group with 5 updates
#52
dependabot[bot]
closed
1 month ago
1
Add dependabot and issue & pr template
#51
kerthcet
closed
1 month ago
2
Concurrently download the main container image when downloading weights
#50
kerthcet
opened
1 month ago
3
Update docs of examples
#49
kerthcet
closed
1 month ago
1
Add new datasource interface
#48
kerthcet
closed
1 month ago
2
Support loading models from Image volume source
#47
kerthcet
closed
3 weeks ago
4
Feat: support sglang backend
#46
vicoooo26
closed
1 month ago
9
Feat: support modelscope
#45
vicoooo26
closed
1 month ago
4
CI support for tests
#44
kerthcet
closed
1 month ago
1
Use modelHub as one data source
#43
kerthcet
closed
1 month ago
1
Install lws at llmaz-system namespace
#42
kerthcet
opened
1 month ago
3
Add support for multithread when downloading weights
#41
kerthcet
closed
1 month ago
2
Use rust instead of python to download model weights
#40
kerthcet
closed
1 month ago
2
Support SGLang as another backend
#39
kerthcet
closed
1 month ago
2
Use rust instead of python when downloading model weights
#38
kerthcet
closed
1 month ago
6
Support OpenAPI
#37
kerthcet
opened
1 month ago
1
Update README.md to avoid confusion
#36
kerthcet
closed
1 month ago
1
Support OCI artifacts
#35
kerthcet
opened
1 month ago
3
Will sharing models via hostPath lead to security problems?
#34
kerthcet
opened
1 month ago
11
Model should be namespaced
#33
kerthcet
closed
1 month ago
1
Support Deployment for serving most models
#32
kerthcet
opened
1 month ago
9
Chore: fix readme typos
#31
kerthcet
closed
1 month ago
2
Release v0.0.1
#30
kerthcet
closed
1 month ago
1
Support loading model from huggingface
#29
kerthcet
closed
1 month ago
3
[2/N] Add support for single host deployment
#28
kerthcet
closed
1 month ago
2
Lora multiplexing support
#27
kerthcet
opened
2 months ago
4
Failed to patch inferenceService because of schema undeclared
#26
kerthcet
closed
1 month ago
3
[1/N] Add support for per model deployment
#25
kerthcet
closed
2 months ago
2
Remove `core` folder
#24
kerthcet
opened
2 months ago
3
Support HA with LeaderElection
#23
kerthcet
opened
2 months ago
1
Support Secret in Playground
#22
kerthcet
closed
1 month ago
3
Liveness & Readiness support
#21
kerthcet
opened
2 months ago
1
Support DataSource of docker image
#20
kerthcet
closed
1 week ago
5
Support DataSource of ModelScope
#19
kerthcet
closed
1 month ago
2
Support DataSource of Huggingface
#18
kerthcet
closed
1 month ago
1
Add more testcases for webhooks
#17
kerthcet
opened
2 months ago
3