issues
search
InftyAI
/
llmaz
☸️ Easy, advanced inference platform for large language models on Kubernetes
Apache License 2.0
25
stars
10
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Support loading models from Image volume source
#47
kerthcet
closed
2 months ago
4
Feat: support sglang backend
#46
vicoooo26
closed
3 months ago
9
Feat: support modelscope
#45
vicoooo26
closed
3 months ago
4
CI support for tests
#44
kerthcet
closed
3 months ago
1
Use modelHub as one data source
#43
kerthcet
closed
3 months ago
1
Install lws at llmaz-system namespace
#42
kerthcet
opened
3 months ago
3
Add support for multithread when downloading weights
#41
kerthcet
closed
3 months ago
2
Use rust instead of python to download model weights
#40
kerthcet
closed
3 months ago
2
Support SGLang as another backend
#39
kerthcet
closed
3 months ago
2
Use rust instead of python when downloading model weights
#38
kerthcet
closed
3 months ago
6
Support OpenAPI
#37
kerthcet
opened
3 months ago
1
Update README.md to avoid confusion
#36
kerthcet
closed
3 months ago
1
Support OCI artifacts
#35
kerthcet
opened
3 months ago
3
Will sharing models via hostPath leading to security probelm
#34
kerthcet
opened
3 months ago
11
Model should be namespaced
#33
kerthcet
closed
3 months ago
1
Support Deployment for serving most models
#32
kerthcet
opened
3 months ago
9
Chore: fix readme typos
#31
kerthcet
closed
3 months ago
2
Release v0.0.1
#30
kerthcet
closed
3 months ago
1
Support loading model from huggingface
#29
kerthcet
closed
3 months ago
3
[2/N] Add support for single host deployment
#28
kerthcet
closed
3 months ago
2
Lora multiplexing support
#27
kerthcet
opened
3 months ago
4
Failed to patch inferenceService because of schema undeclared
#26
kerthcet
closed
3 months ago
3
[1/N] Add support for per model deployment
#25
kerthcet
closed
3 months ago
2
Remove `core` folder
#24
kerthcet
opened
3 months ago
3
Support HA with LeaderElection
#23
kerthcet
closed
1 month ago
5
Support Secret in Playground
#22
kerthcet
closed
3 months ago
3
Liveness & Readiness support
#21
kerthcet
opened
3 months ago
3
Support DataSource of docker image
#20
kerthcet
closed
2 months ago
5
Support DataSource of ModelScope
#19
kerthcet
closed
3 months ago
2
Support DataSource of Huggingface
#18
kerthcet
closed
3 months ago
1
Add more testcases for webhooks
#17
kerthcet
opened
3 months ago
5
Support multi-host inference
#16
kerthcet
opened
3 months ago
1
Support splitwise with multiModelsClaims
#15
kerthcet
opened
3 months ago
2
Failed to pass through the labels to the lws Pods
#14
kerthcet
closed
3 months ago
3
Add webhook to Playground
#13
kerthcet
closed
3 months ago
1
Add E2E test framework
#12
kerthcet
closed
3 months ago
1
Add webhook to Model
#11
kerthcet
closed
3 months ago
2
Add webhooks
#10
kerthcet
closed
3 months ago
2
Add OWNERS file
#9
kerthcet
closed
3 months ago
2
Add Inference API
#8
kerthcet
closed
3 months ago
7
Add workflow
#7
kerthcet
closed
4 months ago
0
Integrate with lws
#6
kerthcet
closed
7 months ago
0
Use lws as default workload
#5
kerthcet
closed
4 months ago
3
[WIP]inital operator
#4
B1F030
closed
7 months ago
2
Support autoscaling
#3
kerthcet
opened
11 months ago
10
Support reconcile the `serve.Replicas`
#2
kerthcet
closed
4 months ago
2
Init
#1
kerthcet
closed
11 months ago
0
Previous