issues
search
InftyAI
/
llmaz
☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!
Apache License 2.0
31
stars
10
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
fix: helm-install cmd
#210
googs1025
opened
1 hour ago
1
Add preheat field
#209
kerthcet
closed
1 day ago
1
Bump github.com/onsi/ginkgo/v2 from 2.21.0 to 2.22.0
#208
dependabot[bot]
closed
1 day ago
1
Bump sigs.k8s.io/controller-runtime from 0.19.1 to 0.19.2
#207
dependabot[bot]
closed
1 day ago
1
Bump the kubernetes group with 5 updates
#206
dependabot[bot]
closed
1 day ago
1
Add TensorRT-LLM support as another backend
#205
kerthcet
opened
1 week ago
0
Bump sigs.k8s.io/structured-merge-diff/v4 from 4.4.1 to 4.4.3
#204
dependabot[bot]
closed
1 week ago
1
fix: helm INSTALLATION FAILED
#203
googs1025
closed
1 week ago
2
Install lws controller together with llmaz controller in the same namespace
#202
kerthcet
opened
1 week ago
3
update docs
#201
kerthcet
closed
2 weeks ago
2
Update README.md
#200
kerthcet
closed
2 weeks ago
1
Bump sigs.k8s.io/lws from 0.4.1 to 0.4.2
#199
dependabot[bot]
closed
2 weeks ago
1
Bump github.com/open-policy-agent/cert-controller from 0.11.0 to 0.12.0
#198
dependabot[bot]
closed
2 weeks ago
1
Support speculative decoding with llama.cpp
#197
kerthcet
opened
2 weeks ago
0
Bump sigs.k8s.io/structured-merge-diff/v4 from 4.4.1 to 4.4.2
#196
dependabot[bot]
closed
3 weeks ago
2
Bump github.com/onsi/gomega from 1.34.2 to 1.35.1
#195
dependabot[bot]
closed
3 weeks ago
1
Bump github.com/onsi/ginkgo/v2 from 2.20.2 to 2.21.0
#194
dependabot[bot]
closed
3 weeks ago
2
Support ollama
#193
qinguoyi
closed
2 weeks ago
8
Serverless support
#192
kerthcet
opened
4 weeks ago
0
Bump sigs.k8s.io/controller-runtime from 0.19.0 to 0.19.1
#191
dependabot[bot]
closed
4 weeks ago
1
Bump the kubernetes group with 5 updates
#190
dependabot[bot]
closed
4 weeks ago
1
Support to serving Stable Diffusion models
#189
kerthcet
opened
1 month ago
0
Release v0.0.8
#188
kerthcet
closed
1 month ago
1
Release v0.0.8
#187
kerthcet
closed
1 month ago
2
Release v0.0.8
#186
kerthcet
closed
1 month ago
2
Bump sigs.k8s.io/lws from 0.4.0 to 0.4.1
#185
dependabot[bot]
closed
1 month ago
1
chore:update llama readme
#184
qinguoyi
closed
1 month ago
1
Update helm files
#183
kerthcet
closed
1 month ago
3
Support TGI as another backendRuntime
#182
kerthcet
closed
1 month ago
1
Make field Command optional
#181
kerthcet
closed
1 month ago
2
Remove namespace when getting OpenModel
#180
kerthcet
closed
1 month ago
2
Downsize model-loader image
#179
qinguoyi
closed
1 month ago
1
feat:update model loader
#178
qinguoyi
closed
1 month ago
16
Update arch
#177
kerthcet
closed
1 month ago
1
Update Revision default to main
#176
kerthcet
closed
2 months ago
2
fix:load models cost seconds
#175
qinguoyi
closed
2 months ago
1
fix:catch os error
#174
qinguoyi
closed
2 months ago
1
Update installation doc
#173
kerthcet
closed
2 months ago
2
feat:support apply llmaz to any ns
#172
qinguoyi
closed
2 months ago
3
Upgrade project
#171
kerthcet
closed
2 months ago
1
feature(webhook): add BackendRuntimeConfig resources validation
#170
googs1025
closed
2 months ago
6
update controller-gen
#169
kerthcet
closed
2 months ago
2
chore:update leader-elect chart config
#168
qinguoyi
closed
2 months ago
4
Update make generate
#167
kerthcet
closed
2 months ago
2
Bump the kubernetes group with 5 updates
#166
dependabot[bot]
closed
2 months ago
1
Is there any early proposal or document about integrating with Gateway API ?
#165
caozhuozi
opened
2 months ago
2
chore: add unit test in util package
#164
googs1025
closed
2 months ago
3
[ModelLoader] Some huggingface models may contain duplicated weights
#163
kerthcet
closed
4 weeks ago
7
chore: bump LWS version to v0.4.0
#162
googs1025
closed
2 months ago
5
Bump LWS version to v0.4.0
#161
kerthcet
closed
2 months ago
2
Next