InftyAI/llmaz
☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!
Apache License 2.0
30 stars
10 forks
issues
Bump github.com/onsi/ginkgo/v2 from 2.19.1 to 2.20.1
#105
dependabot[bot]
closed
2 months ago
1
Change model name to github.com/inftyai/llmaz
#104
kerthcet
closed
2 months ago
1
Accelerate model loading
#103
kerthcet
closed
1 week ago
3
Stop sharing model weights across Pods in the same node
#102
kerthcet
closed
3 months ago
1
Prepare for v0.0.4
#101
kerthcet
closed
3 months ago
1
Support filesystems
#100
kerthcet
opened
3 months ago
1
Download models in advance
#99
kerthcet
closed
3 months ago
3
Bump sigs.k8s.io/controller-runtime from 0.18.4 to 0.19.0
#98
dependabot[bot]
closed
3 months ago
1
Bump the kubernetes group with 5 updates
#97
dependabot[bot]
closed
3 months ago
1
Model aware scheduling
#96
kerthcet
opened
3 months ago
3
add e2e tests with llama.cpp
#95
kerthcet
closed
3 months ago
2
Support llama.cpp
#94
kerthcet
closed
3 months ago
2
Downsize the model-loader image
#93
kerthcet
closed
1 month ago
8
Playground will not reconcile once model created
#92
kerthcet
closed
2 months ago
2
ollama support
#91
kerthcet
closed
1 week ago
13
Prompt management
#90
kerthcet
opened
3 months ago
1
Update Readme.md
#89
kerthcet
closed
3 months ago
1
Always download the model weights when pod starts
#88
kerthcet
closed
3 months ago
2
Support loading models from object store
#87
kerthcet
closed
3 months ago
2
Failover policy for various backends
#86
kerthcet
opened
3 months ago
1
Parallel model serving
#85
kerthcet
opened
3 months ago
1
Update contributing.md
#84
kerthcet
closed
3 months ago
1
Update code of conduct
#83
kerthcet
closed
3 months ago
2
Mark project as alpha
#82
kerthcet
closed
3 months ago
1
Lack the flexibility to express deploy primitives
#81
kerthcet
opened
3 months ago
4
Bump sigs.k8s.io/controller-runtime from 0.17.3 to 0.18.4
#80
dependabot[bot]
closed
3 months ago
2
dependency cleanup
#79
kerthcet
closed
3 months ago
2
Bump github.com/open-policy-agent/cert-controller from 0.10.1 to 0.11.0
#78
dependabot[bot]
closed
3 months ago
1
Bump github.com/onsi/ginkgo/v2 from 2.19.1 to 2.20.0
#77
dependabot[bot]
closed
3 months ago
2
Change Model API to OpenModel
#76
kerthcet
closed
3 months ago
3
Prepare for v0.0.2
#75
kerthcet
closed
3 months ago
1
Integrate with Kueue for fungibility capacity
#74
kerthcet
opened
3 months ago
1
Mount /dev/shm for shared memory files
#73
kerthcet
opened
3 months ago
1
Support TGI as another alternative backend
#72
kerthcet
closed
1 month ago
5
Support Secrets to store HF_TOKEN
#71
kerthcet
closed
3 months ago
4
Fix: name must not contain dots
#70
kerthcet
closed
3 months ago
3
Add more e2e tests
#69
kerthcet
closed
3 months ago
1
Support SGLang & modify model source API
#68
kerthcet
closed
3 months ago
3
Once name contains a dot, failed to create Pods
#67
kerthcet
closed
3 months ago
1
Benchmark toolkit support
#66
kerthcet
opened
3 months ago
3
Support llama.cpp as alternative backend
#65
kerthcet
closed
3 months ago
3
Add new rules to golangci
#64
kerthcet
closed
3 months ago
2
Milestone v0.1.0
#63
kerthcet
opened
3 months ago
0
Support different GPU accelerators for fungibility
#62
kerthcet
opened
3 months ago
1
Add an example for multi-host inference with Service
#61
kerthcet
opened
3 months ago
1
Support ObjectStore as another datasource
#60
kerthcet
closed
3 months ago
1
Support speculative decoding
#59
kerthcet
closed
2 months ago
3
Model version management
#58
kerthcet
opened
3 months ago
2
Add baseline for tests
#57
kerthcet
closed
3 months ago
1
merge once tests passed
#56
kerthcet
closed
3 months ago
32