issues
search
InftyAI
/
llmaz
☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!
Apache License 2.0
30
stars
10
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Add readme.md
#155
kerthcet
closed
2 months ago
0
Update installation doc
#154
kerthcet
closed
2 months ago
1
Helm uninstall will not delete the CRDs
#153
kerthcet
closed
2 months ago
1
Add helm chart v0.0.2
#152
kerthcet
closed
2 months ago
0
Add index.yaml
#151
kerthcet
closed
2 months ago
0
Add helm chart 0.0.2
#150
kerthcet
closed
2 months ago
0
Update installation doc
#149
kerthcet
closed
2 months ago
0
Support publish helm charts
#148
kerthcet
closed
2 months ago
0
Fix filename error
#147
kerthcet
closed
2 months ago
0
Rename workflow name
#146
kerthcet
closed
2 months ago
0
Support publish helm chart
#145
kerthcet
closed
2 months ago
0
Add helm chart v0.0.1
#144
kerthcet
closed
2 months ago
1
Prepare for v0.0.7
#143
kerthcet
closed
2 months ago
4
Add helm chart support
#142
kerthcet
closed
2 months ago
2
Support install llmaz at any namespace
#141
kerthcet
closed
1 month ago
2
Customized flags for backendRuntimes
#140
kerthcet
opened
2 months ago
2
[2/N] Add backendRuntime implementation
#139
kerthcet
closed
2 months ago
2
[1/N] Add backendRuntime CRD
#138
kerthcet
closed
2 months ago
3
helm chart support for easy installation
#137
kerthcet
closed
2 months ago
4
Fix resource limits could be small than requests
#136
kerthcet
closed
2 months ago
1
Add validation to Playground as the backendConfig's resource requests should not be greater than limits
#135
kerthcet
closed
2 months ago
5
Add new API object BackendRuntime for expandability
#134
kerthcet
closed
2 months ago
5
Support traditional models
#133
kerthcet
opened
2 months ago
1
Update typo
#132
kerthcet
closed
2 months ago
2
Update typo
#131
kerthcet
closed
2 months ago
2
Update Architecture
#130
kerthcet
closed
2 months ago
1
Add Architecture diagram
#129
kerthcet
closed
2 months ago
1
Bump vllm to 0.6.0 according to the great performance improvement
#128
kerthcet
closed
2 months ago
3
Prepare for v0.0.6
#127
kerthcet
closed
2 months ago
1
Prepare for v0.0.5
#126
kerthcet
closed
2 months ago
1
Change ModelClaims API
#125
kerthcet
closed
2 months ago
2
Add verbose log to modelLoader
#124
kerthcet
closed
2 months ago
1
Requests could be bigger than limits
#123
kerthcet
closed
2 months ago
2
Report filename and file size in modelLoader
#122
kerthcet
closed
2 months ago
3
[2/N] Support SpeculativeDecoding with llama.cpp
#121
kerthcet
closed
2 months ago
2
Add new conditions to Playground
#120
kerthcet
closed
2 months ago
2
Loading model weights more efficiently
#119
kerthcet
opened
2 months ago
6
[1/N] Add SpeculativeDecoding support
#118
kerthcet
closed
2 months ago
3
Add new status once models haven't been created
#117
kerthcet
closed
2 months ago
1
Bump github.com/onsi/gomega from 1.34.1 to 1.34.2
#116
dependabot[bot]
closed
2 months ago
2
Bump github.com/onsi/ginkgo/v2 from 2.20.1 to 2.20.2
#115
dependabot[bot]
closed
2 months ago
1
Unify the API routes for different inference engines
#114
kerthcet
closed
2 months ago
3
Add integration tests for Playground/Service status update
#113
kerthcet
closed
2 months ago
3
Add project logo
#112
kerthcet
closed
2 months ago
3
Add model label to Playground
#111
kerthcet
closed
2 months ago
2
update .github/PULL_REQUEST_TEMPLATE.md
#110
carlory
closed
1 month ago
5
Playground should be triggered to create Services and then Pods once the model is created
#109
carlory
closed
2 months ago
3
Fix watch for changes to LeaderWorkerSet created by llmaz and trigger a Reconcile for the owner
#108
carlory
closed
2 months ago
7
fix wrong field path in the openmodel webhook
#107
carlory
closed
2 months ago
2
Support scaling with Spot instances for cost saving
#106
kerthcet
opened
2 months ago
6
Previous
Next