issues
search
deepjavalibrary
/
djl-serving
A universal scalable machine learning model deployment solution
Apache License 2.0
182
stars
59
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
[ci] fix nightly wheel pip installs
#2096
tosterberg
closed
1 week ago
0
[Neuron] Fix Neuron compilation logging
#2095
a-ys
closed
1 week ago
0
[0.28.0-dlc] Record telemetry including acceptance rate
#2094
zachgk
closed
1 week ago
0
Llama 2 7b chat model output quality is low
#2093
VrushaliJoshi-v37040
opened
1 week ago
4
[python] move parse input functions to input_parser.py
#2092
sindhuvahinis
closed
2 days ago
2
[CI] Inferentia tests through pytest
#2091
zachgk
closed
1 week ago
0
[python] refactor rolling batch inference method
#2090
sindhuvahinis
closed
2 weeks ago
0
[CI] Fix post autoAwq cleanup
#2089
zachgk
closed
2 weeks ago
0
Record telemetry including acceptance rate
#2088
zachgk
closed
1 week ago
0
codeql for python
#2087
siddvenk
closed
2 weeks ago
0
[0.28.0 dlc][cherry-pick] AOT fix and pin numpy
#2086
sindhuvahinis
closed
2 weeks ago
0
fix pinning of numpy version in docker files by reinstalling specifie…
#2085
siddvenk
closed
2 weeks ago
0
fix pinning of numpy version in docker files by reinstalling specifie…
#2084
siddvenk
closed
2 weeks ago
0
fix pinning of numpy version in docker files by reinstalling specifie…
#2083
siddvenk
closed
2 weeks ago
0
fix pinning of numpy version in docker files by reinstalling specifie…
#2082
siddvenk
closed
1 week ago
1
[IB] Supports forwarding environment variables
#2081
zachgk
closed
2 weeks ago
0
[cherry-pick][0.28.0-dlc][aot] Fix aot quantization for weight only quantization (#2079)
#2080
sindhuvahinis
closed
2 weeks ago
0
[aot] Fix aot quantization for weight only quantization
#2079
tosterberg
closed
2 weeks ago
0
[cherry-pick][0.28.0-dlc][Neo] Fix Neo Quantization properties output. Add some additional con…
#2078
sindhuvahinis
closed
2 weeks ago
0
[Neo] Fix Neo Quantization properties output. Add some additional configuration.
#2077
a-ys
closed
2 weeks ago
0
pin numpy to <2 in ci/docker (#2071)
#2076
siddvenk
closed
2 weeks ago
0
pin numpy to <2 in ci/docker
#2075
siddvenk
closed
2 weeks ago
0
pin numpy to <2 in ci/docker
#2074
siddvenk
closed
2 weeks ago
0
[ib] fix docker env file write location
#2073
tosterberg
closed
2 weeks ago
0
[fix] initialize sequence dictionary for default sequence index to pr…
#2072
siddvenk
closed
2 weeks ago
1
pin numpy to <2 in ci/docker
#2071
siddvenk
closed
2 weeks ago
1
DJL-TensorRT-LLM : inference no longer working, tokenizer error.
#2070
eduardzl
closed
6 days ago
3
[cherry-pick][0.28.0-dlc][fix] remove validating quantization in properties_manager.py
#2069
sindhuvahinis
closed
2 weeks ago
0
[ci][fix] don't use env vars for llm integ test as it causes issues w…
#2068
siddvenk
closed
2 weeks ago
0
[fix] remove validating quantization in properties_manager.py
#2067
sindhuvahinis
closed
2 weeks ago
0
[0.28.0-dlc][cherry-pick][secure-mode] Fix entrypoint control name
#2066
ethnzhng
closed
3 weeks ago
0
[secure-mode] Fix entrypoint control name
#2065
ethnzhng
closed
3 weeks ago
0
[ci] Adding integration test for AutoAwq
#2064
sindhuvahinis
closed
2 weeks ago
0
[lmi][docs] update to 0.28.0 in lmi docs
#2063
siddvenk
closed
3 weeks ago
0
[secure-mode] Refactor secure mode plugin (#2058)
#2062
frankfliu
closed
3 weeks ago
0
[0.28.0-dlc][cherry-pick] AutoAWQ Integration Script (#2038)
#2061
sindhuvahinis
closed
3 weeks ago
0
[plugin] Fixes plugin scaning bug
#2060
frankfliu
closed
3 weeks ago
0
[plugin] Fixes plugin scaning bug
#2059
frankfliu
closed
3 weeks ago
0
[secure-mode] Refactor secure mode plugin
#2058
frankfliu
closed
3 weeks ago
0
Stopping short: Very few output tokens returned, even though max tokens set very high.
#2057
yaronr
closed
3 weeks ago
1
[0.28.0-dlc] IP protection cherry-pick with Kotlin to gradle changes
#2056
sindhuvahinis
closed
3 weeks ago
0
[cherry-pick] fix bug with duplicate models when HF_MODEL_ID points to model store …
#2055
siddvenk
closed
3 weeks ago
0
fix bug with duplicate models when HF_MODEL_ID points to model store
#2054
siddvenk
closed
3 weeks ago
0
[0.28.0 dlc][cherry-pick] cherry-pick all doc updates to 0.28.0-dlc
#2053
sindhuvahinis
closed
3 weeks ago
0
[test] remove trtllm flan t5 integration test
#2052
sindhuvahinis
closed
3 weeks ago
0
[vllm, lmi-dist] add support for top_n_tokens
#2051
sindhuvahinis
closed
2 weeks ago
0
Fix benchmark deb build
#2050
zachgk
closed
3 weeks ago
0
[0.28.0-dlc][lmi] Update lmi-dist to 10.0.1
#2049
maaquib
closed
3 weeks ago
2
[serving] Update default max worker to 1 for GPU
#2048
xyang16
closed
3 weeks ago
0
[docs] Updates embedding user guide
#2047
frankfliu
closed
3 weeks ago
0
Previous
Next