issues
search
deepjavalibrary
/
djl-serving
A universal scalable machine learning model deployment solution
Apache License 2.0
192
stars
64
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
[docker] remove tensorflow native from cpu-full image
#2136
frankfliu
closed
2 months ago
0
[awscurl] Fixes missing \"text\" case in jsonlines output
#2135
frankfliu
closed
2 months ago
0
[CI] Add trust remote code option in TRTLLM CI
#2134
ydm-amazon
closed
2 months ago
2
[cherry-pick] [secure-mode] add per-model configs to allowlist (#2132)
#2133
siddvenk
closed
2 months ago
0
[secure-mode] add per-model configs to allowlist
#2132
ethnzhng
closed
2 months ago
0
[ci] remove HF_MODEL_ID env for lmi_dist_1 test
#2131
sindhuvahinis
closed
2 months ago
0
[secure-mode] Add properties allowlist validation (#2129)
#2130
frankfliu
closed
2 months ago
0
[secure-mode] Add properties allowlist validation
#2129
ethnzhng
closed
2 months ago
1
[0.28.0-dlc] spec decoding cache dirty fix
#2128
sindhuvahinis
closed
2 months ago
0
[docs]Update DJL-Serving Read Me
#2127
Varun-Dutta
closed
2 months ago
0
[serving] Fixes snake case pattern (#2123)
#2126
frankfliu
closed
2 months ago
0
Support multi node for lmi-dist
#2125
xyang16
closed
2 months ago
0
[lmi][hf]remove usage of HF Conversational pipeline as it is deprecated
#2124
siddvenk
closed
2 months ago
0
[serving] Fixes snake case pattern
#2123
frankfliu
closed
2 months ago
0
[python] add support for 3p use-case
#2122
siddvenk
closed
2 months ago
1
[docker] Fixes onnxruntime engine installation
#2121
frankfliu
closed
2 months ago
0
[awscurl] Handles Bedrock special url case
#2120
frankfliu
closed
2 months ago
0
[docker] Update DJL to 0.29.0-SNAPSHOT
#2119
frankfliu
closed
2 months ago
0
update flags to prevent deprecation
#2118
lanking520
closed
2 months ago
0
[cherrypick][0.28.0 dlc] fix default behavior for rb in neuron (#2116)
#2117
tosterberg
closed
2 months ago
0
[fix] align default neuron behavior between model server and handler
#2116
tosterberg
closed
2 months ago
0
[Neo] Neo compilation/quantization script bugfixes
#2115
a-ys
closed
2 months ago
0
[serving] make http response codes configurable for exception cases
#2114
siddvenk
closed
2 months ago
0
[serving] introduce interface to customize http response status retur…
#2113
siddvenk
closed
2 months ago
1
Fix acceptance history check in speculative telemetry
#2112
zachgk
closed
2 months ago
1
[CI] fix bugs
#2111
lanking520
closed
2 months ago
0
[0.28.0] fix telemetry tracking for Speculative Decoding
#2110
lanking520
closed
2 months ago
0
[CI] Integration tests through pytest
#2109
zachgk
closed
2 months ago
1
[ci] Fix gpt2 integ test failure
#2108
maaquib
closed
2 months ago
0
[0.26.0-dlc] pin datasets version in tensorrt llm
#2107
sindhuvahinis
closed
2 months ago
0
[0.27.0-dlc] pin datasets version in tensorrt llm
#2106
sindhuvahinis
closed
2 months ago
0
[0.28.0-dlc][cherry-pick][dockerfile] pin datasets to 2.19.1 in trtllm (#2104)
#2105
sindhuvahinis
closed
2 months ago
0
[dockerfile] pin datasets to 2.19.1 in trtllm
#2104
sindhuvahinis
closed
2 months ago
0
[0.28.0-dlc][cherry-pick][fix] Set tokenizer on output_formatter for TRT-LLM Handlers (#2100)
#2103
sindhuvahinis
closed
2 months ago
0
[ci] add trtllm chat test
#2102
sindhuvahinis
closed
2 months ago
0
[ci] fix pinning numpy inf2 error
#2101
tosterberg
closed
2 months ago
0
[fix] Set tokenizer on output_formatter for TRT-LLM Handlers
#2100
maaquib
closed
2 months ago
2
[Draft][Do-not-merge] 0.28.0 multi node initial changes
#2099
nikhil-sk
closed
1 month ago
0
[ci] fix trtllm nightly wheel pip installs
#2098
tosterberg
closed
2 months ago
0
[ci] fix ground truth for neuron unit test
#2097
tosterberg
closed
2 months ago
0
[ci] fix nightly wheel pip installs
#2096
tosterberg
closed
2 months ago
0
[Neuron] Fix Neuron compilation logging
#2095
a-ys
closed
2 months ago
0
[0.28.0-dlc] Record telemetry including acceptance rate
#2094
zachgk
closed
2 months ago
0
Llama 2 7b chat model output quality is low
#2093
VrushaliJoshi-v37040
opened
2 months ago
4
[python] move parse input functions to input_parser.py
#2092
sindhuvahinis
closed
2 months ago
2
[CI] Inferentia tests through pytest
#2091
zachgk
closed
2 months ago
0
[python] refactor rolling batch inference method
#2090
sindhuvahinis
closed
2 months ago
0
[CI] Fix post autoAwq cleanup
#2089
zachgk
closed
2 months ago
0
Record telemetry including acceptance rate
#2088
zachgk
closed
2 months ago
0
codeql for python
#2087
siddvenk
closed
2 months ago
0
Previous
Next