-
## Description
I followed the recipe given [here](https://docs.djl.ai/docs/serving/serving/docs/lmi/tutorials/trtllm_manual_convert_tutorial.html) to manually convert teknium/OpenHermes-2.5-Mistral-7…
-
I ran
```
client = mii.serve("mistralai/Mistral-7B-Instruct-v0.2")
response = client.generate(inputs, max_new_tokens=128, tensor_parallel=2, replica_num=2)
```
on AWS ml.g5.12xlarge with 4 GPUs…
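For reference, the snippet above passes `tensor_parallel` and `replica_num` to `client.generate()`, while in DeepSpeed-MII those are deployment-time options of `mii.serve()`; `generate()` takes only generation-time options. A hedged sketch of that split, using the model name and values from the report (the import guard is only so the sketch loads on machines without `deepspeed-mii`):

```python
# Hedged sketch of the DeepSpeed-MII serve/generate split; not a claim about
# what this specific issue's root cause is.
try:
    import mii  # requires deepspeed-mii and CUDA GPUs at runtime
except ImportError:
    mii = None  # lets the sketch be imported without the library installed


def serve_and_generate(prompts):
    # tensor_parallel / replica_num configure the deployment, so they go
    # to mii.serve(), not to generate().
    client = mii.serve(
        "mistralai/Mistral-7B-Instruct-v0.2",
        tensor_parallel=2,
        replica_num=2,
    )
    # generate() takes generation-time options such as max_new_tokens.
    return client.generate(prompts, max_new_tokens=128)
```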
-
**Is your feature request related to a problem? Please describe.**
The feature request is related to the problem of manually matching job seeker resumes with relevant job descriptions, which is a ti…
-
## Description
We are using SageMaker for large model inference (LMI) as documented [here](https://docs.aws.amazon.com/sagemaker/latest/dg/large-model-inference-dlc.html)
With this notebook http…
-
## Description
Token streaming not working with rolling batch
### Expected Behavior
(what's the expected behavior?)
### Error Message
## How to Reproduce?
(If you developed your own…
-
## Description
When using the OnnxRuntime engine, JVM memory is not reclaimed.
The code is below. Many users have already reported that memory is not reclaimed; I don't know why you are so confident that there has never been a problem. We have already moved to Python in production. Please take a look when you have time.
```
import ai.djl.Device;
import ai.djl.MalformedModelException;
import ai.djl…
```
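DJL's `ZooModel`, `Predictor`, and `NDManager` are `AutoCloseable` and hold native (off-heap) memory that the JVM garbage collector never sees, so the usual fix for "memory is not reclaimed" is to close each resource per inference, typically with try-with-resources. A minimal stdlib sketch of that pattern (the `MockPredictor` class is a placeholder standing in for `ai.djl.inference.Predictor`, so it runs without DJL):

```java
import java.util.concurrent.CompletableFuture;

public class PredictorCloseDemo {
    // Placeholder for a DJL Predictor; a real one holds native memory
    // that is only released when close() is called.
    static class MockPredictor implements AutoCloseable {
        String predict(String input) {
            return "result:" + input;
        }

        @Override
        public void close() {
            // In DJL this releases the predictor's native resources.
        }
    }

    // Each (possibly async) task creates and closes its own predictor;
    // try-with-resources guarantees close() even if predict() throws.
    static String inferAsync(String input) {
        return CompletableFuture.supplyAsync(() -> {
            try (MockPredictor predictor = new MockPredictor()) {
                return predictor.predict(input);
            }
        }).join();
    }

    public static void main(String[] args) {
        System.out.println(inferAsync("image.jpg"));
    }
}
```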
-
## Description
We integrated DJL inference of a yolov5 model into a Spring Boot project; load testing showed that memory cannot be reclaimed and keeps growing as the load test continues.
Note: this is not the same problem as https://github.com/deepjavalibrary/djl/issues/2800; running in the main thread indeed causes no problem, but asynchronous execution does leak memory.
In the Spring project, memory…
90600 updated 4 months ago
-
Hello, when I deploy this on my server and start it normally with nohup java -jar, it keeps failing with: Caused by: java.lang.UnsatisfiedLinkError: /root/.djl.ai/pytorch/1.9.1-cpu-linux-x86_64/libtorch_cpu.so: /lib64/libm.so.6: version `GLIBC_2.23' not found …
fjwnb updated 2 years ago
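This `UnsatisfiedLinkError` means the system's glibc is older than what the default libtorch build expects. DJL's PyTorch docs describe a precxx11 native build for older glibc (e.g. CentOS 7's 2.17), selected with the `PYTORCH_PRECXX11` flag; a hedged sketch of the check and workaround:

```shell
# Check the installed glibc version first (e.g. "glibc 2.17" on CentOS 7).
getconf GNU_LIBC_VERSION 2>/dev/null || echo "GNU_LIBC_VERSION not available"

# Ask DJL to download the precxx11 libtorch build for older glibc
# (flag name per DJL's PyTorch engine documentation):
export PYTORCH_PRECXX11=true

# Then restart the app, e.g.:
#   nohup java -jar app.jar &
```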
-
DJL does not support (or has not documented support) for FP8 quantization ([docs](https://demodocs.djl.ai/docs/serving/serving/docs/lmi/user_guides/trt_llm_user_guide.html#quantization-support)).
…
-
## Description
When I build a project supporting DJL in Android Studio, I encounter the following error. How can I resolve it?
`Could not resolve all files for configuration ':app:debugRuntimeClas…