-
**Describe the bug**
My CPU is an Ultra 7 258V, and my system is Windows 11 Home 24H2. I just tried running the qwen2.5-7b-instruct model using your example code for the first time. However, I noticed t…
-
### System Info
Hi Team,
When deploying the model on AWS with `huggingface-pytorch-tgi-inference:2.3.0-tgi2.2.0`, I got the above error.
Could you tell me when TGI can provide the new image? Is t…
-
Hi all,
I am urgently trying to deploy TFLite models converted with Larq Compute Engine (LCE) on an ARM32 device, specifically a Cortex-M7 CPU (an STM32F7-series MCU).
I have seen some rel…
-
### Describe the Bug
File "/data/mlops/Open-Assistant/inference/server/oasst_inference_server/plugins/vectors_db/loaders/data_loader.py", line 383, in path_to_doc1
res = file_to_doc(file, …
-
Chai-1 is limited to 2048 tokens (token=canonical AA or atom), and the main reason is high memory consumption.
We received several requests to support larger crop sizes, but it requires _significa…
-
Hey team,
First of all, thanks for the effort you are doing for this amazing project.
I would like to ask for support for a recent and very important addition to AWS Bedrock: cross-r…
-
### Self Checks
- [X] I have thoroughly reviewed the project documentation (installation, training, inference) but couldn't find any relevant information that meets my needs. [English](https://speech…
-
If the compiler hits an inference error, it provides neither a line number nor any information in the error message about what caused the error.
E.g.
```
y = lambda Mou…
```
-
### System Info
Name: peft
Version: 0.13.2
### Who can help?
When I try to load the adapter for inference, it shows the following error:
`TypeError: LoraConfig.__init__() got an unexpected ke…
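This kind of `TypeError` typically appears when an `adapter_config.json` saved by a newer peft release contains fields the installed `LoraConfig` does not accept. A common workaround is to filter the config dict down to the keys the constructor recognizes before instantiating it. Below is a minimal, hedged sketch: `LoraConfig` here is a stand-in dataclass (not the real peft class), and `new_field_from_newer_peft` is a hypothetical unknown key used only for illustration.

```python
import inspect
from dataclasses import dataclass


@dataclass
class LoraConfig:
    # Stand-in for peft.LoraConfig; only two illustrative fields.
    r: int = 8
    lora_alpha: int = 16


def filter_kwargs(cls, cfg: dict) -> dict:
    """Keep only the keys that cls.__init__ actually accepts."""
    accepted = set(inspect.signature(cls.__init__).parameters)
    return {k: v for k, v in cfg.items() if k in accepted}


# Simulated adapter_config.json with an unknown key from a newer release.
raw_cfg = {"r": 4, "lora_alpha": 32, "new_field_from_newer_peft": True}
config = LoraConfig(**filter_kwargs(LoraConfig, raw_cfg))
```

With the real peft class, the same `filter_kwargs` helper can be applied to the loaded JSON dict; upgrading peft so both sides use the same version is usually the cleaner fix.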
-
**Description**
Error
```
model_instance_state.cc:1117] "Failed updating TRT LLM statistics: Internal - Failed to find Max KV cache blocks in metrics."
```
when the KV cache is disabled while building…