inference Search Results

1000+ results
for inference

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

vllm-project/vllm #4653

[Bug]: NCCL timed out during inference

### Your current environment Using: * vllm 0.4.1 * nccl 2.18.1 * pytorch 2.2.1 ### 🐛 Describe the bug During inference I sometimes get this error: ```bash (RayWorkerWrapper pid=2376582…

enkiid updated 3 weeks ago
7
ttanida/rgrg #12

Unable to run inference

rgrg/src$ python ./full_model/generate_reports_for_images.py Traceback (most recent call last): File "/media/Win11/rgrg/src/./full_model/generate_reports_for_images.py", line 20, in from src…

raghavzns updated 1 month ago
2
vllm-project/llm-compressor #926

Got Error when I load a 2of4 model using vllm.

**Describe the bug** I'm compressing a qwen2.5_7b model using `examples/quantization_2of4_sparse_w4a16/llama7b_sparse_w4a16.py`, but I failed to load the stage_sparsity model. The error is shown belo…

jiangjiadi updated 4 hours ago
12
huggingface/text-generation-inference #2530

xpu/cpu: docker images referenced in documentation do not ex…

xpu and cpu Intel images referenced in documentation do not exist: * https://huggingface.co/docs/text-generation-inference/en/installation_intel * https://github.com/huggingface/text-generation-infe…

dvrogozh updated 2 months ago
3
FunAudioLLM/SenseVoice #147

The demo.py can not work correctly

Notice: In order to resolve issues more efficiently, please raise issue following the template. （注意：为了更加高效率解决您遇到的问题，请按照模板提问，补充细节） ## 🐛 Bug When I run the demo.py , the error is : ``` Tracebac…

chongkuiqi updated 4 weeks ago
1
pytorch/torchtune #1020

QLoRA Inference

Can I load QLoRA fine-tuning weights into a Hugging Face model as shown below? ```python model_id = "meta-llama/Meta-Llama-3-8B-Instruct" quantization_config = BitsAndBytesConfig( load_in_4bit=T…

jeff52415 updated 4 months ago
1
elastic/elasticsearch #116140

[CI] TextEmbeddingCrudIT class failing

**Build Scans:** - [elasticsearch-periodic #4727 / openjdk17_checkpart2_java-fips-matrix](https://gradle-enterprise.elastic.co/s/suslzsefkzbd4) - [elasticsearch-periodic #4712 / openjdk17_checkpart2_j…

elasticsearchmachine updated 2 weeks ago
2
elastic/kibana #196707

[Search:Indices:Pipelines page] Missing tooltip on Starting …

**Description** Tooltips should be present for users if they are present on element. For users using only keyboard as well (not only for the users using mouse). **Preconditions** Stateful Indices -> …

L1nBra updated 1 day ago
1
microsoft/onnxruntime #21737

Inferencing FP16 model using onnxruntime

### Describe the issue I have a detector with FP16 and FP32 weights(onnx). Below is the code for FP32 which gives the correct detections when inferencing on FP32 weights. ``` void process_image…

navyverma updated 2 months ago
5
THU-ESIS/Chinese-Mistral #4

batch inference

Hi authors, I want to test the performance of the Mistral7B on the test dataset. Is it only possible to do single sample inference (with model. generate(...))? Are there any methods to accelerate t…

x6p2n9q8a4 updated 5 months ago
1

上一页 1...94 95 96 97 98 99 100...100 下一页

1000+ results for inference

1000+ results
for inference