-
### Proposal to improve performance
I am using vllm version 0.6.3.post1 with four RTX 4090 GPUs to run inference on the qwen2-72B-chat-int4 model. A single request is served very quickly, but the perf…
-
### Report of performance regression
Using your benchmark
```
git clone https://github.com/vllm-project/vllm
cd vllm/benchmarks
wget https://huggingface.co/datasets/anon8231489123/ShareGPT_Vi…
```
-
When I test with an Intel(R) Core(TM) Ultra 5 125H, why is the NPU so slow?
```
# install the NPU driver following this guide:
# https://github.com/intel/linux-npu-driver/blob/main/docs/overview.md
pip install optim…
```
-
### Your current environment
```
vllm 0.5.3.post1+gaudi117
```
Script with tensor_parallel_size=1:
```text
export PT_HPU_ENABLE_LAZY_COLLECTIVES=true
export VLLM_GRAPH_…
```
-
### Your current environment
4xH100.
### Model Input Dumps
_No response_
### 🐛 Describe the bug
When benchmarking the performance of vllm with `benchmark_serving.py`, it will generate different…
-
Explain and demonstrate the use of the TPOT library, which can find the best model with the best parameters for classification and regression tasks without much effort.
Please assign this…
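To make the idea concrete: TPOT automates the search over models and hyperparameters using genetic programming. As a rough conceptual sketch only (not TPOT's actual API or search space), the same "find the best pipeline" idea can be shown with a plain scikit-learn grid search; the parameter grid and dataset below are illustrative.

```python
# Conceptual stand-in for what TPOT automates: searching over pipeline
# hyperparameters and keeping the best-scoring configuration.
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

pipe = Pipeline([
    ("scale", StandardScaler()),
    ("clf", LogisticRegression(max_iter=1000)),
])
# Illustrative grid; TPOT would instead evolve pipelines automatically.
grid = {"clf__C": [0.1, 1.0, 10.0]}
search = GridSearchCV(pipe, grid, cv=3)
search.fit(X_train, y_train)

print("best params:", search.best_params_)
print("test accuracy:", round(search.score(X_test, y_test), 3))
```

TPOT itself wraps this whole loop behind a single estimator (`TPOTClassifier` / `TPOTRegressor` with a `fit`/`score` interface), so the user never writes the search code by hand.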
-
When I try to run it on Windows through Docker, it gives this error. I have already updated Python to version 3.11.4, but the error persists. The docker…
-
### Anything you want to discuss about vllm.
I am profiling TTFT and TPOT on my machine. I could not explain the behavior of TTFT, so I opened this issue to seek advice.
Below figure shows the …
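For context, a minimal sketch of how these two metrics are commonly defined in LLM serving benchmarks: TTFT is the delay from request start to the first generated token, and TPOT is the mean inter-token latency over the remaining tokens. `token_times` below is a hypothetical list of absolute arrival times for each generated token.

```python
def ttft_and_tpot(request_start: float, token_times: list[float]) -> tuple[float, float]:
    """Compute TTFT and TPOT from per-token arrival timestamps (seconds)."""
    # Time To First Token: first arrival minus request start.
    ttft = token_times[0] - request_start
    if len(token_times) > 1:
        # Time Per Output Token: average gap between consecutive tokens
        # after the first one.
        tpot = (token_times[-1] - token_times[0]) / (len(token_times) - 1)
    else:
        tpot = 0.0
    return ttft, tpot

ttft, tpot = ttft_and_tpot(0.0, [0.25, 0.30, 0.35, 0.40])
print(ttft, tpot)  # TTFT = 0.25 s, TPOT ≈ 0.05 s
```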
-
Use the TPOT feature-selection strategy on tsflex-generated features.
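A rough sketch of that pipeline, with stand-ins on both ends: the windowed features below are computed with pandas (standing in for the feature table tsflex would produce), and a scikit-learn `SelectKBest` step stands in for the feature-selection stage TPOT would choose. Column names, window sizes, and labels are all illustrative.

```python
import numpy as np
import pandas as pd
from sklearn.feature_selection import SelectKBest, f_classif

rng = np.random.default_rng(0)
signal = pd.Series(rng.normal(size=300))

# Window-based features; tsflex would normally emit a table like this.
feats = pd.DataFrame({
    "mean_w30": signal.rolling(30).mean(),
    "std_w30": signal.rolling(30).std(),
    "min_w30": signal.rolling(30).min(),
    "max_w30": signal.rolling(30).max(),
}).dropna()

# Toy labels that depend on one feature, so selection has a signal to find.
y = (feats["std_w30"] > feats["std_w30"].median()).astype(int)

# Stand-in for the selection stage of an AutoML pipeline.
selector = SelectKBest(f_classif, k=2).fit(feats, y)
selected = feats.columns[selector.get_support()].tolist()
print("selected features:", selected)
```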
-
I ran some tests to find better parameters to speed things up, and there hasn't been a significant change in TTFT (Time To First Token). Is my TTFT correct? I feel it might be a bit t…
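One way to sanity-check a reported TTFT is to measure it independently against a streaming endpoint: record the wall-clock gap between issuing the request and receiving the first streamed token. The sketch below uses a fake generator in place of a real streaming client (e.g. an OpenAI-compatible client with `stream=True`); the sleep durations are purely illustrative.

```python
import time

def measure_ttft(stream) -> float:
    """Wall-clock seconds from request start to the first streamed token."""
    start = time.perf_counter()
    for _token in stream:  # the first iteration completes when token 1 arrives
        return time.perf_counter() - start
    return float("inf")  # stream produced no tokens

def fake_stream():
    # Stand-in for a real streaming response; sleeps emulate generation latency.
    time.sleep(0.05)
    yield "Hello"
    time.sleep(0.01)
    yield " world"

ttft = measure_ttft(fake_stream())
print(f"TTFT ≈ {ttft * 1000:.1f} ms")
```

If this independent measurement roughly matches the benchmark's number, the TTFT figure itself is trustworthy and the investigation can move on to the serving parameters.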