[Usage]: Is this an error ? "async_llm_engine.py:154] Aborted request cmpl-xxxxx"

Your current environment

Versions of relevant libraries:
[pip3] numpy==1.26.4
[pip3] nvidia-nccl-cu12==2.18.1
[pip3] torch==2.1.2
[pip3] triton==2.1.0
[conda] nomkl                     3.0                           0    https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
[conda] numpy                     1.26.4                   pypi_0    pypi
[conda] nvidia-nccl-cu12          2.18.1                   pypi_0    pypi
[conda] torch                     2.1.2                    pypi_0    pypi
[conda] triton                    2.1.0                    pypi_0    pypi
ROCM Version: Could not collect
Neuron SDK Version: N/A
vLLM Version: 0.4.0
vLLM Build Flags:
CUDA Archs: Not Set; ROCm: Disabled; Neuron: Disabled
GPU Topology:
GPU0    GPU1    CPU Affinity    NUMA Affinity
GPU0     X      SYS     0-23,48-71      0
GPU1    SYS      X      24-47,72-95     1

Legend:

  X    = Self
  SYS  = Connection traversing PCIe as well as the SMP interconnect between NUMA nodes (e.g., QPI/UPI)
  NODE = Connection traversing PCIe as well as the interconnect between PCIe Host Bridges within a NUMA node
  PHB  = Connection traversing PCIe as well as a PCIe Host Bridge (typically the CPU)
  PXB  = Connection traversing multiple PCIe bridges (without traversing the PCIe Host Bridge)
  PIX  = Connection traversing at most a single PCIe bridge
  NV#  = Connection traversing a bonded set of # NVLinks

How would you like to use vllm

I keep seeing the following line in the vLLM output:

async_llm_engine.py:154] Aborted request cmpl-xxxxx

Is this some kind of error message? Could anyone help me understand what it means?

Thank you.

vllm-project / vllm

[Usage]: Is this an error ? "async_llm_engine.py:154] Aborted request cmpl-xxxxx" #5712
