Closed HPUedCSLearner closed 2 months ago
@ilya-lavrenov @helena-intel can you look into this?
@HPUedCSLearner thanks for the clear report! I see the same issue with the other benchmarking scripts. The issue is with writing the usage_stats.json file. The benchmarking itself should still work fine. We will create a PR to fix this issue. In the meantime, to prevent the Exception, you can workaround this by editing https://github.com/vllm-project/vllm/blob/main/vllm/usage/usage_lib.py and remove the non-json-serializable object from the data:
--- a/vllm/usage/usage_lib.py
+++ b/vllm/usage/usage_lib.py
@@ -200,6 +200,7 @@ class UsageMessage:
logging.debug("Failed to send usage data to server")
def _write_to_file(self, data):
+ data.pop("kv_cache_dtype")
os.makedirs(os.path.dirname(_USAGE_STATS_JSON_PATH), exist_ok=True)
Path(_USAGE_STATS_JSON_PATH).touch(exist_ok=True)
with open(_USAGE_STATS_JSON_PATH, "a") as f:
Your current environment
🐛 Describe the bug
start from : https://docs.vllm.ai/en/latest/getting_started/openvino-installation.html
Then use the follow command, I get a Exception.
The log :