Open soyoapp opened 1 month ago
yes, we keep a cache of gpu buffers for popular sizes. we can maybe free that cache when the last session gets disposed, assuming that an app would not do create/close/create sequences.
This issue has been automatically marked as stale due to inactivity and will be closed in 30 days if no further activity occurs. If further support is needed, please provide an update and/or more details.
Describe the issue
I use onnxruntime web with following code
and following is my test code
but after infer, nvidia-smi show the gpu mem is still in use, only refresh browser tab or close browser tab can free gpu mem
To reproduce
Just run above code
Urgency
No response
ONNX Runtime Installation
Released Package
ONNX Runtime Version or Commit ID
1.17.3
Execution Provider
'webgpu' (WebGPU)
Env
Microsoft Edge 127.0.2651.74 (Official build) (64-bit) Revision dbf5b0aa014c4e70e3d5e2d73248e21264f82957 Chromium version 127.0.6533.73 Operating system Linux JavaScript V8 12.7.18.6 User agent Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/127.0.0.0 Safari/537.36 Edg/127.0.0.0 Command-line /usr/bin/microsoft-edge --disable-web-security --password-store=basic --user-data-dir=/home/roroco/.config/JetBrains/WebStorm2023.2/edge-user-data --remote-debugging-port=39765 --no-default-browser-check --flag-switches-begin --enable-unsafe-webgpu --enable-features=Vulkan --flag-switches-end about:blank