Open atomrun39 opened 4 hours ago
In addition, there are always warnings like this during the saving process. How can I eliminate them?
/usr/local/python3.9.10/lib/python3.9/site-packages/dlrover/python/common/multi_process.py:271: ResourceWarning: unclosed <socket.socket fd=127, family=AddressFamily.AF_UNIX, type=SocketKind.SOCK_STREAM, proto=0, laddr=/tmp/ckpt_sock/1857345181448912897/sharedlock_shm_lock_1.sock>
connection, _ = self._server.accept()
ResourceWarning: Enable tracemalloc to get the object allocation traceback
When using dlrover to save checkpoints, the following error will always occur:
The code used is as follows:
How to solve this problem? I really hope to receive a reply.