Upon executing the training command, I have observed a phenomenon wherein the memory initially increases to 24GB and subsequently decreases to 15GB before the training process starts.
I am seeking your insights in understanding the underlying cause behind this memory peak during the initial stage of the process.
Thank you for your time.
Dear author,
Upon executing the training command, I have observed a phenomenon wherein the memory initially increases to 24GB and subsequently decreases to 15GB before the training process starts.
I am seeking your insights in understanding the underlying cause behind this memory peak during the initial stage of the process. Thank you for your time.