Closed alphrc closed 4 months ago
The OS will "swap" in which some active memory gets stored on "disk" if you use more memory than the machine has. This can be very slow so it's best to avoid it as much as possible. In your case if it only happens once in a while then it's probably fine. If it happens regularly then maybe your toks/sec will start to get really slow.
I am fine-tuning a 72B model Qwen/Qwen2-72B-Instruct with ~50000 data. The peak mem reaches 201GB but my machine only have 192GB RAM. It is still running fine at the moment. How is this possible? Would there be any problem in the future?