Open scottsun94 opened 1 year ago
@rkooo567 this seems more like a core issue?
This actually seems like a bug based on that it is printed 1m after things are printed. It is not clear if it is core or tune (I remember Tune has a special way to print logs, and it could be related to it?) @scottsun94 can you provide a repro script and bump the priority?
Tune just uses regular stdout/stderr logging. The only custom thing we do is sometimes duplicating logs to a file, but this is not the case here and shouldn't affect it. I also believe this is a core issue.
What happened + What you expected to happen
I ran this torch fine-tuning resnet example: https://docs.ray.io/en/latest/train/examples/pytorch/pytorch_resnet_finetune.html
The full output is attached below.
The additional output at the end is the same as that at relatively the beginning of the script. This feels like a bug to me. The entire training takes relatively a long time, it shouldn't be entirely due to fast training.
Versions / Dependencies
nightly
Reproduction script
https://docs.ray.io/en/latest/train/examples/pytorch/pytorch_resnet_finetune.html
Issue Severity
Low: It annoys or frustrates me.