about training time - Githubissues

jinnh / GSAD

[NeurIPS 2023] Global Structure-Aware Diffusion Process for Low-Light Image Enhancement

114 stars 9 forks source link

about training time #19

Closed conutqiqi closed 1 month ago

conutqiqi commented 3 months ago

Thank you very much for the efforts . Recently, I tried to reproduce the training process, but found that the training time was quite long. May I ask the author how long the model ran and what kind of graphics card was used？

jinnh commented 3 months ago

Hello, thank you for your interest. We utilized the RTX3090 for training. Taking a single RTX3090 as an example, the first stage and the second stage take about 28+ and 60+ hours respectively. Taking into account the multi-person use of the server GPU and CPU, the time may vary. At the same time, we also provide the first-stage pre-training model. If you need to speed up the training, you can download it directly and use it.

AXNing commented 2 months ago

Thanks for your work. Can you please tell me if the first stage of training is also done on the whole diffusion model? I noticed during training that I am only using about 4G of memory with my batch-size setting of 8. Is there a problem with this? Thank you for your answer!

jinnh commented 1 month ago

@biedaxiaohua Thank you for your interest in our work. I am sorry for the late response. This is normal since we only use the patch size of 96x96.