AnotherSamWilson / miceforest

Multiple Imputation with LightGBM in Python
MIT License
340 stars 31 forks source link

GPU does not improve performance #78

Closed xydisla closed 1 year ago

xydisla commented 1 year ago

I was able to get lightgbm working with GPU, but it doesn't seem to substantially improve performance (~8% improvement for 64 CPU + GPU vs 64 CPU alone). I've also been monitoring CPU and GPU usage, and it seems like the GPU is only active for brief periods of time during training, and most of the training/imputing time is spent with the CPU being used.

Interestingly, the GPU takes substantially longer to tune parameters than CPU.

AnotherSamWilson commented 1 year ago

I have noticed this phenomenon on small data. GPU can take considerably longer if you do not have many millions of rows in your dataset, and even then it is dependent on datatypes, architecture, etc etc. I'm going to close this, because it does not seem to be miceforest specific.