I have a 12-core PC with a 4090, and it seems like the code is bottlenecked on BatchWorker - all my CPUs are at 100%, while my GPU is mostly underutilized. If I'm reading the code right, the BatchWorkers are doing rollouts with the trained model - is that right? Is this expected, or is there a trick to optimizing it? I'm running scripts/train.sh.
Yeah, you are right. You can modify the number of BatchWorkers and DataWorkers, but that may increase training time. If you want to reduce CPU load, you can reduce the number of DataWorkers.
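A minimal sketch of how you might size the worker pools to your machine (the function `choose_worker_counts` and the split between the two pools are hypothetical - check the repo's config for the actual parameter names; the repo itself does not expose this helper):

```python
import os

def choose_worker_counts(reserve_cores: int = 2):
    """Pick BatchWorker/DataWorker counts that leave some cores free.

    Leaves `reserve_cores` CPUs for the main training loop and the OS,
    then splits the remaining cores evenly between the two worker pools.
    """
    total = os.cpu_count() or 1
    available = max(1, total - reserve_cores)
    num_batch_workers = max(1, available // 2)              # rollout workers
    num_data_workers = max(1, available - num_batch_workers)  # data workers
    return num_batch_workers, num_data_workers

batch_workers, data_workers = choose_worker_counts()
print(batch_workers, data_workers)
```

On a 12-core machine with 2 cores reserved, this would assign 5 workers to each pool; lowering either count trades CPU pressure for slower training, as noted above.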