saareliad / FTPipe

FTPipe and related pipeline model parallelism research.
41 stars 7 forks source link

CPU support? #4

Open pablogranolabar opened 2 years ago

pablogranolabar commented 2 years ago

Hi, very neat project.

Question: is it possible to use FTPipe with massively parallel CPU clusters? Say for example 256 VMs?

saareliad commented 2 years ago

Hi @pablogranolabar, tweaks will be needed, but it can be made possible.

Should consider the following parts:

Finally, there are some partitioning heuristics which would need to be changed according to your system, e.g., memory threshold in the master branch is hardcoded to 11GB for RTX2080ti: https://github.com/saareliad/FTPipe/blob/c3d853080e0bebde50deef78892baf0f3663daf1/autopipe/autopipe/model_partitioning/heuristics.py#L327