Open finkga opened 2 years ago
Hi Glenn,
I'm glad you find my work useful!
I did not implement multi-GPU support in this code, as I only have a single GPU myself and didn't have time to make the code exhaustive. Unfortunately, I probably won't have time to implement it in the near future either.
You can fork my work and start from there with the help of Lightning's docs, and perhaps suggest a modification to my code.
Cheers
Antoine
Thank you, Antoine! Lightning makes multi-GPU support much easier, but I have never found it to be as Will Falcon says, “just set gpus > 1 and it works.” I wish. If I figure out how to do it in your code base, I’ll be happy to contribute. I think I’ll have to redo the sampler to make it work.
--Glenn
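For anyone picking this up, here is a rough sketch of what "redoing the sampler" for multiple GPUs usually involves. It assumes a data-parallel setup in which each process (one per GPU) must iterate over its own slice of the dataset, and it mirrors the pad-then-stride strategy used by PyTorch's `DistributedSampler`. The function name `shard_indices` and its parameters are hypothetical and are not taken from this repository; the code uses only the standard library so the idea can be checked without a GPU.

```python
import random


def shard_indices(num_samples, rank, world_size, seed=0, shuffle=True):
    """Return the dataset indices that one process (rank) should iterate over.

    Hypothetical helper: illustrates rank-based sharding, not this repo's sampler.
    """
    if num_samples <= 0 or world_size <= 0 or not 0 <= rank < world_size:
        raise ValueError("invalid num_samples, rank or world_size")

    indices = list(range(num_samples))
    if shuffle:
        # Every rank uses the same seed, so all ranks agree on the shuffled order.
        random.Random(seed).shuffle(indices)

    # Pad by repeating indices from the start so the list divides evenly
    # across ranks (each rank must see the same number of samples).
    total = -(-num_samples // world_size) * world_size  # ceiling to a multiple
    while len(indices) < total:
        indices += indices[: total - len(indices)]

    # Each rank takes every world_size-th index, starting at its own offset.
    return indices[rank:total:world_size]


if __name__ == "__main__":
    for rank in range(4):
        print(rank, shard_indices(10, rank, world_size=4))
```

A custom sampler (for example a class-balanced one) would need to apply the same idea: build the full index order identically on every rank, then keep only the rank's own stride. Changing `seed` each epoch gives a new order while keeping the ranks in agreement.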
Thank you for providing such a well-written and understandable example. I did have problems running on multiple GPUs (which sometimes seems like voodoo anyway). Here's my command line and status message, including the traceback:
Using a single GPU trains just fine. Thank you again.
--Glenn