keroro824 / HashingDeepLearning

Codebase for "SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems"
MIT License
1.07k stars 169 forks source link

Profile for performance considerations #33

Open robers97 opened 4 years ago

robers97 commented 4 years ago

I'm seeing scale-out behavior that needs some further study. In particular, larger thread numbers seem to increase the page usage, but I don't have the profiler completely setup. Has anyone come up with a configuration file that is good for profiling? IE something with a 10-15 min run time but still enables a data access pattern that is in the spirit of the paper..