fengggli / gpu-computing-materials

A simple deep learning framework that optimizes task scheduling and memory usage on different CPU/GPU architectures.

device convolution experiments #48

fengggli closed 4 years ago

fengggli commented 5 years ago

This issue describes the device convolution experiments we plan to finish.

Test file

Important

Sample input/output

(py36) lifen@sievert(:):~/Workspace/gpu-computing-materials/build_cuda$ ./tests/bench-conv-device &> result.txt
(py36) lifen@sievert(:):~/Workspace/gpu-computing-materials/build_cuda$ less result.txt | grep stat-cudnn
stat-cudnn      1       4       32      1       3       17.425  3.513
stat-cudnn      1       4       32      4       3       5.626   5.027
stat-cudnn      1       4       32      16      3       13.015  9.791
stat-cudnn      4       4       32      1       3       5.683   5.260
stat-cudnn      4       4       32      4       3       13.332  10.296
stat-cudnn      4       4       32      16      3       42.737  29.357
stat-cudnn      16      4       32      1       3       13.562  11.304
stat-cudnn      16      4       32      4       3       42.980  30.230
stat-cudnn      16      4       32      16      3       160.499 106.413
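
To compare runs, the `stat-cudnn` lines above can be loaded into a script instead of being read by eye. The following is a minimal sketch, not part of the repository: `parse_stat_lines` is a hypothetical helper, and because the issue does not say what each column means, the values are kept as unnamed positional fields rather than given guessed names.

```python
def parse_stat_lines(text, tag="stat-cudnn"):
    """Return one list of floats per line whose first field equals `tag`.

    The column meanings are not documented in the issue, so the fields are
    returned positionally, in the order they appear in the output.
    """
    rows = []
    for line in text.splitlines():
        fields = line.split()  # splits on any run of spaces or tabs
        if fields and fields[0] == tag:
            rows.append([float(value) for value in fields[1:]])
    return rows


# Two lines copied from the sample output above.
sample = """\
stat-cudnn      1       4       32      1       3       17.425  3.513
stat-cudnn      16      4       32      16      3       160.499 106.413
"""

rows = parse_stat_lines(sample)
for row in rows:
    print(row)
```

To use it on a real run, pass the contents of the result file instead of `sample`, for example `parse_stat_lines(open("result.txt").read())`. Lines that do not start with the tag are ignored, so the rest of the benchmark output can stay in the file.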