Add an exercise for muli gpu. Uses the yT*A*x example common in many other exercises.
The Begin/ part of the exercise can be run where the yTAx computation is done with 2 different sets of y,A,x views all on the same device. Then the Solution/ is to split the computation between 2 different devices, hopefully seeing a 2x speedup.
Add an exercise for muli gpu. Uses the
yT*A*x
example common in many other exercises.The
Begin/
part of the exercise can be run where theyTAx
computation is done with 2 different sets of y,A,x views all on the same device. Then theSolution/
is to split the computation between 2 different devices, hopefully seeing a 2x speedup.I ran on weaver (V100) and got