Closed GoogleCodeExporter closed 8 years ago
Original comment by wnbell
on 11 Feb 2011 at 9:32
I haven't been able to figure this out, but I've found some very puzzling
further results, that I'm hoping might help you diagnose the problem.
1) First, I checked to make sure that the operator is giving the same results
on the device as the host.
2) I started commenting out lines in fast_scan.inl, in the scan_intervals
kernel. In particular I commented out the scan_block function and instead just
added
sdata[K][threadIdx.x] = sum;
__syncthreads();
then had each thread just store the updated value passed to it.
i.e. Instead of:
OutputType tmp = sdata[k][threadIdx.x];
sdata[k][threadIdx.x] = binary_op(sum, tmp);
Just save the sum:
sdata[k][threadIdx.x] = sum;
3) I ran that code and looked at the numbers after the first call to
scan_intervals, and they were correct for that point in the code. I then undid
all of the commenting recompiled and ran the code again on the same GPU. It is
now giving the correct results on that input. But if I run the same binary on
one of the other GPUs (Its a S1070) it still gives the same incorrect results
as before.
Any ideas on what I should be looking for here? Have you seen a problem like
this before?
Original comment by scott.ro...@gmail.com
on 14 Feb 2011 at 6:53
Hi Scott,
Thanks for the detailed report! Could you try compiling your code with the
latest development version of Thrust [1] instead of v1.3? I was able to
reproduce the error using Thrust v1.3, but the current development version
(which is slated to become v1.4) checks out OK.
If the development version works for you too I'd wager this was the fix [2].
[1] http://code.google.com/p/thrust/source/checkout
[2]
http://code.google.com/p/thrust/source/detail?spec=svn69b983ffe693ed9c1e808c563e
9936cea55be418&r=0835daf0904c211737e20a9731e19a9cf3c1fad2
Original comment by wnbell
on 14 Feb 2011 at 7:11
Good to go! Guess I'll stick with the development code from now on.
Thanks.
Original comment by scott.ro...@gmail.com
on 14 Feb 2011 at 7:39
Original comment by wnbell
on 14 Feb 2011 at 7:45
Original issue reported on code.google.com by
scott.ro...@gmail.com
on 11 Feb 2011 at 4:42Attachments: