Open GoogleCodeExporter opened 8 years ago
Not to my knowledge. In general, I don't know of any to implement
backsubstitution
efficiently with 1000s of threads on a GPU.
However, if find any references that explain how it can be done, I'd be happy
to look
at them.
Original comment by wnbell
on 26 Feb 2010 at 3:00
Original comment by wnbell
on 26 Mar 2010 at 3:28
The algorithm in Cusparse works reasonably well.
Original comment by paulmull...@gmail.com
on 4 Jan 2012 at 11:44
There is version of the triangular solve method by Li and Saad in one of the
clones. In my opinion it isn't very robust without additional functionality to
perform reordering to reduce the number of sequential stages.
Original comment by sdalton1@gmail.com
on 28 Feb 2012 at 2:13
Original issue reported on code.google.com by
janick.m...@gmail.com
on 19 Feb 2010 at 3:47