Closed anhtienng closed 2 months ago
Thank you for your great work.
In your parallel scan, when Xa.size(2) == 2 or Xa.size(2) == 1, why you skip the up-sweep operation ? (line 64 in file)
Xa.size(2) == 2
Xa.size(2) == 1
Is it related to the fact this isn't a static tree but a representation of the evolution of our tensor in memory in your document ?
this isn't a static tree but a representation of the evolution of our tensor in memory
I found it by myself. So when Xa.size(2) == 2 or Xa.size(2) == 1, we don't need up-sweep to calculate the cumulative sums, they are already done during down-sweep
up-sweep
down-sweep
Thank you for your great work.
In your parallel scan, when
Xa.size(2) == 2
orXa.size(2) == 1
, why you skip the up-sweep operation ? (line 64 in file)Is it related to the fact
this isn't a static tree but a representation of the evolution of our tensor in memory
in your document ?