I am having difficulty understanding how the processor_ function interacts with the cothread and how threadIdx dimensions are used. Is there a thread pool created? Where is the processor function called? How is thread initialized at the start in the processor_()?
I see how the blocks are iterated in the Tile.hpp but how are threadIdx iterated?
I am having difficulty understanding how the processor_ function interacts with the cothread and how threadIdx dimensions are used. Is there a thread pool created? Where is the processor function called? How is thread initialized at the start in the processor_()?
I see how the blocks are iterated in the Tile.hpp but how are threadIdx iterated?