Closed haruhi55 closed 2 days ago
Add a template parameter to represent different Tile types?
A possible solution could be to have GlobalTile
and SharedTile
inherit from a common base class. Currently, since GlobalTile
and SharedTile
exhibit no differences in behavior, the computation results are correct.
I will carefully consider a suitable solution in next modifications.
Since we do not differentiate between
GlobalTile
andSharedTile
, aTileIterator
should be able to work with both types. However, the current implementation is tightly coupled withSharedTile
, which is a bug as shown belowhttps://github.com/TiledTensor/TiledCUDA/blob/cb7a3361f70fb1edce2c9f858705629bc8a0f305/include/types/tile_iterator.hpp#L73