Refactor the implementation of loading a shared memory tile into registers.
xToXLoader/XToXStorer: These are the highest-level interfaces that expose only very logical concepts, such as Row-major or Column-major Tiles on a specific memory hierarchy.
xToXLoaderImpl/XToXStorerImpl: These implement tile transfer between memory hierarchies. They are specialized based on the specific instruction used and the source and/or destination layout.
xxStoreBase/xxLoadBase: These are thin wrappers for specific instructions to transfer a single BaseTile.
[x] Move the instruction-related implementation into copy_atom.hpp.
[x] Fix load.
[x] Fix store.
[x] Enhance implementations based on the updated RegTile definition.
Refactor the implementation of loading a shared memory tile into registers.
xToXLoader/XToXStorer
: These are the highest-level interfaces that expose only very logical concepts, such as Row-major or Column-major Tiles on a specific memory hierarchy.xToXLoaderImpl/XToXStorerImpl
: These implement tile transfer between memory hierarchies. They are specialized based on the specific instruction used and the source and/or destination layout.xxStoreBase/xxLoadBase
: These are thin wrappers for specific instructions to transfer a singleBaseTile
.copy_atom.hpp
.