The neureka implementation here relies on TP_IN=32 input channels and is compatible with the highest bandwidth offered by the HCI which is only possible in the aligned access. Thus, force in the stimuli generation to store the weight in the aligned manner. __attribute__ ((aligned(4)))
The neureka implementation here relies on TP_IN=32 input channels and is compatible with the highest bandwidth offered by the HCI which is only possible in the aligned access. Thus, force in the stimuli generation to store the weight in the aligned manner.
__attribute__ ((aligned(4)))