Open sergisiso opened 3 years ago
The OpenCL implementation is significantly slower on the Xilinx FPGA than on a single CPU. The main reason seems to be a serialized interface that accesses memory through a single memory bank.