tensil-ai / tensil

Open source machine learning accelerators
https://www.tensil.ai
Other
356 stars 29 forks source link

Support passing data between layers via local memory #74

Closed petrohi closed 2 years ago

petrohi commented 2 years ago

This PR introduces support for selecting compiler strategies between current (default) and three new strategies using --strategy command line option.

Following are performance numbers for ResNet20/CIFAR on ZCU104, both emulated and on FPGA:

Strategy Emulator* cycles (K) FPGA latency (ms) Minimum required local memory (kV)
local-isolated 292 5.878 2**
local-vars 236 4.663 8
local-consts 273 4.32 26
local-vars-and-consts 217 3.407 26

(*) Emulator is set to estimate DRAM latency as 1 cycle per vector; (**) This is bounded by the largest root in the model (conv2d_16 in layer 21);

PR includes following smaller changes:

shortcut-integration[bot] commented 2 years ago

This pull request has been linked to Shortcut Story #474: Support passing data between layers via local memory.