Hi, please refer to make_lat_lut_example.py. It is an example script to make latency lookup tables. Due to the fine-grained width search, we enumerate all possible width choices, which is time consuming especially for CPUs. You can also enumerate width choices every 8 steps and make interpolation for other choices as a rough approximation, referring to the comments in the script.
Note, before running the script, please set the number of threads for OpenMP and MKL to 1 and 1, respectively.
Hi, please refer to make_lat_lut_example.py. It is an example script to make latency lookup tables. Due to the fine-grained width search, we enumerate all possible width choices, which is time consuming especially for CPUs. You can also enumerate width choices every 8 steps and make interpolation for other choices as a rough approximation, referring to the comments in the script.
Note, before running the script, please set the number of threads for OpenMP and MKL to 1 and 1, respectively.