Closed ik-ids closed 1 year ago
Hi @ik-ids it is expected behavior to use 100% of the CPU provided to DeepSparse. You can control this by setting the num_cores
parameters to restrict how many CPU cores the engine will use, and for the entire process using the numactl
command i.e. numactl -C0-1 python script.py
to run on just the first two cores.
To answer your other questions:
benchmark_result
dictionary so it isn't documented - I'll make a task to do this. median
is the median latency, mean
is the average latency, std
is the standard deviation between latencies, and the percentages are the percentile latencies such that 50.0%
== median
and 99.9%
is the longest latency in 99.9% of all inferencesHi @mgoin, thanks for the detailed explanation. Much appreciated.
Describe the bug
DeepSparse uses 100% of the CPU. Is this expected behaviour?
How safe is this from the infrastructure point of view? Can we cap it to, say, 90%?
We got the below results.
Expected behavior
Environment Linode The test machine is 2 CPU Cores, 4 GB RAM, Ubuntu 20.04 LTS, AMD EPYC 7713 64-Core Processor.
To Reproduce Exact steps to reproduce the behavior:
Running benchmark:
Errors
Additional context P.S: search in the documentation would definitely save a lot of time.