Open goelayu opened 9 months ago
The mii inferencing benchmark script computes throughput as num_clients/latency. Shouldn't this be num_queries/latency?
num_queries/latency
Also why use P95 latency and not the total time it took to process all the requests, for the purposes of computing throughput?
The mii inferencing benchmark script computes throughput as num_clients/latency. Shouldn't this be
num_queries/latency
?Also why use P95 latency and not the total time it took to process all the requests, for the purposes of computing throughput?