when i am executing token_benchmark_ray.py we are getting below error
File "token_benchmark_ray.py", line 456, in
run_token_benchmark(
File "token_benchmark_ray.py", line 297, in run_token_benchmark
summary, individual_responses = get_token_throughput_latencies(
File "token_benchmark_ray.py", line 111, in get_token_throughput_latencies
request_metrics[common_metrics.INTER_TOKEN_LAT] /= num_output_tokens
TypeError: unsupported operand type(s) for /=: 'list' and 'int'
(SageMakerClient pid=15473) Warning Or Error: 'SageMakerRuntime' object has no attribute 'invoke_endpoint_with_response_stream'
(SageMakerClient pid=15473) None
when i am executing token_benchmark_ray.py we are getting below error File "token_benchmark_ray.py", line 456, in
run_token_benchmark(
File "token_benchmark_ray.py", line 297, in run_token_benchmark
summary, individual_responses = get_token_throughput_latencies(
File "token_benchmark_ray.py", line 111, in get_token_throughput_latencies
request_metrics[common_metrics.INTER_TOKEN_LAT] /= num_output_tokens
TypeError: unsupported operand type(s) for /=: 'list' and 'int'
(SageMakerClient pid=15473) Warning Or Error: 'SageMakerRuntime' object has no attribute 'invoke_endpoint_with_response_stream'
(SageMakerClient pid=15473) None