Open madhurprash opened 1 week ago
To add time to first token (TTFT), time per output token (TPOT) and time to last token (TTLT). This would be done using the streaming API that is now supported for SageMaker and Bedrock, and maybe EC2 as well over time.
To add time to first token (TTFT), time per output token (TPOT) and time to last token (TTLT). This would be done using the streaming API that is now supported for SageMaker and Bedrock, and maybe EC2 as well over time.