Closed yjymickey closed 10 months ago
if i use 750 for Maximum prompt token count, the input size must less than 75 word.So how can I solve the problem.are there any other ways to use tensorrt with 500-750 word or 7500 token
if i use 750 for Maximum prompt token count, the input size must less than 75 word.So how can I solve the problem.are there any other ways to use tensorrt with 500-750 word or 7500 token