Open Mercury1105 opened 2 years ago
Is there any record about various model and different decode length inference time for reference?
Is there any record about various model and different decode length inference time for reference?