Closed mmcdermott closed 1 month ago
If the user sets a certain max seq length, no more than that many tokens should be passed to the GPU.
If the user sets a certain max seq length, no more than that many tokens should be passed to the GPU.