Open inkinworld opened 3 months ago
@inkinworld Hi, have you tried the latest main branch to see whether the issue still exists? Thanks, June
This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 15 days.
System Info
Who can help?
No response
Information
Tasks
examples folder (such as GLUE/SQuAD, ...)
Reproduction
Load a Llama model, then send requests.
An error occurs:
CUDA Error: CUDA_ERROR_INVALID_VALUE
batch_manager
Expected behavior
Requests are processed without errors.
actual behavior
An exception is thrown and the request can't be processed.
additional notes