Open leizhao1234 opened 3 months ago
Hi @leizhao1234 👋
Could you describe a way to reproduce the error? E.g.
CUDA_VISIBLE_DEVICES="3" text-generation-launcher --model-id ./cogvlm_exp3_03_2001/ --num-shard 1 --port 8083 --max-concurrent-requests 409600 --max-input-length 8190 --max-total-tokens 8192 --max-batch-prefill-tokens 8192 --trust-remote-code --max-waiting-tokens 2 --waiting-served-ratio 0.2 --block-size 32 --cuda-memory-fraction 0.97
and here is another problem:
Thank you! And you are you using the docker container or have you build this from soure?
I build from soure, and add the support for cogvlm.
I have encountered the problem mentioned in the title. Could someone help me understand what is going on and how to resolve it? Any assistance would be greatly appreciated.