Open Nyquist0 opened 1 month ago
Please check your onnx version, the inference step needs onnxruntime Does the inference script work?
I am directly training. Let me check the inference script. And the onnx version is completely aligned with yours in requirements.txt
Hi @xumingw Inference works well. Is it possible the onnx version you provided is compatible with Ampere architecture, but not with Ada architecture..? Any suggestions?
Dear Sir or Madam,
I met the following error that keeps interrupting my training process. This happened after 1000 steps and is a
ONNXRuntimeError
error Could you help to check if there is anything wrong?Environment:
commands:
CUDA_VISIBLE_DEVICES=1 accelerate launch -m --config_file accelerate_config.yaml --machine_rank 0 --main_process_ip 0.0.0.0 --main_process_port 20055 --num_machines 1 --num_processes 1 scripts.train_stage1 --config ./configs/train/stage1.yaml
error:
Looking forward your reply. Thanks.