Open DA21S321D opened 2 months ago
You are using the action-value network of Advanced-Colight trained in Hangzhou
to collect the policy refinement data for fine-tuning LightGPT for Jinan
. Please either train a new action-value network in Jinan
or collect the refinement data for Hangzhou
.
python run_policy_refinement_data_collection.py --llm_model dataCollect7BFT --llm_path /root/autodl-tmp/HuggingFace-Download-Accelerator/modelDownload/merged --dataset jinan --traffic_file anon_3_4_jinan_real.json