Open Pisces032 opened 3 months ago
Thanks for your interest in our project, we've updated the guidance for preparing Colossal AI training data. Please refer to Prepare the data.
Additionally, we recommend using LLaMA Factory to train the llm4decompile model, as it is more user-friendly. For more details, please visit LLaMA-Factory.
Thank you!
I'm trying to use PEFT to improve the model. I wonder how
AnghaBench_compile.jsonl
is used to train. i noticeddeclare -a dataset=( "path_to_llm4decompile_data/arrow/part-00000" )
inrun_llm4decompile_train.sh
, but i can't make out the training process. Maybecolossalai
format hides some details about the model or the training process? Thank you so much!