DachengLi1 / LongChat

Official repository for LongChat and LongEval
Apache License 2.0

What is the purpose of the pretrain script? #17

Closed by fahadh4ilyas 1 year ago

fahadh4ilyas commented 1 year ago

Isn't LongChat a fine-tune of the Llama model, with a script that tweaks the rotary embedding layer? Or do you have to do pretraining first, before fine-tuning on the ShareGPT data? I ask because a pretraining script exists in the repository, which makes me assume we have to pretrain the Llama model first before fine-tuning.

DachengLi1 commented 1 year ago

@fahadh4ilyas Thanks for the question! The pre-training script was not used for LongChat. It is for people who wish to do some pre-training experiments.

fahadh4ilyas commented 1 year ago

Thank you for the clarification...