DAMO-NLP-SG / CLEX

[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models
MIT License
73 stars 11 forks source link

Parameters for training #8

Open YL-9 opened 6 months ago

YL-9 commented 6 months ago

Could you please tell me what the parameters for training each model in train_lm.sh are? Thank you!

guanzhchen commented 1 week ago

Hi please kindly refer to our paper in Sec A.2. Sorry for the late reply since there is no reminder from GitHub for me.