About pretrain code - Githubissues

AILab-CVC / SEED-X

Multimodal Models in Real World

Other

392 stars 16 forks source link

Closed HalvesChen closed 5 months ago

HalvesChen commented 5 months ago

Hi, thanks for your greate work. I would like to know if you will release pre training code？

geyuying commented 5 months ago

Hi, the pretraining code is the same as the instruction tuning execpt for:

Change the pretrained model of "llm_seed_x.yaml" to llama2 pre-trained model;
Change "target: src.models.mllm.seed_x.ContinuousLVLM.from_pretrained" in "agent_seed_x.yaml" to "target: src.models.mllm.seed_x.ContinuousLVLM" and remove "pretrained_model_path"
Remove the instruction prompt in the data config for pre-training.

geyuying commented 5 months ago

The only differences between pre-training and instruction tuning are the pre-trained model and the data loaders.

HalvesChen commented 5 months ago

Thanks~