InternEvo is an open-source, lightweight training framework that aims to support model pre-training without the need for extensive dependencies.
feat(usability): Refine model inject helper to support huggingface models #331
Closed
zigzagcai closed 2 months ago
Motivation
Improve usability:
- `pack_sample_into_one=True` mode for the `streaming` dataloader. (Completed)
- `model_inject_helper` to support more general modeling files (such as Hugging Face) without a single line of code change. (Completed)
- `model_inject_helper` to support pipeline parallel mode.
- `MockedDataset`. (Completed)

Related repo:
https://github.com/InternLM/InternEvo-HFModels
Examples:
[x] Huggingface InternLM1-7B
[x] Huggingface InternLM2-7B
[x] Huggingface Yi-6B
[x] Huggingface LLaMA2-7B
[x] Huggingface Baichuan2-7B
[x] Huggingface Qwen2-7B
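
To illustrate what supporting a modeling file "without a single line of code change" can look like, here is a minimal sketch of module injection by tree traversal. It is not InternEvo's actual `model_inject_helper`: every class and function below (`Module`, `Linear`, `ParallelLinear`, `Attention`, `Block`, `inject`) is a hypothetical stand-in written in plain Python, and the "parallel" layer only shrinks the output dimension, which is a simplification of real tensor parallelism.

```python
# Hypothetical sketch: these classes are stand-ins, not InternEvo or Hugging Face APIs.

class Module:
    """Minimal stand-in for a framework module that owns named child modules."""

    def children(self):
        # Snapshot of the child modules, so attributes can be replaced while iterating.
        return {k: v for k, v in vars(self).items() if isinstance(v, Module)}


class Linear(Module):
    def __init__(self, in_features, out_features):
        self.in_features = in_features
        self.out_features = out_features


class ParallelLinear(Linear):
    """Stand-in for a sharded layer: each rank keeps 1/world_size of the outputs."""

    def __init__(self, in_features, out_features, world_size):
        super().__init__(in_features, out_features // world_size)
        self.world_size = world_size


class Attention(Module):
    def __init__(self, hidden):
        self.q_proj = Linear(hidden, hidden)
        self.o_proj = Linear(hidden, hidden)


class Block(Module):
    """Plays the role of an unmodified third-party modeling file."""

    def __init__(self, hidden):
        self.attn = Attention(hidden)
        self.mlp = Linear(hidden, 4 * hidden)


def inject(module, world_size):
    """Recursively swap every plain Linear for a ParallelLinear, in place.

    Returns the number of modules that were replaced.
    """
    replaced = 0
    for name, child in module.children().items():
        if type(child) is Linear:
            new_child = ParallelLinear(child.in_features, child.out_features, world_size)
            setattr(module, name, new_child)
            replaced += 1
        else:
            replaced += inject(child, world_size)
    return replaced


block = Block(hidden=64)
count = inject(block, world_size=4)
print(count, type(block.attn.q_proj).__name__, block.mlp.out_features)
```

The point of the design is that `Block` and `Attention` are never edited. The helper only inspects the instantiated module tree and replaces matching children, so the same approach can in principle be applied to any modeling file that builds its layers from recognizable module types.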
Modification
internlm/train/pipeline.py
BC-breaking (Optional)
None
Use cases (Optional)
None
Checklist
Before PR:
After PR: