InternLM / InternEvo

InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.
https://internevo.readthedocs.io/zh-cn/latest/?badge=latest
Apache License 2.0
285 stars 47 forks source link

feat(dataloader): Implement megatron dataloader and mocked dataloader #323

Closed zigzagcai closed 2 weeks ago

zigzagcai commented 2 weeks ago

This PR's main functionality is okay and runnable, but still need some refinement.

Motivation

Modification

internlm/data/megatron/* internlm/data/mocked/*

BC-breaking (Optional)

None

Use cases (Optional)

None

Checklist

Before PR:

After PR:

sunpengsdu commented 2 weeks ago

LGTM