hpcaitech / ColossalAI-Examples

Examples of training models with hybrid parallelism using ColossalAI
Apache License 2.0
334 stars 102 forks source link

The directory content of gpt_2 and gpt_3 are the same. #26

Closed dujiangsu closed 2 years ago

dujiangsu commented 2 years ago

As in the title, it is not essential to take two dirs for gpt-2 and -3 respectively. Besides, train_gpt.py is for transformers.GPT2LMHeadModel, not the Colossal version.

FrankLeeeee commented 2 years ago

Hi @Gy-Lu , can you take care of this issue as well?

FrankLeeeee commented 2 years ago

Fixed by #28