AFDWang/Hetu-Galvatron
Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs). If you are interested, please visit, star, or fork https://github.com/PKU-DAIR/Hetu-Galvatron
Galvatron version update.
#3
Closed
Fizzmy closed 3 months ago
Fizzmy commented 3 months ago
Update search engine.
Update Megatron version (bcce6f).
Add llama_hf; modify llama_fa.
Support Megatron-SP, adapted for gpt_hf and llama_hf.
Support vocab parallel, adapted for gpt_hf.