hy5468 / TransLLM

Why Not Transform Chat Large Language Models to Non-English?
Apache License 2.0
1 stars 0 forks source link

TransLLM: Why Not Transform Chat Large Language Models to Non-English?

made-with-python Active Ask Me Anything !

TransLLM is implemented based on the Chinese-LLaMA-Alpaca-2 project.

Data

We provide the following data:

Traning

Model Extension

Use SentencePiece to learn the Thai vocabulary on mc4-TH. Merege the vocabulary as described in Chinese-LLaMA-Alpaca-2.

Target Language Pre-Training

Translation Pre-Training

Transfer Fine-Tuning

Evluation

We provide the following scripts for evaluation

Notice

We have modified some files in ./Chinese-LLaMA-Alpaca-2/scripts/training.

License

The code and data is released under Apache License 2.0.

Citation

Please cite as:

@misc{geng2024TransLLM,
      title={Why Not Transform Chat Large Language Models to Non-English?}, 
      author={Xiang Geng and Ming Zhu and Jiahuan Li and Zhejian Lai and Wei Zou and Shuaijie She and Jiaxin Guo and Xiaofeng Zhao and Yinglu Li and Yuang Li and Chang Su and Yanqing Zhao and Min Zhang and Hao Yang and Xinglin Lyu and Jiajun Chen and Shujian Huang},
      year={2024},
      eprint={2405.13923},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}