PaddlePaddle / X2Paddle

Deep learning model converter for PaddlePaddle (飞桨).

http://www.paddlepaddle.org/

Apache License 2.0

740 stars 165 forks

[PaddleV3] Add the large model `GPT2` from `pytorch` #1085

Closed megemini closed 1 week ago

megemini commented 4 weeks ago

Create A Good Pull Request

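Adds the large model GPT2 from pytorch.

A new gpt2 folder needs to be created under PyTorch/dataset/ in the dataset, with the following directory structure:

Within it, the checkpoints directory holds the model files: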

config.json
pytorch_model.bin

The model files need to be downloaded from https://hf-mirror.com/openai-community/gpt2/tree/main.
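For anyone fetching the two checkpoint files by script rather than through the browser, a minimal sketch follows. It assumes that hf-mirror.com serves raw files under the same `<repo>/resolve/<revision>/<filename>` pattern as the Hugging Face Hub; the helper names `file_url` and `download_checkpoints` are hypothetical and not part of X2Paddle.

```python
import urllib.request
from pathlib import Path

# Assumption: hf-mirror.com serves raw files under the same
# "<repo>/resolve/<revision>/<filename>" pattern as the Hugging Face Hub.
MIRROR = "https://hf-mirror.com"
REPO = "openai-community/gpt2"
FILES = ("config.json", "pytorch_model.bin")  # the two files listed in this PR


def file_url(filename, repo=REPO, revision="main", mirror=MIRROR):
    """Build the direct-download URL for one file in the repository."""
    return f"{mirror}/{repo}/resolve/{revision}/{filename}"


def download_checkpoints(target_dir):
    """Download every file in FILES into target_dir, skipping existing ones."""
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    for name in FILES:
        dest = target / name
        if dest.exists():
            continue  # already downloaded; do not fetch again
        urllib.request.urlretrieve(file_url(name), str(dest))
    return [target / name for name in FILES]
```

Calling `download_checkpoints("gpt2/checkpoints")` would place both files in a local checkpoints directory; the target path here is only an example.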

In addition, the input and output files:

pytorch_input.pkl
pytorch_output.pkl

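The output was obtained by running pytorch, and the input is the converted form of Hello world!. Both files are available from the following shared drive: https://pan.baidu.com/s/1YE4NF1F13EiqQGZ2xBzacg. Please send a private message for the extraction code.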
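Since the internal layout of the two .pkl files is not described in this thread, it can help to inspect them after downloading. The sketch below is one possible way to do that; `describe` and `describe_pickle` are hypothetical helpers, and the code makes no assumption about what the files contain beyond their being loadable with `pickle`. Note that `pickle.load` can execute arbitrary code, so it should only be used on files from a trusted source, and loading pickled torch tensors requires torch to be installed.

```python
import pickle


def describe(obj, depth=0, max_depth=3):
    """Return a short, indented text summary of a nested Python object."""
    pad = "  " * depth
    shape = getattr(obj, "shape", None)
    if shape is not None:
        # numpy arrays and torch tensors both expose .shape and .dtype
        dtype = getattr(obj, "dtype", "?")
        return f"{pad}{type(obj).__name__} shape={tuple(shape)} dtype={dtype}"
    if isinstance(obj, dict) and depth < max_depth:
        lines = [f"{pad}dict with {len(obj)} keys"]
        for key, value in obj.items():
            lines.append(f"{pad}  [{key!r}]")
            lines.append(describe(value, depth + 2, max_depth))
        return "\n".join(lines)
    if isinstance(obj, (list, tuple)) and depth < max_depth:
        lines = [f"{pad}{type(obj).__name__} of length {len(obj)}"]
        lines.extend(describe(item, depth + 1, max_depth) for item in obj)
        return "\n".join(lines)
    return f"{pad}{type(obj).__name__}"


def describe_pickle(path):
    """Load a pickle file (trusted sources only) and summarize its content."""
    with open(path, "rb") as handle:
        return describe(pickle.load(handle))
```

For example, `print(describe_pickle("pytorch_input.pkl"))` prints the container types and any array shapes found in the input file.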

Once the model files have been uploaded to the server, CI can be re-run.
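Before uploading, a quick check that all four files named in this PR are present can save a failed CI run. The sketch below is only illustrative: the directory listing did not survive in this thread, so the placement of the .pkl files directly inside gpt2/ is an assumption, and `missing_files` is a hypothetical helper.

```python
from pathlib import Path

# Assumption: the .pkl files sit directly inside gpt2/, next to checkpoints/.
# The directory listing from the PR is not available here, so adjust these
# relative paths to match the real layout.
EXPECTED = (
    "checkpoints/config.json",
    "checkpoints/pytorch_model.bin",
    "pytorch_input.pkl",
    "pytorch_output.pkl",
)


def missing_files(model_dir, expected=EXPECTED):
    """Return the expected relative paths that are absent or empty."""
    root = Path(model_dir)
    missing = []
    for rel in expected:
        path = root / rel
        if not path.is_file() or path.stat().st_size == 0:
            missing.append(rel)
    return missing
```

An empty list from `missing_files("PyTorch/dataset/gpt2")` means every expected file exists and is non-empty.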

The following are the local verification results:

> sh run_convert.sh

> sh run_benchmark.sh

> python deploy_infer.py
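The three commands above can also be driven from one script that stops at the first failing step, which makes it easy to report which stage broke. This is a sketch only; `run_steps` is a hypothetical helper, and the commands are assumed to be run from the directory that contains the scripts.

```python
import subprocess

# The three commands come from this comment; they are assumed to be run
# from the directory that contains the scripts.
STEPS = (
    ["sh", "run_convert.sh"],
    ["sh", "run_benchmark.sh"],
    ["python", "deploy_infer.py"],
)


def run_steps(steps=STEPS, cwd=None):
    """Run each command in order and stop at the first non-zero exit code.

    Returns a list of (command, returncode) pairs for the steps that ran.
    """
    results = []
    for cmd in steps:
        code = subprocess.run(cmd, cwd=cwd).returncode
        results.append((cmd, code))
        if code != 0:
            break  # later steps depend on earlier ones, so stop here
    return results
```

If the last pair returned by `run_steps()` has a non-zero return code, that command is the step that failed.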

@luotao1

luotao1 commented 1 week ago

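Local verification (it looks like the GPU in my local environment is not configured correctly, so this is a CPU environment):

sh run_convert.sh passes; sh run_benchmark.sh and python deploy_infer.py run into the following environment problem:

Let me run CI.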