alibaba / x-deeplearning

An industrial deep learning framework for high-dimension sparse data
Apache License 2.0
4.24k stars 1.03k forks source link

预测时出错:Checkpoint Not Found #319

Open 1980695671 opened 4 years ago

1980695671 commented 4 years ago

我在config.json里设置ckpt的value为checkpoints的目录/checkpoints,is_training设置为false, 为什么不对呢?

ustcdane commented 4 years ago

config.train.json checkpoint输出目录: "checkpoint": { "output_dir": "XX/checkpoint" },

查看checkpoint输出目录 XX/checkpoint/ 如有 ckpt-..............XXX 修改 data/tdm.json "saver_ckpt": "ckpt-..............XXX"

1980695671 commented 4 years ago

config.train.json checkpoint输出目录: "checkpoint": { "output_dir": "XX/checkpoint" },

查看checkpoint输出目录 XX/checkpoint/ 如有 ckpt-..............XXX 修改 data/tdm.json "saver_ckpt": "ckpt-..............XXX"

太感谢啦,我试试

1980695671 commented 4 years ago

config.train.json checkpoint输出目录: "checkpoint": { "output_dir": "XX/checkpoint" },

查看checkpoint输出目录 XX/checkpoint/ 如有 ckpt-..............XXX 修改 data/tdm.json "saver_ckpt": "ckpt-..............XXX"

Net Found For Name: xdl_global_step.这次又报这种错误了,我的config是: { "checkpoint": { "saver_ckpt": "/home/yuan.jin/ESMM/script/ckpt/ckpt-................1500" }, "files": ["../data/build/output_prefix.1"], "is_training": false } 命令是:python esmm.py --run_mode=local --task_index=0 --config=config.json --task_type test 请问哪里不对了呢,,

jeffzhengye commented 3 years ago

同样问题,没人维护啊。 阿里玩票的太多