Open dongdaoking opened 2 years ago
I need to clarify two questions:
You can try running inference after training this way: https://github.com/bytedance/lightseq/tree/master/examples/inference/python
Hi, I checked the training log.
As I described above, why can't I train directly on the master branch?
The master branch works fine: https://github.com/bytedance/lightseq/blob/master/examples/training/fairseq/ls_fairseq_wmt14en2de.sh
Can you give us some details about how you fixed the "No module named 'fairseq_user_dir_13687.ls_fs_transformer_decoder_layer'" error?
Hi, thanks for your reply.
I ran this command:
cp lightseq/training/cli/fs_modules/ls_fs_transformer_decoder_layer.py lightseq/training/ops/pytorch/
and then pointed the import to the new path:
diff --git a/lightseq/training/cli/fs_modules/ls_transformer.py b/lightseq/training/cli/fs_modules/ls_transformer.py
index a6832ed..015f2fa 100644
--- a/lightseq/training/cli/fs_modules/ls_transformer.py
+++ b/lightseq/training/cli/fs_modules/ls_transformer.py
@@ -406,7 +406,7 @@ class LSTransformerDecoder(FairseqIncrementalDecoder):
                 TransformerDecoderLayer,
             )
         else:
-            from .ls_fs_transformer_decoder_layer import (
+            from lightseq.training.ops.pytorch.ls_fs_transformer_decoder_layer import (
                 LSFSTransformerDecoderLayer as TransformerDecoderLayer,
             )
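To make the workaround above easier to verify, here is a minimal sketch that checks whether a module path can be located by Python's import system before training starts. The helper name `module_available` is hypothetical and not part of LightSeq or fairseq. The two dotted paths are taken from the `cp` command and the diff above. Note that `importlib.util.find_spec` imports the parent packages of a dotted name, so this also exercises the package's `__init__` files.

```python
import importlib.util


def module_available(name: str) -> bool:
    """Return True if `name` can be located by the import system."""
    try:
        return importlib.util.find_spec(name) is not None
    except (ImportError, ValueError):
        # A missing parent package raises ModuleNotFoundError (an ImportError);
        # ValueError covers already-imported modules whose __spec__ is unset.
        return False


if __name__ == "__main__":
    # Module paths taken from the cp command and the diff in this thread.
    for name in (
        "lightseq.training.ops.pytorch.ls_fs_transformer_decoder_layer",
        "lightseq.training.cli.fs_modules.ls_fs_transformer_decoder_layer",
    ):
        print(name, "->", "found" if module_available(name) else "missing")
```

If the second path reports "found" while training still fails with the `fairseq_user_dir_...` error, that would suggest the file exists but is not being picked up through the user-dir package that fairseq creates; this is an assumption to check, not a confirmed diagnosis.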
Oh, I want to make sure our environments are the same. Can you provide a base Docker image? My current environment:
Base Docker image: nvidia/cuda:11.6.0-cudnn8-devel-ubuntu20.04
PyTorch 1.8.0 (compiled from source)
CMake 3.20 (compiled from source)
protobuf and HDF5: installed following https://github.com/bytedance/lightseq/blob/master/docs/inference/build.md
git clone --recursive https://github.com/bytedance/lightseq.git
With this setup I can run LightSeq, but I still hit the error.
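When comparing environments like the one listed above, it can help to dump the installed package versions in one place. The following is a small sketch using only the standard library (Python 3.8+). The helper name `collect_versions` is hypothetical, and the distribution names `torch`, `fairseq`, and `lightseq` are assumed to match how the packages were installed; a source build may register under a different name or version string.

```python
from importlib import metadata
import platform


def collect_versions(packages):
    """Map each distribution name to its installed version, or None if absent."""
    versions = {}
    for pkg in packages:
        try:
            versions[pkg] = metadata.version(pkg)
        except metadata.PackageNotFoundError:
            # The distribution is not registered in this environment.
            versions[pkg] = None
    return versions


if __name__ == "__main__":
    print("python", platform.python_version())
    # Distribution names are assumptions; adjust to your installation.
    for pkg, ver in collect_versions(["torch", "fairseq", "lightseq"]).items():
        print(pkg, ver if ver is not None else "not installed")
```

Posting this output from both machines makes version mismatches easier to spot than describing the setup by hand.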
Hi, I want to try some new features in LightSeq, so I followed the instructions here to compile from source on the master branch. But when I train and run inference following the examples, it doesn't work. During training, something seems to be wrong in ls_transformer.py.
I tried to fix the problem, and training now runs. But when I run inference with the resulting checkpoint, I get BLEU=0, and it seems LightSeq doesn't work.
So here are my questions: