bytedance / lightseq

LightSeq: A High Performance Library for Sequence Processing and Generation

Export the weights in a normal PyTorch state_dict #180

Open butterluo opened 3 years ago

butterluo commented 3 years ago

Can LightSeq export the weights as a normal PyTorch state_dict after training with LightSeq? That way Hugging Face Transformers or other PyTorch libraries could load and use them later.

godweiyang commented 3 years ago
  1. If you train without LightSeq and fine-tune using LightSeq, use the pre-trained parameters to initialize the LightSeq layers.
  2. If you train with LightSeq and fine-tune using LightSeq, directly save the state_dict and load it for further fine-tuning or inference without any modification (see the sketch after this list).
  3. If you train with LightSeq and want to export for inference using the LightSeq inference library, refer to examples/inference/python/export/ls_fs_transformer_export.py. We provide an example of training with Fairseq+LightSeq, exporting to pb format, and then using the LightSeq inference library to predict.
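
For point 2, nothing LightSeq-specific is needed: LightSeq training layers are ordinary torch.nn.Modules, so standard PyTorch checkpointing applies. A minimal sketch (the nn.Linear is a stand-in for any model containing LightSeq layers):

```python
import torch
import torch.nn as nn

# Placeholder model: in practice this is your model containing LightSeq
# layers, which are regular torch.nn.Modules.
model = nn.Linear(4, 4)

# After training with LightSeq: save as usual.
torch.save(model.state_dict(), "ls_model.pt")

# Before further fine-tuning or inference with LightSeq: load as usual.
model.load_state_dict(torch.load("ls_model.pt"))
```
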
butterluo commented 3 years ago

So LightSeq doesn't support "training with LightSeq, then fine-tuning or inference with PyTorch or Hugging Face"? Is there any plan to support it?

Taka152 commented 3 years ago

It can be done by changing the LightSeq layer back to a PyTorch or Hugging Face layer, just like the way you changed the PyTorch layer to a LightSeq layer.
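
A hedged sketch of that reverse direction: in the LightSeq versions I have seen, a training layer keeps all of its parameters in a single flat tensor, and the packing order is version-specific. The (weight, bias) interleaving assumed below must be checked against the LightSeq source before use; the helper itself is plain PyTorch:

```python
import torch
import torch.nn as nn

def unflatten_into(flat, modules):
    """Copy slices of a flat parameter tensor back into each module's
    weight and bias. Assumes the flat tensor was packed in the order
    (w0, b0, w1, b1, ...); verify this against your LightSeq version,
    and pass `modules` in the same order used to build the LightSeq layer."""
    offset = 0
    for m in modules:
        for p in (m.weight, m.bias):
            n = p.numel()
            p.data.copy_(flat[offset:offset + n].view_as(p))
            offset += n

# Toy usage: round-trip two linear layers through one flat tensor.
layers = [nn.Linear(3, 3), nn.Linear(3, 2)]
flat = torch.cat([t.detach().reshape(-1)
                  for m in layers for t in (m.weight, m.bias)])
unflatten_into(flat, layers)
```
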

daeing commented 2 years ago

For reply 1 (initializing LightSeq layers from pre-trained parameters), how do I load the pre-trained parameters into LightSeq? Are there any samples? Thanks.

Taka152 commented 2 years ago

@daeing You can check the Hugging Face BERT example.
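
For anyone landing here, that example lives under examples/training/huggingface in this repo. A condensed sketch of my reading of it follows; the get_config arguments and the weight ordering are assumptions that may differ across LightSeq versions, so treat the example script as the source of truth:

```python
import torch
from transformers import BertModel
from lightseq.training import LSTransformerEncoderLayer

model = BertModel.from_pretrained("bert-base-uncased")

def get_hf_bert_layer_params(layer):
    """Collect one BertLayer's weights/biases in the order LightSeq
    expects (ordering taken from the example; verify for your version)."""
    init_ws, init_bs = [], []
    for module in (
        layer.attention.self.query,
        layer.attention.self.key,
        layer.attention.self.value,
        layer.attention.output.dense,
        layer.attention.output.LayerNorm,
        layer.intermediate.dense,
        layer.output.dense,
        layer.output.LayerNorm,
    ):
        init_ws.append(module.weight.detach().clone())
        init_bs.append(module.bias.detach().clone())
    return init_ws, init_bs

config = LSTransformerEncoderLayer.get_config(
    max_batch_tokens=4096,
    max_seq_len=512,
    hidden_size=768,
    intermediate_size=3072,
    nhead=12,
    attn_prob_dropout_ratio=0.1,
    activation_dropout_ratio=0.1,
    hidden_dropout_ratio=0.1,
    pre_layer_norm=False,   # BERT uses post-LayerNorm
    activation_fn="gelu",
    fp16=False,             # set True for mixed-precision training
    local_rank=0,
)

# Swap each Hugging Face encoder layer for a LightSeq layer initialized
# from the pre-trained parameters.
for i, hf_layer in enumerate(model.encoder.layer):
    init_ws, init_bs = get_hf_bert_layer_params(hf_layer)
    model.encoder.layer[i] = LSTransformerEncoderLayer(
        config, init_ws, init_bs
    ).cuda()
```

Note that the real example additionally wraps the LightSeq layer to adapt its forward signature to what BertEncoder passes in, so check the script before copying this.
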

daeing commented 2 years ago

Sorry, I have a pre-trained PyTorch transformer decoder layer and I want to use LightSeq to accelerate inference, but I don't know how to load the pre-trained parameters into the LightSeq decoder layer. Do you mean I should use Hugging Face to create the transformer decoder layer, load the pre-trained parameters, and then convert the Hugging Face layer to LightSeq?

Taka152 commented 2 years ago

You can check the inference example and modify the export script for your pretrained model.
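
Once the export script has produced a pb file, inference looks roughly like this (the file name and token ids below are placeholders; see the inference examples in the repo):

```python
import lightseq.inference as lsi

# Load the exported model; the second argument is the max batch size.
model = lsi.Transformer("transformer.pb", 8)

# Token ids are placeholders; use your own tokenizer's output.
output = model.infer([[63, 47, 65, 1507, 88, 74, 10, 2057, 362, 9, 284, 6]])
print(output)
```
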

daeing commented 2 years ago

ok, many thanks