intel / xFasterTransformer

Apache License 2.0
322 stars 56 forks source link

[Model/Layer] New forward to support CB (CommonDecoder->DecoderBlock->DecoderLayer->Attention/MLP) #375

Closed pujiang2018 closed 2 months ago

pujiang2018 commented 2 months ago

@abenmao @Duyi-Wang pls help to review.