Apply ctranslate2 to KNN-MT

OpenNMT / CTranslate2

Fast inference engine for Transformer models

MIT License

3.4k stars 302 forks source link

Hi! I want to apply ctranslate2 to KNN-MT (There are some pytorch implementations, knn-box, and sockeye for example). Is there a corresponding interface to get the output hidden state of the model in order to do vector retrieval? In addition, since KNN-MT needs to do vector retrieval for each decoding step, it needs to be decoded word by word, while currently ctranslate2 only provides an interface to decode the whole sentence at once. Is it possible to provide an interface to reuse the encoder output at each decoding step to reduce redundant calculations?

OpenNMT / CTranslate2

Apply ctranslate2 to KNN-MT #1100