AkariAsai / self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
https://selfrag.github.io/
MIT License

Questions about beam search algorithm #90

Open gabbyzyk opened 1 month ago

gabbyzyk commented 1 month ago

Thanks for your great work! However, I have a question about the use of beam search. During inference, the generative model M sequentially generates textual outputs y, which consist of multiple segments y = [y1, ..., yT]. At each time step t, I want to confirm whether beam search keeps the b candidate sequences [y1, y2, ..., yt] with the highest scores, and, after generation is complete, selects the sequence with the highest overall score. I would greatly appreciate it if you could find the time to answer my question.
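In pseudocode, my current understanding looks roughly like the following. This is just a minimal sketch of my mental model; `propose` (returns candidate next segments for a prefix) and `score` (scores one candidate segment) are placeholders I made up, not functions from this repository:

```python
def segment_beam_search(x, propose, score, beam_size=2, max_segments=7):
    """Keep the top-b partial outputs [y1, ..., yt] at each segment step t."""
    beams = [([], 0.0)]  # (segments generated so far, cumulative score)
    for _ in range(max_segments):
        # Expand every beam with every candidate next segment.
        expanded = [
            (segs + [cand], total + score(x, segs, cand))
            for segs, total in beams
            for cand in propose(x, segs)
        ]
        if not expanded:
            break
        # Prune back to the b highest-scoring partial sequences.
        beams = sorted(expanded, key=lambda b: b[1], reverse=True)[:beam_size]
    # After generation completes, pick the highest-scoring full sequence.
    return max(beams, key=lambda b: b[1])[0]
```

Is this the right picture, just with sentences/segments as the unit instead of tokens?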

fate-ubw commented 1 month ago

First of all, to understand the logic of beam search in the SELF-RAG algorithm, you first need to understand the standard token-based beam search algorithm. The beam search used for long-form generation differs in that the sentence (segment), rather than the token, is the basic unit.

As for your question, I have to admit that the long-form beam search in SELF-RAG is still quite difficult to understand. I think the best approach is to read the code and the paper side by side and step through the inference with a debugger, because it is hard to explain the process precisely and understandably in words alone.

Also note that there are some bugs in the official SELF-RAG long-form inference code that can confuse beginners. I fixed some of those bugs in raglab, rewrote this part according to the algorithm proposed in the paper, and implemented true sentence-level beam search: selfrag longform beam search. Hopefully the above helps you ~
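To make "sentence as the basic unit" a bit more concrete, here is a hedged sketch of the per-segment scoring described in the paper: each candidate segment is scored by its LM log-probability plus a weighted sum of normalized probabilities of the desirable critique tokens (ISREL, ISSUP, ISUSE). The weights, token strings, and function names below are illustrative placeholders based on my reading of the paper, not the official implementation:

```python
import math

# Illustrative weights for the three critique-token groups; the real values
# are decoding hyperparameters, not fixed constants.
CRITIQUE_WEIGHTS = {"ISREL": 1.0, "ISSUP": 1.0, "ISUSE": 0.5}

def normalized_prob(token_logprobs, desirable):
    """Probability mass on the desirable token within its critique group,
    e.g. {"[Relevant]": -0.1, "[Irrelevant]": -2.4} with "[Relevant]"."""
    probs = {tok: math.exp(lp) for tok, lp in token_logprobs.items()}
    return probs[desirable] / sum(probs.values())

def segment_score(lm_logprob, critique):
    """critique maps group name -> (per-token log-probs, desirable token)."""
    return lm_logprob + sum(
        CRITIQUE_WEIGHTS[g] * normalized_prob(lps, want)
        for g, (lps, want) in critique.items()
    )

# Example: one segment with LM log-prob -3.2 and one critique group scored.
print(segment_score(-3.2, {
    "ISREL": ({"[Relevant]": -0.1, "[Irrelevant]": -2.4}, "[Relevant]"),
}))
```

A score of this shape is what gets accumulated per segment in the beam loop you sketched, which is why debugging at the segment boundary is the easiest way to follow what the code is doing.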