Closed Zarzard closed 1 year ago
Hi @Zarzard , max_q
is a trick to prevent the number in exp()
explode. It improves numerical stability without affecting the final results. You can also find the same trick in PyTorch's logsumexp
function, see here
Got it, thanks!
Hi! I wonder what role does the term
max_q
play in the implementation of the dual function. This term seems not mentioned in the paper, did I miss anything?