Understanding the mask model used by _get_action_masks in LogitsProcessor

In this line of code, I saw that the init process of MaskLogitsProcessorCasualLM uses deepcopy(self._policy_model).eval() as the mask model. During generation, GenerationMixinWithRawScores.sampler pre-processes the distribution and calls the custom LogitsProcessor as a hook. I compared the two and confirmed that next_token_logits_raw from policy_model is indeed different from next_token_logits from mask_model within the same generate pipeline. What is the purpose of doing this? I would really like to understand.
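To make the question concrete, here is a minimal sketch of the pattern as I understand it. It is not the RL4LMs code: `ToyPolicy`, `TopKMaskProcessor`, and the top-k masking rule are all hypothetical stand-ins, and I am only assuming that the mask model's logits are used to decide which tokens stay available. It shows a frozen deep copy producing different logits from the live policy and restricting the policy's scores.

```python
import copy
import math


class ToyPolicy:
    """Hypothetical stand-in for a language model (not an RL4LMs class)."""

    def __init__(self, weights):
        self.weights = list(weights)

    def next_token_logits(self, context):
        # Toy scoring rule: per-token weight plus a small context-length offset.
        return [w + 0.1 * len(context) for w in self.weights]


class TopKMaskProcessor:
    """Hypothetical processor: masks policy scores using a frozen policy copy."""

    def __init__(self, policy, top_k):
        # Snapshot of the policy at construction time; it is never updated.
        self.mask_model = copy.deepcopy(policy)
        self.top_k = top_k

    def _get_action_masks(self, context):
        # Assumption: allow only the mask model's top-k tokens.
        logits = self.mask_model.next_token_logits(context)
        ranked = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)
        allowed = set(ranked[: self.top_k])
        return [i in allowed for i in range(len(logits))]

    def __call__(self, context, scores):
        # Disallowed tokens get -inf so they can never be sampled.
        mask = self._get_action_masks(context)
        return [s if keep else -math.inf for s, keep in zip(scores, mask)]


policy = ToyPolicy([2.0, 1.0, 0.5, -1.0])
processor = TopKMaskProcessor(policy, top_k=2)

# Simulate a training update: the live policy drifts, the snapshot does not.
policy.weights = [0.0, 1.0, 3.0, 2.5]

context = [1, 2]
raw_scores = policy.next_token_logits(context)
mask_scores = processor.mask_model.next_token_logits(context)
masked_scores = processor(context, raw_scores)
```

In this toy setup the live policy now prefers tokens 2 and 3, but the snapshot still allows only tokens 0 and 1, so the two sets of logits differ in the same way I observed. Whether this kind of restriction is the intended purpose in RL4LMs is exactly what I am asking.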

allenai / RL4LMs

Understanding the mask model used by _get_action_masks in LogitsProcessor #31