Open YovaKem opened 3 years ago
Can you tell me which hyperparameters were used for beam search at inference time, and whether any length or repetition penalty was applied? Thanks!
Hi, we used greedy decoding for inference.
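For readers unfamiliar with the term: greedy decoding keeps only the single highest-scoring token at each step, so there is no beam width to tune, and a length penalty has no effect on a single hypothesis. The snippet below is a minimal sketch of the idea, not the repository's actual inference code. The `next_token_scores` callable, the `<s>`/`</s>` markers, and the toy score table are all hypothetical stand-ins for a real model.

```python
from typing import Callable, Dict, List


def greedy_decode(
    next_token_scores: Callable[[List[str]], Dict[str, float]],
    bos: str = "<s>",
    eos: str = "</s>",
    max_len: int = 20,
) -> List[str]:
    """Build a sequence by taking the highest-scoring token at every step."""
    tokens = [bos]
    for _ in range(max_len):
        # Hypothetical model call: maps the current prefix to per-token scores.
        scores = next_token_scores(tokens)
        # Greedy choice: keep only the argmax, never any alternative hypotheses.
        best = max(scores, key=scores.get)
        tokens.append(best)
        if best == eos:
            break
    return tokens


# Toy stand-in for a model: scores depend only on the last token (assumption).
TOY_TABLE = {
    "<s>": {"hello": 0.6, "world": 0.3, "</s>": 0.1},
    "hello": {"hello": 0.1, "world": 0.7, "</s>": 0.2},
    "world": {"hello": 0.2, "world": 0.1, "</s>": 0.7},
}


def toy_scores(prefix: List[str]) -> Dict[str, float]:
    return TOY_TABLE[prefix[-1]]


print(greedy_decode(toy_scores))  # ['<s>', 'hello', 'world', '</s>']
```

The only knob left in this sketch is the maximum output length (`max_len` here), which is a stopping condition rather than a search hyperparameter.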