lucidrains / audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
MIT License

Bug in generation when generating with Encodec #236

Closed FrancescoVV closed 9 months ago

FrancescoVV commented 9 months ago

Encodec doesn't support the "-1" value that is used to mask tokens after EOS.

In particular, the coarse_token_ids here contain some trailing -1s, so the line `coarse_and_fine_ids = torch.cat((coarse_token_ids, sampled_fine_token_ids), dim = -1)`

will still contain some -1s that will not be recognised by Encodec.
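A minimal sketch of the failure mode and one possible workaround, using hypothetical token values (the tensor contents below are made up for illustration; only the `torch.cat` line mirrors the library code quoted above):

```python
import torch

# Hypothetical token ids of shape (batch, seq_len), where -1 is used
# as post-EOS padding by the transformer sampling loops.
coarse_token_ids = torch.tensor([[12, 7, 99, -1, -1]])
sampled_fine_token_ids = torch.tensor([[3, 41, 8, -1, -1]])

# This is the concatenation from the library; the -1 padding survives it.
coarse_and_fine_ids = torch.cat((coarse_token_ids, sampled_fine_token_ids), dim=-1)

# Encodec has no codebook entry for -1, so the padding must be removed
# (or replaced with a valid id) before decoding. For batch size 1,
# dropping every -1 is enough:
valid = coarse_and_fine_ids != -1
trimmed = coarse_and_fine_ids[valid].reshape(1, -1)
```

For batch sizes greater than 1, per-sample trimming (or masked replacement with a valid token id before decode) would be needed, since each sample may have a different number of padded positions.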

lucidrains commented 9 months ago

ah I see, you've reached the fine transformer stage

are you sampling more than one audio at a time?

lucidrains commented 9 months ago

yeah, I can get a naive solution for this issue this am

lucidrains commented 9 months ago

@FrancescoVV ok, try 1.5.7

FrancescoVV commented 9 months ago

I can try on Friday or the weekend, but unfortunately not before that. I will close the issue myself if it's fixed.

In any case, the issue exists during generation with a batch size of 1 or more.

lucidrains commented 9 months ago

@FrancescoVV sounds good, i think it is fixed, but you can do the honors

FrancescoVV commented 9 months ago

The issue is fixed now!

lucidrains commented 9 months ago

noice