Confusion about the coarse transformer trainer

lucidrains / audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

MIT License

2.33k stars 249 forks source link

Closed xtluo closed 1 year ago

xtluo commented 1 year ago

@lucidrains Here at line 1414 and 1415, why not:

batch = raw_wave_for_codec.shape[0]
num_timesteps = raw_wave_for_codec.shape[1]

lucidrains commented 1 year ago