Closed TKELKAR123 closed 4 months ago
How did you get it to run, i am stuck in the model checkpoint loading (I am using a Macbook m1 pro), i already tried pip and i am now using conda.
RuntimeError: Error(s) in loading state_dict for PENGI: Unexpected key(s) in state_dict: "caption_encoder.base.embeddings.position_ids", "caption_decoder.gpt.transformer.h.0.attn.bias", "caption_decoder.gpt.transformer.h.0.attn.maskedbias", "caption (it keeps going..)
@AFMSB It looks similar to this issue: https://github.com/microsoft/Pengi/issues/11
Keep getting generic output:
This is my input:
My audio file works - is the audio file not the right format?