MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multilingual models from Aleph Alpha check out our website https://app.aleph-alpha.com
MIT License
469
stars
55
forks
source link
fixing dimensionality error in argmax sampling #43
Transposing the vector to align with dimensions expected downstream.
When setting up temperature to 0, the error is:
File "magma/magma/sampling.py", line 107, in generate
out = torch.cat((out, next_token), dim=-1)
RuntimeError: Tensors must have same number of dimensions: got 2 and 1
Transposing the vector to align with dimensions expected downstream.
When setting up temperature to 0, the error is:
Example of the next token output:
while the expected output is