kohjingyu fromage issues

kohjingyu / fromage

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".

https://jykoh.com/fromage

Apache License 2.0

466 stars 34 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Unexpected key(s) in state_dict

#39 mlarrarte closed 1 week ago
0
Replacing OPT LLM with other LLMs

#38 oferidan1 opened 1 month ago
0
How to download dataset and prepare tsv file?

#37 LiJichen0114 opened 1 month ago
0
retrieval only mode

#36 oferidan1 opened 2 months ago
1
Can I use the embedding for training

#35 LiJichen0114 closed 2 months ago
2
Evaluation code of VQAv2

#34 Yui010206 opened 6 months ago
0
torch.distributed.all_gather does not have grads

#33 MrZilinXiao closed 8 months ago
2
=> no checkpoint found at '=/home/...

#32 eveningwalk closed 9 months ago
0
Freezing the final linear layer when adding new token [RET]

#31 ptirupat closed 7 months ago
1
I got Unexpected key(s) in state_dict error

#30 eveningwalk closed 9 months ago
2
I got 'KeyError: 'best_score'' while trying to fine-tuning

#29 kxxseola closed 9 months ago
7
can you give me a pre-trained weight file not pruning?

#28 seungwoo-Jang closed 9 months ago
5
What is CC3M Embeddings

#27 ziqipang closed 10 months ago
2
[RET] Embedding

#26 pUmpKin-Co closed 10 months ago
3
Dealing with Corrupted Images in CC3M

#25 ziqipang closed 10 months ago
4
The ability of in-context learning

#24 yongliang-wu closed 11 months ago
0
Evaluation code for VQAv2

#23 ys-zong opened 11 months ago
4
The reproduction of FROMAGe training

#22 Ziyang412 closed 1 year ago
6
The cross entropy loss in training stage

#21 Ziyang412 closed 1 year ago
2
The evaluation speed of IT2T on VisDial

#20 Ziyang412 closed 1 year ago
8
Evaluation for VisDial

#19 Ziyang412 closed 1 year ago
1
Weights of `lm_head` were not frozen during training?

#18 ys-zong closed 1 year ago
3
How are the inputs arranged for in-context retrieval evaluation?

#17 ys-zong closed 1 year ago
5
How does generate work?

#16 zhaoshitian closed 1 year ago
2
Huggingface pipeline

#15 Marcusntnu closed 1 year ago
2
Add log-likelihood score function to Fromage

#14 vishaal27 closed 1 year ago
1
Computing output likelihoods with the model

#13 vishaal27 closed 1 year ago
7
How to load dataset?

#12 zhaoshitian closed 1 year ago
2
Hello, I wanna konw the purpose of create_image_of_text

#11 SZhanZ closed 1 year ago
2
Choice of retrieval embedding dimension q = 256

#10 EIFY closed 1 year ago
3
Failure in testing the demo

#9 Yingjia-Wan closed 1 year ago
1
What is "fromage_vis4" model?

#8 ahnjaewoo closed 1 year ago
3
Do you think bigscience/bloom can be a replacement of facebook/opt model ?

#7 svjack closed 1 year ago
5
Question about the frozen language model

#6 sijeh closed 1 year ago
3
Concatenating two captions in retrieval mode

#5 jeasinema closed 1 year ago
6
Should the last_embedding_idx = caption - 2 ?

#4 sijeh closed 1 year ago
2
Add public demo

#3 alvanli closed 1 year ago
0
Asking for roadmap with more details?

#2 ZeinabTaghavi closed 1 year ago
1
when the source codes can be released?

#1 runzeer closed 1 year ago
3