kohjingyu/fromage
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
https://jykoh.com/fromage
Apache License 2.0
466 stars
34 forks
issues
Unexpected key(s) in state_dict
#39
mlarrarte
closed
1 week ago
0
Replacing OPT LLM with other LLMs
#38
oferidan1
opened
1 month ago
0
How to download dataset and prepare tsv file?
#37
LiJichen0114
opened
1 month ago
0
Retrieval-only mode
#36
oferidan1
opened
2 months ago
1
Can I use the embedding for training
#35
LiJichen0114
closed
2 months ago
2
Evaluation code of VQAv2
#34
Yui010206
opened
6 months ago
0
torch.distributed.all_gather does not have grads
#33
MrZilinXiao
closed
8 months ago
2
=> no checkpoint found at '=/home/...
#32
eveningwalk
closed
9 months ago
0
Freezing the final linear layer when adding new token [RET]
#31
ptirupat
closed
7 months ago
1
I got Unexpected key(s) in state_dict error
#30
eveningwalk
closed
9 months ago
2
I got "KeyError: 'best_score'" while trying to fine-tune
#29
kxxseola
closed
9 months ago
7
Can you provide a pre-trained weight file without pruning?
#28
seungwoo-Jang
closed
9 months ago
5
What are CC3M embeddings?
#27
ziqipang
closed
10 months ago
2
[RET] Embedding
#26
pUmpKin-Co
closed
10 months ago
3
Dealing with Corrupted Images in CC3M
#25
ziqipang
closed
10 months ago
4
The ability of in-context learning
#24
yongliang-wu
closed
11 months ago
0
Evaluation code for VQAv2
#23
ys-zong
opened
11 months ago
4
The reproduction of FROMAGe training
#22
Ziyang412
closed
1 year ago
6
The cross entropy loss in training stage
#21
Ziyang412
closed
1 year ago
2
The evaluation speed of IT2T on VisDial
#20
Ziyang412
closed
1 year ago
8
Evaluation for VisDial
#19
Ziyang412
closed
1 year ago
1
Weights of `lm_head` were not frozen during training?
#18
ys-zong
closed
1 year ago
3
How are the inputs arranged for in-context retrieval evaluation?
#17
ys-zong
closed
1 year ago
5
How does generate work?
#16
zhaoshitian
closed
1 year ago
2
Huggingface pipeline
#15
Marcusntnu
closed
1 year ago
2
Add log-likelihood score function to Fromage
#14
vishaal27
closed
1 year ago
1
Computing output likelihoods with the model
#13
vishaal27
closed
1 year ago
7
How to load dataset?
#12
zhaoshitian
closed
1 year ago
2
Hello, I want to know the purpose of create_image_of_text
#11
SZhanZ
closed
1 year ago
2
Choice of retrieval embedding dimension q = 256
#10
EIFY
closed
1 year ago
3
Failure in testing the demo
#9
Yingjia-Wan
closed
1 year ago
1
What is "fromage_vis4" model?
#8
ahnjaewoo
closed
1 year ago
3
Do you think bigscience/bloom can replace the facebook/opt model?
#7
svjack
closed
1 year ago
5
Question about the frozen language model
#6
sijeh
closed
1 year ago
3
Concatenating two captions in retrieval mode
#5
jeasinema
closed
1 year ago
6
Should the last_embedding_idx = caption - 2?
#4
sijeh
closed
1 year ago
2
Add public demo
#3
alvanli
closed
1 year ago
0
Asking for roadmap with more details?
#2
ZeinabTaghavi
closed
1 year ago
1
When will the source code be released?
#1
runzeer
closed
1 year ago
3