kohjingyu/gill
🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".
https://jykoh.com/gill
Apache License 2.0
433 stars
38 forks
issues
Potentially Misleading Notation in Similarity Calculation for Image-Text Retrieval Loss
#47
suhyeok-jang
opened
2 weeks ago
0
Cannot run distributed training
#46
ALR-alr
opened
1 month ago
0
environment conflict
#45
StephenQSstarThomas
opened
2 months ago
1
About the [img] token and training data
#44
ALR-alr
opened
3 months ago
0
I tried to download CC3M using the tools recommended by README.md, but only 10% of the images could be downloaded. Is this normal?
#43
zhenghuawang6
opened
4 months ago
0
FID Evaluation on CC3M and VIST
#42
shubhamagarwal92
opened
6 months ago
0
shape mismatch in the example "Multimodal Dialogue"
#41
czhhzc
closed
7 months ago
1
param.grad is None!
#40
txdtplus
opened
7 months ago
1
RuntimeError: CUDA error: no kernel image is available for execution on the device. CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
#39
huangyuf
opened
8 months ago
1
Size mismatch error when loading the decision model
#38
haunt98
opened
8 months ago
2
VisDial-related questions
#37
geknow
opened
9 months ago
0
Inference shape is not 8
#36
taemin6697
opened
9 months ago
1
GILL Image Retrieval Code on VIST
#35
haochuan-li
closed
8 months ago
1
why don't you use universal representation in one task?
#34
hsjkdjj
opened
10 months ago
0
[solved]
#33
TongLiu-github
closed
10 months ago
0
shape mismatch in the example notebook
#32
yzeng58
closed
9 months ago
2
Update README.md
#31
ray-ruisun
closed
11 months ago
1
About error when running Precomputing Text Embeddings and Train
#30
ray-ruisun
closed
12 months ago
2
How could this affect the performance?
#29
MiladMt11
closed
10 months ago
10
Normalization of cc3m features
#28
forence
closed
10 months ago
1
About the running log
#27
XuRui314
closed
1 year ago
4
How to get cc3m_embeddings
#26
Control-derek
closed
1 year ago
1
Clarification on precomputing the visual embeddings
#25
MiladMt11
closed
1 year ago
1
Unrecognized 'num-gen-tokens' Argument in model_args.json During Model Checkpoint Pruning
#24
MiladMt11
closed
1 year ago
4
Great Work!!! Need clarification for calculating the R@1 for retrieval
#23
VIROBO-15
closed
1 year ago
1
Incorrect Loss Calculation in 'generation' Mode in validate.py
#22
MiladMt11
closed
1 year ago
1
Training settings for reproducing the paper
#21
sungonce
closed
1 year ago
4
Issue with multiple image inputs. (Only last image input taken into consideration)
#20
Aafiya-H
closed
1 year ago
1
Queries regarding the Precomputed Text Embeddings
#19
VIROBO-15
closed
1 year ago
1
Shape mismatch in example notebook
#18
HireTheHero
closed
1 year ago
6
Multimodal generation in one pass
#17
avipartho
closed
1 year ago
5
instruction tuning on other datasets
#16
ChocoWu
closed
1 year ago
1
Question about PartiPrompts
#15
vishaal27
closed
1 year ago
4
Question about the generation pipeline
#14
avipartho
closed
1 year ago
1
Is there code to prune the pre-trained model to the decision model?
#13
shugerdou
closed
1 year ago
3
Error while loading the gill model
#12
Aafiya-H
closed
1 year ago
2
Question about the training code
#11
Epiphqny
closed
1 year ago
5
Problem in running the evaluation script
#10
VIROBO-15
closed
1 year ago
13
Problem in running the evaluation script
#9
VIROBO-15
closed
1 year ago
1
Generated image quality
#8
Epiphqny
closed
1 year ago
3
Query regarding the preprocessing of the data
#7
VIROBO-15
closed
1 year ago
1
Great Work!!!! Few Queries regarding the preprocessing of the data
#6
VIROBO-15
closed
1 year ago
2
A few questions about the training pipeline
#5
avipartho
closed
1 year ago
15
Custom SD pipeline with hard-coded left truncation of text prompts
#4
avipartho
closed
1 year ago
4
release the evaluation code
#3
ytian8
closed
1 year ago
3
Add diffusers and accelerate to requirements
#2
vishaal27
closed
1 year ago
1
Estimated time-line for code and weights?
#1
vishaal27
closed
1 year ago
2