issues
search
microsoft
/
XPretrain
Multi-modality pre-training
Other
471
stars
37
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Details of zero-shot performance on SSv2
#44
bpiyush
opened
1 month ago
0
about clipvip-vit-16 pretrained weights file
#43
musicman217
opened
4 months ago
0
Pretrained Checkpoints of CLIP-VIP
#42
hardlipay
opened
4 months ago
0
About activitynet captions dataset in CLIP-ViP
#41
musicman217
opened
4 months ago
0
Is there no classification in the HD-VILA dataset?
#40
shuangtianjiuyou
opened
5 months ago
1
Pretrained Checkpoints of LF-VILA
#39
pilibb0712
closed
5 months ago
1
Hi, how to understand the LF-hdvila-8m?
#38
sunwhw
opened
7 months ago
1
Dockerfile and requirements for Clip-ViP
#37
ncTimTang
opened
7 months ago
0
About LF-VILA code in PatchEmbed3D of video encoder
#36
musicman217
opened
8 months ago
0
Update Dockerfile for Issue #34
#35
MasoudKaviani
opened
8 months ago
0
Error on starting horovod
#34
MasoudKaviani
opened
8 months ago
0
Bump transformers from 4.30.0 to 4.36.0 in /LF-VILA/docker
#33
dependabot[bot]
opened
11 months ago
0
Code for transcript text processing
#32
wwyy1234
opened
1 year ago
1
Model checkpoints
#31
michaeltian108
opened
1 year ago
0
Error in finetuning
#30
Ravindu-Yasas-Nagasinghe
opened
1 year ago
1
video caption of HD-VILA-100M Dataset
#29
zyyyz
closed
1 year ago
1
Code improvements
#28
tosemml
opened
1 year ago
2
How long does CLIP-VIP pretraining takes?
#27
daizuozhuo
opened
1 year ago
1
About the zero-shot performance
#26
LiuRicky
closed
1 year ago
1
where are the train9k.jsonl and test1ka.jsonl files in MSRVTT retrieval?
#25
wangpichao
closed
1 year ago
3
Asking for a simple script to get text and video features
#24
yotammarton
opened
1 year ago
8
Bump transformers from 4.15.0 to 4.30.0 in /LF-VILA/docker
#23
dependabot[bot]
closed
1 year ago
0
CLIP-VIP OFA caption generate
#22
tikboaHIT
closed
1 year ago
1
MSR-VTT fine tune epochs number
#21
ffnc1020
closed
1 year ago
2
Captions for HD-ViLA-100M
#20
hanoonaR
closed
1 year ago
1
Ways to open the .mdb caption files
#19
ffnc1020
closed
1 year ago
2
How to prepare pretrain data for LF-VILA?
#18
yliu-cs
closed
1 year ago
2
Video compression/decoding methods of each dataset in CLIP-ViP
#17
fadzaka12
closed
1 year ago
1
Question regarding video proxy mechanism in CLIP-ViP
#16
fadzaka12
closed
1 year ago
4
About the zero-shot performance
#15
LiuRicky
closed
1 year ago
2
About OFA-Caption generated captions on HD-VILA-100M
#14
LiuRicky
closed
1 year ago
1
Reproducing the result of CLIP-ViP performance on MSRVTT
#13
justopit
closed
1 year ago
4
How to use HD-VILA as multimodal TextEncoder?
#12
Celia0u0
closed
1 year ago
3
Where is the MSRVTT json file in CLIP-ViP?
#11
justopit
closed
1 year ago
2
In CLIP-ViP, what is the results of OFA captions + HD-VILA-10M?
#10
SCZwangxiao
closed
1 year ago
1
[CLS] token in CLIP-ViP
#9
goonbamm
closed
1 year ago
2
Long Video Processing in LF-VILA
#8
vateye
closed
1 year ago
3
Where can i get the asr text
#7
Satan012
closed
1 year ago
1
Code for transcript text processing
#6
RyanMarten
closed
2 years ago
27
releasing code and pretrain
#5
maralzar
closed
2 years ago
3
where to download the ASR transcriptions?
#4
TXH-mercury
closed
2 years ago
1
Questions about HD-VILA
#3
HenryHZY
closed
2 years ago
4
HD-VILA-100M dataset, where is the text corresponding to each video?
#2
Qiliqing
closed
2 years ago
2
Update README.md
#1
TiankaiHang
closed
2 years ago
0