microsoft XPretrain issues

microsoft / XPretrain

Multi-modality pre-training

Other

471 stars 37 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Details of zero-shot performance on SSv2

#44 bpiyush opened 1 month ago
0
about clipvip-vit-16 pretrained weights file

#43 musicman217 opened 4 months ago
0
Pretrained Checkpoints of CLIP-VIP

#42 hardlipay opened 4 months ago
0
About activitynet captions dataset in CLIP-ViP

#41 musicman217 opened 4 months ago
0
Is there no classification in the HD-VILA dataset?

#40 shuangtianjiuyou opened 5 months ago
1
Pretrained Checkpoints of LF-VILA

#39 pilibb0712 closed 5 months ago
1
Hi, how to understand the LF-hdvila-8m?

#38 sunwhw opened 7 months ago
1
Dockerfile and requirements for Clip-ViP

#37 ncTimTang opened 7 months ago
0
About LF-VILA code in PatchEmbed3D of video encoder

#36 musicman217 opened 8 months ago
0
Update Dockerfile for Issue #34

#35 MasoudKaviani opened 8 months ago
0
Error on starting horovod

#34 MasoudKaviani opened 8 months ago
0
Bump transformers from 4.30.0 to 4.36.0 in /LF-VILA/docker

#33 dependabot[bot] opened 11 months ago
0
Code for transcript text processing

#32 wwyy1234 opened 1 year ago
1
Model checkpoints

#31 michaeltian108 opened 1 year ago
0
Error in finetuning

#30 Ravindu-Yasas-Nagasinghe opened 1 year ago
1
video caption of HD-VILA-100M Dataset

#29 zyyyz closed 1 year ago
1
Code improvements

#28 tosemml opened 1 year ago
2
How long does CLIP-VIP pretraining takes?

#27 daizuozhuo opened 1 year ago
1
About the zero-shot performance

#26 LiuRicky closed 1 year ago
1
where are the train9k.jsonl and test1ka.jsonl files in MSRVTT retrieval?

#25 wangpichao closed 1 year ago
3
Asking for a simple script to get text and video features

#24 yotammarton opened 1 year ago
8
Bump transformers from 4.15.0 to 4.30.0 in /LF-VILA/docker

#23 dependabot[bot] closed 1 year ago
0
CLIP-VIP OFA caption generate

#22 tikboaHIT closed 1 year ago
1
MSR-VTT fine tune epochs number

#21 ffnc1020 closed 1 year ago
2
Captions for HD-ViLA-100M

#20 hanoonaR closed 1 year ago
1
Ways to open the .mdb caption files

#19 ffnc1020 closed 1 year ago
2
How to prepare pretrain data for LF-VILA?

#18 yliu-cs closed 1 year ago
2
Video compression/decoding methods of each dataset in CLIP-ViP

#17 fadzaka12 closed 1 year ago
1
Question regarding video proxy mechanism in CLIP-ViP

#16 fadzaka12 closed 1 year ago
4
About the zero-shot performance

#15 LiuRicky closed 1 year ago
2
About OFA-Caption generated captions on HD-VILA-100M

#14 LiuRicky closed 1 year ago
1
Reproducing the result of CLIP-ViP performance on MSRVTT

#13 justopit closed 1 year ago
4
How to use HD-VILA as multimodal TextEncoder?

#12 Celia0u0 closed 1 year ago
3
Where is the MSRVTT json file in CLIP-ViP?

#11 justopit closed 1 year ago
2
In CLIP-ViP, what is the results of OFA captions + HD-VILA-10M?

#10 SCZwangxiao closed 1 year ago
1
[CLS] token in CLIP-ViP

#9 goonbamm closed 1 year ago
2
Long Video Processing in LF-VILA

#8 vateye closed 1 year ago
3
Where can i get the asr text

#7 Satan012 closed 1 year ago
1
Code for transcript text processing

#6 RyanMarten closed 2 years ago
27
releasing code and pretrain

#5 maralzar closed 2 years ago
3
where to download the ASR transcriptions?

#4 TXH-mercury closed 2 years ago
1
Questions about HD-VILA

#3 HenryHZY closed 2 years ago
4
HD-VILA-100M dataset, where is the text corresponding to each video?

#2 Qiliqing closed 2 years ago
2
Update README.md

#1 TiankaiHang closed 2 years ago
0