-
Hi! I'm trying to pretrain VindLU on the 5M data. Could you provide the pretraining logs for reference? Thanks!
-
Thanks for sharing your code. Is it normal to get R1=30 with train_titles.py? After running the score fusion, the title matrix does not improve the video matrix.
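
For context, here is a minimal sketch of what "score fusion" and R1 (recall@1) usually mean in this kind of retrieval setup. It is not taken from `train_titles.py`: the function names `recall_at_1` and `fuse`, the `weight` parameter, and the toy matrices are all hypothetical, and the sketch assumes that row `i` of each similarity matrix is a query whose correct match is column `i`.

```python
# Hypothetical illustration only; not the repository's implementation.

def recall_at_1(sim):
    """Fraction of queries (rows) whose highest-scoring column is the diagonal entry."""
    hits = 0
    for i, row in enumerate(sim):
        best = max(range(len(row)), key=lambda j: row[j])
        if best == i:
            hits += 1
    return hits / len(sim)


def fuse(video_sim, title_sim, weight=0.5):
    """Weighted average of two similarity matrices of the same shape."""
    return [
        [(1 - weight) * v + weight * t for v, t in zip(v_row, t_row)]
        for v_row, t_row in zip(video_sim, title_sim)
    ]


# Toy 3x3 similarity matrices (made-up numbers).
video_sim = [[0.9, 0.1, 0.2],
             [0.6, 0.5, 0.1],
             [0.2, 0.3, 0.8]]
title_sim = [[0.7, 0.2, 0.1],
             [0.1, 0.9, 0.2],
             [0.3, 0.1, 0.6]]

print("video R@1:", recall_at_1(video_sim))
print("fused R@1:", recall_at_1(fuse(video_sim, title_sim)))
```

Under these assumptions, fusion can only help when the title matrix ranks some queries correctly that the video matrix gets wrong, so comparing R@1 of each matrix separately and sweeping `weight` is one way to check whether the title scores carry any complementary signal.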
-
Hi,
I was trying to download the pre-extracted features through the link https://bit.ly/2TX9rlZ. But accessing the link gives me the error "We're sorry, but qh53@cornell.edu can't be found in the …
-
Hi,
I followed the instructions in the TRAIN_AND_VALIDATE.md file, downloaded the dataset, and ran the evaluation script. However, I only achieved 46% accuracy on MSRVTT and 60% accuracy on MSVD, whi…
-
Could you share some records of your experiments, such as graphs from neptune.ai or other logs tracking how performance and loss change over the training steps?
I would like to compare the effects of some c…
-
Thanks for the great work. I have some questions about the BLIP feature extractor interface.
1. In the example code, you wrote
```
# torch.Size([1, 12, 768]), use features_multimodal[:,0,:] for m…
```
-
**Describe the bug**
```json
{
"name": "Python: debug_cl",
"type": "debugpy",
"request": "launch",
"program": "swift/cli/main.py",
…
```
-
When I use 'extract_frames.py' to process MSRVTT, it works well. However, when I adapted it to process the MSVD and Breakfast datasets, I got the following error:
> No such file or directory: 'myf…
-
https://github.com/salesforce/LAVIS/blob/59273f651b9bffb193d1b12a235e909e9f826dda/lavis/models/blip2_models/blip2_qformer.py#L242
Hello,
I was going through the code in BLIP-2's repository and I…