-
It doesn't mean we read them all because some of them are project specific.
-
Hi, thanks to share your great codes!
i inference my own videos, and success (use pre-trained models)
but i didn't know what a dataset are..
**1. I found dataset, can you check it is correct?**…
-
For Multi-Modal section:
[VinVL: Revisiting Visual Representations in Vision-Language Models](https://arxiv.org/abs/2101.00529)
[12-in-1: Multi-Task Vision and Language Representation Learning](ht…
-
Hi, thanks for the wonderful work.
I want to caption my own videos giving the video frames (without transcript), can I use the pretrained weight (`univl.pretrained.bin`) provided in the repository di…
-
Hi, could you kindly offer the faster rcnn features with google driver or baidu driver, because the link you offered is invalid. Thank you!
UserProjectMissing
Bucket is a requester pays bucket b…
-
Hello,
Thank you for sharing the code and responding to all issues. I have two questions:
1. For the uni-modal training, did you use the "linear embedder"?
2. How did you decide the dimension…
-
tensorflow/docs related.
## URL(s) with the issue:
Code example for `MultiHeadAttention` at:
https://www.tensorflow.org/tutorials/text/transformer?hl=en#multi-head_attention
and code examp…
-
Hi, I've been interested in image captioning and specifically automatic medical report generation, and I stumbled across your VisualGPT which seemed to take a promising approach, and I've been trying …
-
I am very interested in ur work. But I can't find the original paper. Could you please share a link to read your paper"Attentive Visual Semantic Specialized Network for Video Captioning", Thanks a lot…
-
## Description of bug / unexpected behavior
trying to set up logging in a config file fails
## Expected behavior
logs written to a file
## How to reproduce the issue
Code for reproduci…