-
Hello author, how can I use your pretrained GroundingDINO model to train on my own dataset? (My own dataset is very small.)
-
Dear author, could you share the CMU-MOSEI features you used? I am mainly interested in the audio modality, as this is one of the few works that reports results for the audio modality alone. I woul…
-
## Problem statement
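1. The way CLIP variants learn the relationship between images and text is inefficient, in both training and inference, for learning the relationship between individual text tokens and image patches -> let's look for a method that allows finer-level alignment
2. Weaknesses of existing work that uses attention between image patches and text tokens …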
-
Dear Author,
The ARCH dataset is divided into two subsets: the **books_set** and the **pubmed_set**.
I have noticed that the **pubmed_set** appears to overlap with BiomedCLIP, which sources from…
-
https://arxiv.org/pdf/1702.05464.pdf
Adversarial learning methods are a promising approach to training robust deep networks, and can generate complex samples across diverse domains. They also can i…
leo-p updated
7 years ago
-
Dear author:
Hello,
I am running the code for your publication 'Bilateral Cross Modality Graph Matching Attention'
There were some errors in the source code of the paper 'For Feature Fusion in Visual Question A…
Ysis0 updated
8 months ago
-
### Description of the problem
It is not a bug per se, but I feel it is quite a misleading default. Getting the data through "get_data" and "to_data_frame()" gives different output values by default, …
-
- part of #209 and discussion https://github.com/AllenNeuralDynamics/dynamic-foraging-task/issues/209#issuecomment-2093691613
## Steps
- [x] Test the service locally on my PC
- [x] prepare te…
-
Hi,
we have a question regarding the usage of the R package.
Our experiment is a random study design with 100 scans and 9 readers, where each scan is assessed by a block of 3 randomly selected …
-
### Model/Dataset/Scheduler description
Hello Everyone,
I am training the MVXNet model with the default config file available in the repository, which is for point_fusion and the KITTI dataset. I want…