-
-
I have a question about the preprocessing step: is the spatial language model in the code based on the four datasets (train, test, val, and split) of the WMT16 multimodal translation task, and what code comman…
-
Is there any way to bypass the data-preprocessing step for MBT ("Attention Bottlenecks for Multimodal Fusion") if I only want to run inference without passing in the actual data from AS? I notice the m…
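One common way to exercise a model without the real preprocessed data is to feed dummy tensors of the expected shapes. The sketch below is purely illustrative: the dictionary keys, frame count, image resolution, and spectrogram dimensions are my assumptions, not the repo's actual preprocessing output.

```python
import numpy as np

def make_dummy_batch(batch_size=1):
    """Build a fake multimodal batch for a smoke-test forward pass.

    Shapes are hypothetical stand-ins for an AudioSet-style setup:
    a short RGB clip plus a log-mel spectrogram. Adjust them to
    whatever the actual model config expects.
    """
    rgb = np.random.rand(batch_size, 8, 224, 224, 3).astype(np.float32)
    spec = np.random.rand(batch_size, 800, 128).astype(np.float32)
    return {"rgb": rgb, "spectrogram": spec}

batch = make_dummy_batch()
print(batch["rgb"].shape, batch["spectrogram"].shape)
```

Running the model on such a batch only verifies that shapes and the forward pass line up; the predictions themselves are of course meaningless.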
-
Dear Tada,
I am wondering whether OpenFace has been used to identify the facial landmarks of children. I assume that children's facial landmarks can be identified with OpenFace, but I am not sure about t…
-
### Describe the issue
When will the llava-1.6 training dataset and training code be open-sourced?
Hello, I'm glad to see that the performance of llava-1.6 has improved significantly. I believe i…
-
- [ ] NQ (https://ai.google.com/research/NaturalQuestions/dataset)
- [ ] TriviaQA (https://nlp.cs.washington.edu/triviaqa/)
- [ ] HotpotQA
- [ ] DROP
-
Hi, nice work!
Do you have a plan to release the evaluation code for SHOW-1 on UCF-101 and MSRVTT? If you can open-source the evaluation code, I believe future work can be fairly compared to sh…
-
Hello,
I'm working on reproducing the results in your paper "Attention Bottlenecks for Multimodal Fusion" and trying to implement MBT for other audiovisual video classification tasks.
However, the pr…
-
Hi,
Thank you very much for releasing the source code of your work. I noticed that you use CheXpert for the multimodal pre-training of your model. However, as far as I'm aware, the CheXpert dataset doe…
-
### Describe your use-case.
This repository uses multiple simple models: BLIP, CLIP, and WD taggers. However, when it comes to detailed descriptions, they are all dwarfed by modern multi…