-
Wrong URL. The correct one should point to the dataset mentioned here:
`https://cf.10xgenomics.com/samples/xenium/2.0.0/Xenium_V1_humanLung_Cancer_FFPE/Xenium_V1_humanLung_Cancer_FFPE_he_image.ome.tif`
-
Hi!
I am trying to use the eval_coco_retrieval script. However, I am running into the following error:
File "evaluation/eval_coco_retrieval.py", line 33, in
from multimodal_bert.datasets…
-
### Feature Name
mPLUG-DocOwl 1.5
### Feature Description
Research about mPLUG-DocOwl 1.5
### Research Findings
## mPLUG-DocOwl 1.5
mPLUG-DocOwl 1.5 is a state-of-the-art multimodal large lang…
-
## ASR
- [ ] ASR2K: Speech Recognition for Around 2000 Languages without Audio https://arxiv.org/abs/2209.02842
- [x] Whisper: Whisper is a general-purpose speech recognition model. https://github…
-
In GitLab by @sharkovsky on Jan 4, 2023, 17:38
We define as "multimodal" any data that are not represented by a single tensor, but rather by (potentially nested) collections of tensors.
For example,…
-
ImageNet labels are way too coarse-grained. @themachinefan put ImageNet through a SAM pipeline to get a label for each patch.
The results are here: https://huggingface.co/datasets/Prisma-Multimodal…
-
[kgbench: A Collection of Datasets for Multimodal and Relational Learning on Heterogeneous Knowledge](https://openreview.net/forum?id=yeK_9wxRDbA) ([pdf](https://openreview.net/pdf?id=yeK_9wxRDbA), [G…
-
Hello, great work!!! Could you please provide a script to transform my personal dataset into the MERR data format?
-
NusaCatalogue: https://indonlp.github.io/nusa-catalogue/card.html?id_mm_pmd
| Dataset | id_mm_pmd |
|-------------|---|
| Description | Introduced in the FLAVA paper, Public Mu…
-
Context:
@snat-s has made significant progress in reviewing and updating the planned multimodal dataset (a combination of many datasets) for the NEKO model.
There are numerous older issues that we wa…