-
Hello
My name is Suliman Sharif, and I am the author of a Python package called Global-Chem, a dictionary mapping common chemical names to their molecular definitions.
We have been keeping track of y…
-
**0. Summary**
- One of the mPLUG series, targeting text-rich images (documents, webpages, tables, charts, natural images)
- Adaptive Crop (UReader) + Multimodality-Adaptive Module (Owl2) + H-Reducer (Proposed)
- H-Redu…
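The core idea of the H-Reducer is to shorten the visual token sequence by merging horizontally adjacent features, which preserves the left-to-right layout of text in the image. As a rough illustration of the shape arithmetic only, here is a minimal NumPy sketch; the actual module uses a learned convolution rather than the mean pooling used here, and the function name and reduction ratio are illustrative assumptions.

```python
import numpy as np

def h_reduce(features: np.ndarray, ratio: int = 4) -> np.ndarray:
    """Merge groups of `ratio` horizontally adjacent visual tokens.

    `features` has shape (H, W, C). The real H-Reducer applies a learned
    1 x `ratio` convolution; mean pooling here is only a stand-in to show
    how the token grid shrinks along the width.
    """
    h, w, c = features.shape
    assert w % ratio == 0, "width must be divisible by the reduction ratio"
    return features.reshape(h, w // ratio, ratio, c).mean(axis=2)

# A toy 2 x 8 grid of 3-dim visual features -> 16 tokens before reduction.
grid = np.arange(2 * 8 * 3, dtype=float).reshape(2, 8, 3)
reduced = h_reduce(grid, ratio=4)
print(reduced.shape)  # (2, 2, 3): the sequence is 4x shorter
```

Reducing only along the width keeps each output token aligned with a horizontal span of the page, which matters for reading text-rich images.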
hjeun updated 3 months ago
-
While attempting to set up and run the demo notebook from the repository, I encountered multiple issues related to environment setup, package dependencies, and code configurations that significantly h…
-
Hi, I'm trying to run the training.
```
# single modality
python main_alignmif.py -L --workspace kitti360-1908/lidar --enable_lidar --config configs/kitti360_1908.txt
python main_alignmif.py …
```
-
Can you share the training log of t2m_trans?
I found it difficult to train t2m_trans.
-
Can someone please guide me on how to process both audio and .txt data through the Perceiver simultaneously for multimodal learning?
Some example code would be nice.
Thanks
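For context on what such multimodal processing looks like: the Perceiver's design is to flatten every modality into one long input array and let a small latent array cross-attend to it. Below is a minimal NumPy sketch of that data flow under stated assumptions; the real model uses learned projections, multiple heads, and modality-specific position encodings, all omitted here, and the array sizes are arbitrary.

```python
import numpy as np

def softmax(x: np.ndarray, axis: int = -1) -> np.ndarray:
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_attend(latents: np.ndarray, inputs: np.ndarray) -> np.ndarray:
    """Single-head cross-attention: latents (L, D) attend to inputs (N, D).

    No learned weight matrices here; this only shows how a fixed-size
    latent array summarizes an input array of any length.
    """
    scores = latents @ inputs.T / np.sqrt(latents.shape[-1])
    return softmax(scores) @ inputs

rng = np.random.default_rng(0)
d = 16
audio = rng.normal(size=(50, d))  # e.g. 50 audio frames embedded to d dims
text = rng.normal(size=(12, d))   # e.g. 12 text tokens embedded to d dims

# Perceiver-style fusion: concatenate modalities into one flat input array.
inputs = np.concatenate([audio, text], axis=0)  # (62, d)
latents = rng.normal(size=(8, d))               # small latent bottleneck
out = cross_attend(latents, inputs)
print(out.shape)  # (8, 16): fixed-size output regardless of input length
```

Because the latent array has a fixed size, the cost of the attention scales linearly with the combined audio and text length, which is what makes this kind of fusion practical.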
-
Could you help add the paper to the list?
Paper (Oral): Boosting 3D Object Detection by Simulating Multimodality on Point Clouds
Paper Link: https://arxiv.org/abs/2206.14971
Thanks!
-
**Is your feature request related to a problem? Please describe.**
I'm frustrated when I can't use multimodal models like "gpt-4-vision-preview" in Cheshire-cat-ai to process and retrieve information…
-
Should we create a new type of instance to handle multimodality (e.g., images, buttons)?
-
From:
- https://github.com/ggerganov/llama.cpp/issues/4216#issuecomment-1991730224
1. cleaning up the clip/llava libs and improving the API
2. in the old implementation, there were many internal…