-
-
Should we create a new type of instance to handle multimodality (e.g., images, buttons)?
-
**0. Summary**
- mPLUG 시리즈 중 하나로 Text-Rich Image 타겟 (Document, Webpage, Table, Chart, Natural Image)
- Adaptive Crop (UReader) + Multimodality-Adaptive Module (Owl2) + H-Reducer (Proposed)
- H-Redu…
hjeun updated
6 months ago
-
AMLB is converting object dtype and string dtype to category dtype: https://github.com/openml/automlbenchmark/blob/master/amlb/datasets/file.py#L310-L312
This would bring trouble when the framework…
-
Thanks for your great work! I am trying to reproduce the performance of ReMoDiffuse as described in the paper, but my FID score is not satisfactory, and my MultiModality score is particularly high. Is…
-
Free-floating modelling
-
When different parameter dimensions need to be treated differently, e.g. with a different bandwidth scale, or only partial multimodality, or different discrete setups, it would be good to allow the tr…
-
Can you share the training log of t2m_trans?
I found it difficult to train t2m_trans.
-
**Is your feature request related to a problem? Please describe.**
I'm frustrated when I can't use multimodal models like "gpt-4-vision-preview" in Cheshire-cat-ai to process and retrieve information…
-
Requires a change in the argument of the LoadAllDicomFiles in line 56 of Deforminator_PIL.
If your dicom images are labeled "IM_" followed by a number, this line should be changed to:
[IM_unreg]=Lo…