-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
- `llamafactory` version: 0.9.1.dev0
- Platform: Linux-6.5.0-35-generic-x86_64-with-glibc2.35
- P…
-
There is no way for now to express that a field should be a multidimensional array, for example a 4x4 matrix.
An example of dataset with such a need: MatrixCity (https://github.com/city-super/Matri…
-
Hi Marcel,
I was wondering - are you planning on combining multiple datasets or just using one multimodal dataset?
-
## Background
Aligning sets of coordinates and their pixel intensity data accurately between MSI and other modalities is often visually evaluated which is subjective but not uninformative. Other metr…
-
Hello! I read the paper and want to implement the code, and I want to add some KA features on vit for image classification. My ideas are as below:
Provide a comprehensive and robust pipeline and a da…
-
Due to the large size of the ImageNet dataset, I am using the MiniImageNet dataset. I modified the YAML file accordingly.
datasets:
_target_: flava.definitions.TrainingDatasetsInfo
selected:
…
-
On a recent study (https://dl.acm.org/doi/abs/10.1145/3597312) I've noticed that the difference between the top-N (N = 15 or more) algorithms in most datasets are insignificant. They only differ on a …
-
# Bidsme: flexible bidsifier for multimodal datasets
***By Nikita Beliy, Cyclotron Research Center, University of Liege, Liege, Belgium***
- Theme: Open Data 2.0
- Format: Software/process demo
…
-
### Summary
This issue is to track progress on implementing new pretrained weights from related literature into torchgeo:
- [ ] Clay: [GitHub](https://github.com/Clay-foundation/model), [weights](ht…
-
Model:
- ModelScope: https://www.modelscope.cn/models/iic/mPLUG-Owl3-7B-240728
- Huggingface: https://huggingface.co/mPLUG/mPLUG-Owl3-7B-240728
Usually, fine-tuning a multimodal large model invol…