-
Hi!
I’ve started developing the Multimodal DataLoader. After taking a (deep) look at this whole multimodal universe, I would like to discuss a couple of things before continuing. I’m using the [to…
-
# URL
- https://arxiv.org/abs/2411.02571
# Authors
- Sheng-Chieh Lin
- Chankyu Lee
- Mohammad Shoeybi
- Jimmy Lin
- Bryan Catanzaro
- Wei Ping
# Abstract
- State-of-the-art retrieval mod…
-
- [x] Push Branch with starter code
- [x] #54
- [ ] Add dataloading support in the pytorch_dataset class
- [ ] Add modeling support
-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussion…
-
Hi,
is it possible to also upload the training scripts and resulting network weights for the multimodal configuration (training on both optical and radar data with RandomSensorDrop)?
-
Hello, I would like to reproduce your work as soon as possible. Could you provide the preprocessed Learn2Reg multimodal abdominal dataset?
-
I am currently planning to prepend an image to the query section, meaning the query will consist of an image along with a question about it. The system will then search the provided documents to find …
-
One of the strengths of `mlr3torch` is that it can easily handle multimodal data. This is because a neural network built out of `PipeOpTorch` operators can have multiple inputs (`PipeOpTorchIngress`).…
-
**Is your feature request related to a problem? Please describe.**
I have to manage content with images an…
-
### Priority
P1-Stopper
### OS type
Ubuntu
### Hardware type
Xeon-SPR
### Installation method
- [ ] Pull docker images from hub.docker.com
- [X] Build docker images from source
…