-
### 🐛 Bug
Adding `thunder.jit` to `ResBlock` in the UNet stage of NeMo SD is raising an error. From looking at the ResBlock call in NeMo code, the class is called correctly with right arguments. In…
-
NEON has a treasure trove of different data types that are ripe for being ingested into a multimodal model. Are those data AI ready? If not, how can NEON serve up data to the community in an AI/ML rea…
-
https://arxiv.org/abs/2212.14453
-
I'm trying to run Multimodal RAG for processing videos using OpenAI GPT4V and LanceDB vectorstore
https://github.com/run-llama/llama_index/blob/main/docs/docs/examples/multi_modal/multi_modal_video…
-
Hello!
We are writing a master's thesis and our supervisor has provided us with a dataset where there are texts of negotiators (winners and losers), their neurometry (data from an encephalogram) and …
-
### Contact Details
_No response_
### Dataset description
A Long-term Gap-free High-resolution Air Pollutants concentration dataset (abbreviated as LGHAP) is of great significance for environmental…
-
Hi Alex,
I really like your tutorials and used them as a good example for starting own projects ;) but I think
there is a major error in the preprocessing, performed by the `split_into_XY` - func…
-
got prompt
Loading model from /home/sam/ComfyUI/models/Molmo/molmo-7B-D-bnb-4bit
Unused kwargs: ['_load_in_4bit', '_load_in_8bit', 'quant_method']. These kwargs are not used in .
We will use 90% of…
-
Hi,
I am noticing discrepancy in model performance between runs on latest version of MOSI from the CMU multimodal SDK and the numbers reported in the paper. Upon digging further, turns out the dime…
-
### 🐛 Bug
NeMo's Stable Diffusion uses [CLIPTokenizer](https://github.com/NVIDIA/NeMo/blob/v1.23.0/nemo/collections/multimodal/modules/stable_diffusion/encoders/modules.py#L264) from HuggingFace. A…