-
I run the program in pycharm, one error listed below occurs, how to solve it?
ValueError: Unrecognized model in weights/icon_caption_florence. Should have a `model_type` key in its config.json, or co…
-
![image](https://user-images.githubusercontent.com/43195646/113852368-1322bc00-97cf-11eb-90be-c851d2ffa096.png)
-
像是缺失了文件
Unrecognized model in D:\LIUGEGE\ComfyUI\models\Joy_caption_alpha\text_model. Should have a `model_type` key in its config.json, or contain one of the following strings in its name: albert, a…
-
### Model description
Align Before Fuse (ALBEF) is a vision-language (VL) model that showed competitive results in numerous VL tasks such as image-text retrieval, visual question answering, visual …
-
Hello.
I have a question about the VLBertEmbeddings class.
In its forward function, a global image feature is added into linguistic tokens
The last token in vision sequence is used as the globa…
-
- https://arxiv.org/abs/2103.16110
- 2021
我々は、Kaleido-BERTと名付けられた新しい視覚言語(VL)事前学習モデルを発表します。
このモデルは、変換器からのクロスモダリティ表現をファッション化するための新しいカレイド戦略を導入しています。
最近のVLモデルのランダムなマスキング戦略とは対照的に、我々は、画像とテキストの意味的な関係に共…
e4exp updated
3 years ago
-
# ComfyUI Error Report
## Error Details
- **Node Type:** Joy_caption_two
- **Exception Type:** ValueError
- **Exception Message:** Unrecognized model in E:\comfyui-auto\models\Joy_caption_two\te…
-
```
$ python captions_generator.py --save_path synthetic_captions --generation_idx 0 --concept_bank_size -1 --me…
-
From my understanding, we get visuo-linguistic embeddings using VilBert (and LXMERT and VL-Bert for that matter too). Is it possible to simply use these as a layer/feature extractor backbone for visua…
-
### Describe the bug
I installed text generation webui and downloaded the model(TheBloke_Yarn-Mistral-7B-128k-AWQ) and I can't run it. I chose Transofmer as Model loader. I tried installing autoawq b…