-
I rush into the same question like before, #71 , #78 .
I modify the config in configs/prompt_tuning_coco/, generate custom embedding file, to fine-tune my dataset which has 4 categories.
When infer…
-
- [ ] [MoAI/README.md at master · ByungKwanLee/MoAI](https://github.com/ByungKwanLee/MoAI/blob/master/README.md?plain=1)
# MoAI/README.md at master · ByungKwanLee/MoAI
## Description
![MoAI: Mixture…
-
## Problem statement
1. CLIP variants의 이미지와 텍스트 사이의 관계 학습은 텍스트의 각 토큰들과 이미지 패치의 관계에 대해 학습하기에는 학습과 추론 시 효율성이 떨어진다 -> finer-level alignment할 수 있는 방법을 찾아보자
2. 이미지 패치와 텍스트 토큰 간의 attention 이용하는 기존 연구의 약점 …
-
# Prerequisites
Please answer the following question for yourself before submitting an issue.
- [ x] I checked to make sure that this issue has not been filed already.
## 1. The entire URL of…
-
Hi!
Let's bring the documentation to all the Spanish-speaking community 🌐
Who would want to translate? Please follow the 🤗 [TRANSLATING guide](https://github.com/huggingface/transformers/blob/m…
-
hello, 这个工作很出色。我想知道这个工作的泛化性怎么样?如果只在BEAT数据集中训练,迁移到TED数据集中是否还能很好的work?
所学习的语义是否鲁棒。据我所知,大家还是脚本-手势做对比预训练来解耦语义,但是它们之间存在噪声。如果说在潜空间做语义的引导,还是受到词划分的影响。未来如何做语义的感知呢?
期待您的恢复
-
**Kibana version:** 8.5.3
**Elasticsearch version:** 8.5.3
**Browser version:** Chrome 108.0.5359.124
**Original install method (e.g. download page, yum, from source, etc.):** ECK
**Descri…
-
Thank you for your code, the effect is great, but I am training under windows11 platform, encountered problems, so I modified the training script, but the script occupied by the video memory will incr…
-
sorry for your time,
when i trained on the dataset Voc2012,
terminal line: python train.py
it comes out like these bellow:
Create YOLOv3 model with 9 anchors and 16 classes.
Load weights mode…
-
Thanks for your brilliant work!
I'm wondering if the model can detect all objects, such as a 'grounding dino'?