-
## Goal
Release an academic paper for our effort of training the sound modality.
## Description
We are pushing out the paper for claim our result for a better position in the research community.
We a…
-
I have observed that the command you provided for launching the training with the Food101 dataset is:
```
--lorb m3ae --modulation Normal --dataset Food101
```
However, in the class structure …
-
I am trying to finetune llama3.2 Vision Instruct, and I am using the distributed recipe and example (lora) config as a starting point. Eventually, I am looking to use a custom dataset, but first, I am…
-
[The format of the issue]
Paper name/title:
Paper link:
Code link:
amusi updated
2 months ago
-
Hi,
I have implemented text prompt-controlled segmentation using selective search and CLIP. Can you suggest any additional techniques I can include? I am considering trying CLIP-GradCAM #4
h…
-
跑了几遍修了一些bug,但还是没跑通😢
-
Post a link for a "possibility" reading of your own on the topic of Sound and Image Learning [for week 7], accompanied by a 300-400 word reflection that: 1) briefly summarizes the article (e.g., as we…
lkcao updated
2 years ago
-
1.Deap learning 기반 COF 이미지 검사시스템
- 학위 논문으로 연락처나 email이 안 남아 있음
2.YOLO 개선을 통한 PCB 불량 검출 시스템 설계 및 구현
- lynnshin.tistory.com/47 // YOLO는 객체 위치와 클래스를 감지하는 모델이라 적합하지 않음
- 오히려 Resnet, DenseNet이 효율적인 이미…
-
Thank you for your contribution. Regarding the c-index value of the experiment, it does not reach the result in your paper.Use resnet 50, x20, 256×256patches in CLAM for feature extraction. Multimodal…
H-Q-N updated
2 months ago
-
Curious if there would be any benefit from using images from the internet. That being said, I'm not sure if there would be any issues w/ licensing.
The Poo detector using YoloX-tiny works, but I th…
njho updated
8 months ago