-
# CVPR 2024 - Open-Vocabulary Video Anomaly Detection
* Paper:
![image](https://github.com/lartpang/blog/assets/26847524/39e42922-523a-4f5f-b350-23fa5164eeab)
This paper mainly studies open-vocabulary video anomaly detection (open-vocabulary …
-
Hi @yhcao6, when I run the second step of Open-Vocabulary Detection, `python tools/v3det_ovd_utils/split_base_novel.py datasets/V3Det/annotations/v3det_2023_v1_train.json`, I found this step to do was t…
-
Thanks for the great work!
In the paper, the authors use MViT to extract high-quality class-specific proposals using image-level labels based on its great generalization ability. Thus, a straightfo…
fushh updated
5 months ago
-
Florence-2 is a new VLM that is capable of very accurately finding specific objects within images such as "purple balloon," "red blimp," "yellow square" even in terrible conditions such as low light, …
-
Hello, I'm currently reading through papers and accompanying code. First of all, thank you for helping me conduct some great experiments!
However, after reviewing the citation you provided, I wou…
-
This repository has helped me a lot with learning. I appreciate it.
-
![image](https://github.com/YoojLee/paper_review/assets/52986798/4133f5cb-d108-472c-86a5-2db4f4983933)
## Summary
From open-vocabulary image classification models (VLMs) such as CLIP, into a two-stage detector…
-
Thanks for sharing this wonderful work. The paper differentiates GLIP from GroundingDINO and FIBER: the former is classified as open-vocabulary object detection, while the latter are named bi-functional m…
-
Hi guys! Please comment with any missing papers in this issue. We will check and add them accordingly.
lxtGH updated
3 months ago
-
Hello, I downloaded the pre-trained weights of yoloworld for inference. The results are consistent with the results in the README.md. However, how can I reproduce the results in Table 2 in the paper: YOLO…