Open NarcissusInMirror opened 5 years ago
The degradation problem suggests that the solvers might have difficulties in approximating identity mappings by multiple nonlinear layers. 衰退问题意味着,程序很难不擅长通过多个非线性层构成一个恒等映射
cited 456 times up to 19.6.20
The fundamental challenge in visual recognition is modeling the intra-class appearance and shape variation of objects.
In this work, we focus on weakly supervised learning where only image-level labels indicating the presence or absence of objects are required.
本篇文章中提到的弱监督学习就是只有一个image-level的标签,而不用bounding box给出具体的目标在哪里,其实毕设做的多标签分类,和很多图片分类问题都属于这种。文章中提到主要的困难是目标形态多样,被遮挡等等。其实之前做的都涉及到这些问题了,但没有考虑
The proposed method can also be seen as a variant of Multiple Instance Learning [21, 29, 54] if we refer to each image as a “bag” and treat each image window as a “sample”
We build on the fully supervised network architecture of [37] that consists of five convolutional and four fully connected layers and assumes as input a fixed-size image patch containing a single relatively tightly cropped object.
cited 1 time up to 19.8.14
cited 0 time up to 19.8.14
本论文使用的方法不使用anchor,是“一刀流”的
The proposed method is anchorfree and belongs to the one-stage category.
Face detection is the prerequisite to some face-related applications, such as face alignment and face recognition. 人脸检测是很多问题的前置步骤,先检测,再……
Specifically, we rethink the importance of receptive field (RF) and effective receptive field (ERF) in the background of face detection. Essentially, the RFs of neurons in a certain layer are distributed regularly in the input image and theses RFs are natural “anchors”. Combining RF “anchors” and appropriate RF strides, the proposed method can detect a large range of continuous face scales with 100% coverage in theory.
阅读论文笔记