kingheadcat commented 3 years ago

Face recognition

Face verification vs. face recognition

Verification (Binary Classification)

Input image, name/ID
Output whether the input image is that of the claimed person
Recognition (Multi Class Classification)
Has a database of K persons
Get an input image
ID Output ID if the image is any of the K persons (or “not recognized”)

Multi Class Classification vs Multi Label Classification

ref: scikit-learn

One-shot learning: intro

하나의 example만으로 classifier를 학습하는 연구.

few-shot learning으로 추상되며, 보통 meta learning 방법론을 참조한다.

강의에서는 데이터들 간의 유사도 함수를 학습하는 형태의 방법론을 보였다.

One-shot learning: Meta Learning

Meta refers to a level above. 목표과제(labeled data)와는 간접적일 수 있는 목표함수(meta)를 학습하는 방법론 (한국웹에서는 보통 학습/해결 방법을 학습하는 방법 이라고 소개한다.) 일반적인 ML에서는 모델의 결과와, 데이터의 레이블에 기반한 loss(cost) function을 만들어 최소화 시키는 방법으로 학습하는데 $$ f_{loss}(y, \hat{y}) $$ Meta leaning 에서는 보통 문제를 해결하기 위한 다른 함수(e.g. 유사도)를 정의하고 해당 함수의 기능을 최대화하는 방식으로 학습한 후 해당 함수를 이용하며 문제를 해결한다.

One-shot learning: refs

refs

Triplets loss: intro

Triplets loss: code

def batch_all_triplet_loss(labels, embeddings, margin, squared=False):
    """Build the triplet loss over a batch of embeddings.

    We generate all the valid triplets and average the loss over the positive ones.

    Args:
        labels: labels of the batch, of size (batch_size,)
        embeddings: tensor of shape (batch_size, embed_dim)
        margin: margin for triplet loss
        squared: Boolean. If true, output is the pairwise squared euclidean distance matrix.
                 If false, output is the pairwise euclidean distance matrix.

    Returns:
        triplet_loss: scalar tensor containing the triplet loss
    """
    # Get the pairwise distance matrix
    pairwise_dist = _pairwise_distances(embeddings, squared=squared)

    anchor_positive_dist = tf.expand_dims(pairwise_dist, 2)
    anchor_negative_dist = tf.expand_dims(pairwise_dist, 1)

    # Compute a 3D tensor of size (batch_size, batch_size, batch_size)
    # triplet_loss[i, j, k] will contain the triplet loss of anchor=i, positive=j, negative=k
    # Uses broadcasting where the 1st argument has shape (batch_size, batch_size, 1)
    # and the 2nd (batch_size, 1, batch_size)
    triplet_loss = anchor_positive_dist - anchor_negative_dist + margin