This project implements a deep learning model that performs face recognition using super-resolution techniques to enhance face images acquired by a very low-resolution camera or from a long distance. Our hypothesis is that by increasing the image resolution we can leverage more information and build a model that performs better on the face recognition task than a model that uses the low-resolution images directly.
We describe and evaluate several upscaling methods and compare them with a baseline model that does not use super-resolution. Moreover, we propose and test two Generative Adversarial Networks (GANs) able to upscale images whose resolution is lower than the one used by the most popular state-of-the-art models.
The proposed system performs the open set identification task and its architecture is as follows:
To perform the face localization task, two different techniques are compared:
The cropped faces are then upscaled from 32×32 to 128×128 pixels, comparing 6 different approaches:
Finally, the upscaled faces are processed by our simple Face Recognition model based on the ResNet architecture, which was implemented solely to compare how a baseline model performs with the different versions of the input images.
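The exact architecture and training details are described in the report; purely as an illustration, a ResNet-style classifier over 128×128 face crops could be sketched in Keras as follows (the layer sizes and the `num_identities` value are placeholders, not the ones used in this project):

```python
import tensorflow as tf
from tensorflow.keras import layers, models

def residual_block(x, filters):
    """Basic residual block with a 1x1 projection shortcut."""
    shortcut = layers.Conv2D(filters, 1, padding="same")(x)
    y = layers.Conv2D(filters, 3, padding="same", activation="relu")(x)
    y = layers.Conv2D(filters, 3, padding="same")(y)
    y = layers.Add()([shortcut, y])
    return layers.ReLU()(y)

def build_face_recognizer(num_identities, input_shape=(128, 128, 3)):
    """Small ResNet-style CNN that classifies an upscaled face crop into one of the gallery identities."""
    inputs = layers.Input(shape=input_shape)
    x = layers.Conv2D(32, 5, strides=2, padding="same", activation="relu")(inputs)
    for filters in (64, 128, 256):
        x = residual_block(x, filters)
        x = layers.MaxPooling2D()(x)
    x = layers.GlobalAveragePooling2D()(x)
    embedding = layers.Dense(128, activation="relu")(x)  # compact face embedding
    outputs = layers.Dense(num_identities, activation="softmax")(embedding)
    return models.Model(inputs, outputs)

model = build_face_recognizer(num_identities=100)  # hypothetical gallery size
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```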
In order to train and test the models, two different datasets were used:
An interactive Colab Notebook is available to follow the whole dataset processing, model training and evaluation. Moreover, a full Report.pdf and a Presentation.pdf are available in the repo.
For the face detection task, we took into consideration both the qualitative results and the processing speed of the two methods. The results obtained by the Haar Cascade Classifier and MTCNN are comparable, but we measured that the time required to process and extract faces from our dataset is much lower with the former. For this reason, we ultimately opted for the faster method, since we lose very little accuracy and save precious processing time.
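As an illustration of the two detectors, a minimal sketch using OpenCV's bundled Haar cascade and the `mtcnn` package might look like this (file names and parameters are illustrative, not necessarily those used in the project):

```python
import cv2
from mtcnn import MTCNN

image = cv2.imread("face.jpg")                       # hypothetical input image
gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)

# Haar Cascade Classifier: fast, CPU-only
cascade = cv2.CascadeClassifier(
    cv2.data.haarcascades + "haarcascade_frontalface_default.xml")
haar_boxes = cascade.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)

# MTCNN: deep-learning based, slower but more robust
detector = MTCNN()
mtcnn_boxes = [f["box"] for f in detector.detect_faces(
    cv2.cvtColor(image, cv2.COLOR_BGR2RGB))]

# Crop and resize the first Haar detection to the 32x32 low-resolution input
if len(haar_boxes) > 0:
    x, y, w, h = haar_boxes[0]
    face_lr = cv2.resize(image[y:y + h, x:x + w], (32, 32))
```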
In the following image we present a comparison of the results obtained with the different super-resolution techniques:
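Among the compared approaches, the Bilinear Interpolation baseline is the simplest to reproduce; a minimal OpenCV sketch (file names are hypothetical):

```python
import cv2

# Upscale a 32x32 face crop to 128x128 with plain bilinear interpolation
face_lr = cv2.imread("face_32x32.png")               # hypothetical low-resolution crop
face_up = cv2.resize(face_lr, (128, 128), interpolation=cv2.INTER_LINEAR)
cv2.imwrite("face_128x128_bilinear.png", face_up)
```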
As final results, we present the metrics achieved by our simple Face Recognition module, comparing its performance with different images as input. The original images are the raw images contained in the dataset, both in high resolution (Original-128) and low resolution (Original-32). These are compared with our two trained GANs, with a simple Bilinear Interpolation upscaling, and finally with the VGG-Face state-of-the-art model.
| Input | Recognition Rate | DIR@5 | DIR@15 | Genuine Recognition Rate (GRR) | Equal Error Rate (EER) | Best Threshold |
|---|---|---|---|---|---|---|
| Original-128 | 0.37 | 0.44 | 0.47 | 0.13 | 0.63 | 0.25 |
| Original-32 | 0.03 | 0.07 | 0.12 | 0.04 | 0.97 | 0.20 |
| Canny-GAN | 0.32 | 0.40 | 0.42 | 0.15 | 0.68 | 0.25 |
| GAN | 0.32 | 0.40 | 0.42 | 0.16 | 0.68 | 0.25 |
| Bilinear Interpolation | 0.31 | 0.38 | 0.40 | 0.20 | 0.68 | 0.25 |
| VGG-Face | 0.40 | 0.44 | 0.46 | 0.26 | 0.60 | 0.20 |
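For context, the rank-based metrics in the table can be computed from a probe–gallery similarity matrix along these lines. This is a simplified sketch that assumes DIR@k denotes the detection-and-identification rate at rank k; all variable names are illustrative:

```python
import numpy as np

def open_set_metrics(similarity, probe_ids, gallery_ids, threshold, k=5):
    """similarity: (num_probes, num_gallery) score matrix;
    probe_ids: true identity per probe (-1 for impostors not in the gallery);
    gallery_ids: identity of each gallery template."""
    similarity = np.asarray(similarity)
    probe_ids = np.asarray(probe_ids)
    gallery_ids = np.asarray(gallery_ids)

    genuine = probe_ids != -1
    ranked = np.argsort(-similarity, axis=1)             # best match first
    top1 = ranked[:, 0]
    accepted = similarity[np.arange(len(probe_ids)), top1] >= threshold

    # Rank-1 recognition rate: correct identity at rank 1 and score above threshold
    rr = np.mean(accepted[genuine] &
                 (gallery_ids[top1[genuine]] == probe_ids[genuine]))

    # DIR@k: correct identity anywhere in the top-k matches that pass the threshold
    hits = []
    for i in np.where(genuine)[0]:
        topk = ranked[i, :k]
        ok = (similarity[i, topk] >= threshold) & (gallery_ids[topk] == probe_ids[i])
        hits.append(ok.any())
    dir_at_k = np.mean(hits)

    # False acceptance rate on impostor probes; sweeping the threshold until
    # FAR equals the false rejection rate gives the EER
    far = np.mean(accepted[~genuine]) if (~genuine).any() else 0.0
    return rr, dir_at_k, far
```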
In this project, the following Python libraries were used: