In this project, we developed a lightweight web service in which several modules, integrated with existing ones, detect the faces present in an input image and then compare them against the user's reference images stored in the database. The output includes the name, coordinates, and sentiment of each detected face, along with a brief description of the scene's context.
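To make the output described above concrete, here is a hypothetical example of the kind of response the service might return. The field names and values are illustrative only, not the app's actual schema:

```python
# Hypothetical response for one detected face; the real app's field names
# and structure may differ.
result = {
    "faces": [
        {
            "name": "Alice",  # identity matched against the user's stored images
            "region": {"x": 34, "y": 52, "w": 128, "h": 128},  # face coordinates
            "emotion": "happy",  # sentiment of the detected face
        }
    ],
    # scene description generated by the captioning model
    "caption": "two people sitting on a bench in a park",
}

print(result["faces"][0]["name"])  # → Alice
```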
To use this app, the user first creates an account and uploads the reference images they want incoming images to be compared against. Then, whenever they wish to evaluate or describe an image, they simply enter the image's URL and wait for the result to appear on the screen. Face detection and analysis are handled by the deepface framework, and the context of the image is predicted and described using CATR (Image Captioning with Transformers), as implemented in this repository.
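The overall pipeline can be sketched with placeholder functions. Note that `detect_faces`, `match_identity`, and `caption_scene` are hypothetical stand-ins for the deepface and CATR calls the real app makes; this is only an illustration of the flow, not the project's actual code:

```python
# Sketch of the describe pipeline. All three helpers below are stand-ins:
# the real app delegates these steps to deepface and the CATR model.

def detect_faces(image_url):
    # Real app: deepface locates faces in the image fetched from the URL.
    return [{"region": (34, 52, 128, 128)}]

def match_identity(face, user_db):
    # Real app: deepface compares the face against the user's stored images.
    return user_db[0]["name"] if user_db else "unknown"

def caption_scene(image_url):
    # Real app: the CATR transformer generates a scene description.
    return "a person standing outdoors"

def describe(image_url, user_db):
    faces = detect_faces(image_url)
    for face in faces:
        face["name"] = match_identity(face, user_db)
    return {"faces": faces, "caption": caption_scene(image_url)}

output = describe("https://example.com/photo.jpg", [{"name": "Alice"}])
print(output["caption"])
```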
You can also find a short clip illustrating the appearance of the app here.
git clone https://github.com/behzadshomali/Image-Describe-Pipe.git
cd Image-Describe-Pipe/
python3 -m venv <VIRTUALENV_NAME> ; source <VIRTUALENV_NAME>/bin/activate
pip3 install -r requirements.txt
Run the container associated with the database
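The repository does not specify the database image here; assuming a Dockerized MySQL instance, the step might look like the following (image name, container name, port, and password are all placeholders):

```shell
# Placeholder command: substitute the actual image, port mapping, and
# credentials used by the project's database setup.
docker run -d --name describe-db -p 3306:3306 \
    -e MYSQL_ROOT_PASSWORD=<PASSWORD> mysql
```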
Run the flask server on your local machine:
python3 main.py
Then open 127.0.0.1:5000/home in your browser