abhisheks008 / DL-Simplified

Deep Learning Simplified is an Open-source repository, containing beginner to advance level deep learning projects for the contributors, who are willing to start their journey in Deep Learning. Devfolio URL, https://devfolio.co/projects/deep-learning-simplified-f013
https://quine.sh/repo/abhisheks008-DL-Simplified-499023976
MIT License
342 stars 294 forks source link

De-photobombing Project using DL #518

Closed abhisheks008 closed 3 months ago

abhisheks008 commented 3 months ago

Deep Learning Simplified Repository (Proposing new issue)

:red_circle: Project Title : De-photobombing Project using DL
:red_circle: Aim : The aim is to create a DL project where we can de-photobomb the images which are already photobombed.
:red_circle: Dataset : https://www.kaggle.com/datasets/vatsapatel09/image-de-photobombing-benchmark-dpd-300-dataset
:red_circle: Approach : Try to use 3-4 algorithms to implement the models and compare all the algorithms to find out the best fitted algorithm for the model by checking the accuracy scores. Also do not forget to do a exploratory data analysis before creating any model.


πŸ“ Follow the Guidelines to Contribute in the Project :


:red_circle::yellow_circle: Points to Note :


:white_check_mark: To be Mentioned while taking the issue :


Happy Contributing πŸš€

All the best. Enjoy your open source journey ahead. 😎

hiteshhhh007 commented 3 months ago

Full name : Hitesh K GitHub Profile Link : https://github.com/hiteshhhh007 Email ID : hiteshkrishna43@gmail.com Participant ID (if applicable): -NIL- Approach for this Project : I am going to use pre-trained Models using Gated Convolutional NN's to remove the photobombic elements from the image. Once the input image, and the corresponding masks is passed on to the model, the output would consist of the elements being removed. What is your participant role? Contributor under GSSoC'24

Kindly assign this issue to me under GSSoC'24.

abhisheks008 commented 3 months ago

@hiteshhhh007 this is a good approach. Can you mention the other two approaches for this project?

hiteshhhh007 commented 3 months ago

The next 2 approaches would be:

  1. To use image segmentation, based on the input mask (the ground truth), I can segment the masked parts from the original image, and will subtract off the unwanted elements i.e the photobombed elements.

  2. Use some pre-trained computer vision models to simplify the process .

I believe the latter would be the good approach as the pre-trained models are already quantised and the inference in real-time would not take much time, due to the pre-trained models having a reduced model size.

abhisheks008 commented 3 months ago

Chalo looks good to me! Issue assigned to you @hiteshhhh007. You can start working on it.

Adhivp commented 3 months ago

I will love to take over the issue , if @hiteshhhh007 is busy / has some difficulties. what do you suggest @abhisheks008

hiteshhhh007 commented 3 months ago

I was busy with my exams, since it got over I'll be working on this issue, I just wanted to know...do i have to implement all the 3 approaches or only the best approach among the 3, @abhisheks008 ?

Adhivp commented 3 months ago

Ok best of luck @hiteshhhh007

abhisheks008 commented 3 months ago

I was busy with my exams, since it got over I'll be working on this issue, I just wanted to know...do i have to implement all the 3 approaches or only the best approach among the 3, @abhisheks008 ?

All the three approaches needs to be implemented. After that conclude the best fitted model based on the accuracy scores.