gchochla / ccmi-explorecsr

Repository for workshop colabs and coding exercises
0 stars 0 forks source link

ExploreCSR Sp23 CV Notebook II #2

Closed gchochla closed 1 year ago

gchochla commented 1 year ago

Introduction to CV models:

gchochla commented 1 year ago

When notebook is ready, I'll share link here

gchochla commented 1 year ago

Utilities are up and seemingly running: https://colab.research.google.com/drive/1Gpe47AP378P0tsPA3ss0E_Imazu9llvx?usp=share_link @ketakilolage @Aditya-1500

ketakilolage commented 1 year ago

Project Ideas

  1. Image captioning
    https://github.com/sgrvinod/a-PyTorch-Tutorial-to-Image-Captioning

  2. Pedestrian detection

Datasets: https://www.cis.upenn.edu/~jshi/ped_html/; https://www.kaggle.com/datasets/karthika95/pedestrian-detection

  1. Google QuickDraw dataset
    https://github.com/googlecreativelab/quickdraw-dataset

  2. A deep learning approach for brain tumor detection using magnetic resonance imaging
    https://arxiv.org/ftp/arxiv/papers/2210/2210.13882.pdf
    "an architecture containing five convolution layers, five max-pooling layers, a Flatten layer, and two dense layers has been proposed for detecting brain tumors from MRI images. The proposed model includes an automatic feature extractor, modified hidden layer architecture, and activation function."

gchochla commented 1 year ago

Great! Try to implement one of these using the given utilities in the notebook. Example usage in "Examples of utilities" section. The notebook was set up to extract feature vectors from images (+ text optionally). You'll have to set up utilities on top of that for more intricate projects. Try to use sklearn on top of extracted features if possible

ketakilolage commented 1 year ago

Another repo for image captioning. This one doesn't mention the pytorch version, but I believe it should be a recent one since the repo is from 2022. https://github.com/amirhosein-mesbah/Deep_Learning/tree/main/Image_Captioning

I believe students would be able to change and replace anything outdated. The first repo is still better from a learning pov.