너의 알약이 보여 (I Can See Your Pill) - by Team Medic (CV-16)

📚 Project Overview

Project Period: 2022.04.07 ~ 2022.06.10
Project Presentation Video: Link to YouTube
Project Presentation File: CV_16조_알약분류_최종프로젝트 발표자료.pdf
Project Wrap-up Report: 최종 프로젝트_CV_16_Wrap UP Report.pdf

👀 너의 알약이 보여 💊

Reverse Pill Image Search using Metric Learning
Example of the Streamlit app running

프로젝트 시연.gif
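
As a rough illustration of what reverse image search with metric learning involves, the sketch below ranks a gallery of pill embeddings by cosine similarity to a query embedding. It is a minimal sketch under stated assumptions, not the project's actual retrieval code: the function name `top_k_matches`, the two-dimensional toy embeddings, and the labels `pill_A` to `pill_C` are all hypothetical, and a real system would obtain the embeddings from a trained model.

```python
import numpy as np


def top_k_matches(query, gallery, labels, k=5):
    """Return the k (label, score) pairs most similar to the query embedding."""
    # L2-normalize so that the dot product equals cosine similarity.
    q = query / np.linalg.norm(query)
    g = gallery / np.linalg.norm(gallery, axis=1, keepdims=True)
    scores = g @ q
    # Sort by descending similarity and keep the first k indices.
    order = np.argsort(-scores)[:k]
    return [(labels[i], float(scores[i])) for i in order]


# Hypothetical toy gallery: one embedding per reference pill image.
gallery = np.array([[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]])
labels = ["pill_A", "pill_B", "pill_C"]

matches = top_k_matches(np.array([0.9, 0.1]), gallery, labels, k=2)
print(matches)
```

Returning several ranked candidates instead of a single label fits a search-style interface, where the user picks the correct pill from a short list.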

😎 Members

권순호	서다빈	서예현	이상윤	전경민

🤗 Contribution

권순호: FastAPI, BentoML, streamlit, GCP, OCR, Text Recognition
서다빈: FastAPI, streamlit, OCR, Text Recognition
서예현: Data EDA, Data Pre-processing, Image Classification, Custom Dataset Production
이상윤: Metric learning, Segmentation, Database, Docker
전경민: Data EDA, Data Pre-processing, Data Annotation, OCR, Text Recognition

❓ About This Project

Purpose

An AI service that identifies a pill from a user-supplied pill image.

Objective

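Identify the appearance, dosage form, and color of the pill in the user's image, then search for pills that match those attributes to determine which pill it is.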
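
The search step in the objective above can be pictured as an attribute filter over a pill catalog. The following is a minimal sketch, assuming the predicted attributes arrive as plain strings; `PillRecord`, `filter_candidates`, the field names, and the sample entries are hypothetical and do not reflect the project's actual database schema.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PillRecord:
    """Hypothetical catalog entry holding the attributes used for search."""
    name: str
    shape: str
    form: str
    color: str


def filter_candidates(catalog, shape=None, form=None, color=None):
    """Keep records matching every attribute that was actually predicted."""
    wanted = {"shape": shape, "form": form, "color": color}
    return [
        record
        for record in catalog
        # An attribute left as None is treated as "unknown" and not filtered on.
        if all(v is None or getattr(record, k) == v for k, v in wanted.items())
    ]


# Hypothetical sample data for illustration only.
catalog = [
    PillRecord("sample_pill_1", "round", "tablet", "white"),
    PillRecord("sample_pill_2", "oval", "tablet", "white"),
    PillRecord("sample_pill_3", "round", "capsule", "red"),
]

print(filter_candidates(catalog, shape="round", color="white"))
```

Treating a missing prediction as a wildcard lets the search degrade gracefully when one attribute cannot be identified from the image.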

Target Audience

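People who have difficulty visiting a pharmacy or hospital because of geographic or physical constraints
People who have pills on hand but cannot tell them apart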

Background Information

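We set out to prevent medication accidents caused by the prescription errors and mix-ups that occur from time to time.
According to people working in healthcare, some elderly patients visit a hospital just to find out what a pill is, and statistics for Korea likewise show that accidents caused by taking the wrong medication have not declined but remain at a steady level.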

🗺 Service Architecture

Service Architecture

💾 Datasets

의약품 안전나라 (Korean drug safety portal) data (Link)
ePillID Benchmark (Link)
Other image data (Link)
- License-free images that the team photographed and collected for Classification and Object Detection

💻 Development Environment

GPU: Tesla V100
OS: Ubuntu 18.04.5 LTS
CPU: Intel Xeon
Python: 3.8.5 / 3.9.13

📁 Project Structure (Main branch)

final-project-level3-cv-16
├─ api_folder
│   ├─ .streamlit
│   │   └─ config.toml
│   ├─ backend
│   │   ├─ epillid_benchmark (cloned from Link)
│   │   ├─ Dockerfile
│   │   ├─ Backend.py
│   │   └─ requirements.txt
│   ├─ frontend
│   │   ├─ Dockerfile
│   │   ├─ frontend.py
│   │   └─ requirements.txt
│   └─ Docker
│       └─ docker-compose.yml
└─ image_classification
    ├─ data_preprocessing
    │   ├─ download_pill_data.py
    │   └─ normalize_pill_data.py
    ├─ image_concatenation
    │   └─ concatenation_images.py
    ├─ kaggle_pill_data_preprocessing
    │   ├─ 1_annotation_file_name_to_txt.py
    │   ├─ 2_edit_xml_path.py
    │   └─ 3_xml_to_json.py
    ├─ pill_excel_data
    │   └─ README.md
    ├─ .gitignore
    ├─ data.py
    ├─ dataset.py
    ├─ log.py
    └─ train.py

✏️ Evaluation

Top-1 accuracy: 43%
Top-5 accuracy: 80%
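
For readers unfamiliar with the metrics above, top-k accuracy counts a query as correct when the true label appears among the first k ranked predictions. The sketch below shows one way to compute it; `top_k_accuracy` and the toy labels are hypothetical and are not taken from the project's evaluation code.

```python
def top_k_accuracy(ranked_predictions, targets, k):
    """Fraction of samples whose true label is within the first k predictions."""
    hits = sum(
        1
        for ranked, target in zip(ranked_predictions, targets)
        if target in ranked[:k]
    )
    return hits / len(targets)


# Hypothetical ranked outputs for three queries whose true label is "a".
ranked = [["a", "b", "c"], ["b", "a", "c"], ["c", "b", "a"]]
targets = ["a", "a", "a"]

print(top_k_accuracy(ranked, targets, k=1))
print(top_k_accuracy(ranked, targets, k=3))
```

Top-5 accuracy is never lower than Top-1 accuracy on the same predictions, because any Top-1 hit is also a Top-5 hit.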

🚀 How to Start

Image Classification: Link
OCR: Link
Object Detection (yolov5): Link
Metric learning: Link

🔎 Future Research

Improve model accuracy and reduce inference time
Build a mobile application
Improve practicality
Apply OCR

📎 Appendix

📄 Experiments & Submission Report

📜 Reference

ePillID Dataset: A Low-Shot Fine-Grained Benchmark for Pill Identification (Link)
YOLACT: Real-time Instance Segmentation (Link)
How to make deep-text-recognition-benchmark model to recognize both Korean and English (Link)
OCR Model (Link)
Jaccard Similarity (Link)
Text-Recognition Model (Link)
Background-Removal program (Link)
Object Detection model YOLOv5 (Link)
timm (Link)
