Add Computer Vision Classification Template with MobileNetV3 Baseline

j-cunanan commented 1 week ago

Overview

This pull request introduces a new template for computer vision classification tasks, along with a strong baseline implementation using MobileNetV3. The additions have been thoroughly tested end-to-end to ensure reliability and ease of use.

Key Features

CV Classification Template: A reusable template for computer vision classification tasks, promoting consistency and efficiency in future projects.
MobileNetV3 Baseline: Implementation of a strong baseline model using MobileNetV3, known for its efficiency and performance in various computer vision applications.
End-to-End Testing: Comprehensive testing to validate the entire pipeline from data input to model output.

Technical Details

The model is based on the PyTorch torchvision implementation (https://github.com/pytorch/vision/blob/main/torchvision/models/mobilenetv3.py#L117) and can load pre-trained weights if needed.

Sample paper outputs

adaptive_channel_thresholding_mobilenetv3.pdf mb4-pre.pdf

I welcome any feedback or suggestions for improvement. Please let me know if you need any additional information or clarification.

conglu1997 commented 2 days ago

Thanks so much for the contribution! Could you remove the changes to the README and follow the new contribution style? The template.tex file should have more appropriate base cites. There is also an incorrect modification to the nanogpt ideas.

j-cunanan commented 2 days ago

Thanks for the feedback, @conglu1997! I've made changes based on your suggestions above. Let me know if there's anything else I should change :)

SakanaAI / AI-Scientist