Samagra-Development / ai-tools

AI Tooling to bootstrap applications fast
41 stars, 109 forks

Fine tune PaliGemma to Recognize Issue Dates on Cheques Goal #320

Open GautamR-Samagra opened 3 weeks ago

GautamR-Samagra commented 3 weeks ago

Goal

Fine-tune the PaliGemma model to accurately recognize and extract issue dates from images of cheques using entity recognition techniques.

Description

The objective is to test whether the PaliGemma model's capabilities extend to entity recognition, specifically extracting issue dates from cheque images. As a practical use case, the model will be fine-tuned on a publicly available cheque dataset, focusing exclusively on the issue date entity.
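As a rough illustration of the data side, one way to frame issue-date extraction for fine-tuning is as image/prompt/target records. The sketch below assumes a JSONL layout with `image`, `prefix`, and `suffix` fields; the prompt wording, file names, and ISO date format are hypothetical and not taken from any particular cheque dataset.

```python
import json
from datetime import date

# Hypothetical prompt; the wording actually used for fine-tuning is not specified here.
PROMPT = "extract issue date"


def make_example(image_path: str, issue_date: date) -> dict:
    """Build one training record: image path, text prompt, and target date string."""
    return {
        "image": image_path,
        "prefix": PROMPT,
        # ISO 8601 is an assumed target format, chosen so every label has the same shape.
        "suffix": issue_date.isoformat(),
    }


def to_jsonl(examples: list[dict]) -> str:
    """Serialize records as one JSON object per line."""
    return "\n".join(json.dumps(e) for e in examples)


# Placeholder file names and dates, for illustration only.
examples = [
    make_example("cheque_0001.jpg", date(2024, 3, 15)),
    make_example("cheque_0002.jpg", date(2023, 11, 2)),
]
print(to_jsonl(examples))
```

Keeping the target to a single normalized date string means a prediction can be scored by exact match against the label.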

Implementation Details

It'll include the following:

Organization Name

SamagraX

Domain

Document Processing

Tech Skills Needed

Python, Machine Learning, Entity Recognition, Image Processing, Deep Learning

Category

Feature

Feature

Entity Recognition

Mentor(s)

@kartikbhtt7

Complexity

Simple

Saswatsusmoy commented 2 weeks ago

Hi,

I've been working on this issue and recently completed the pre-processing part. I'm now thinking of using "PaliGemma-3b-pt-224" for fine-tuning. I don't have much access to GPUs personally; is there a way I can work on this with at least 8-12 GB of VRAM?

kartikbhtt7 commented 2 weeks ago

Hey @Saswatsusmoy, you can fine-tune the '224' model in a Google Colab or Kaggle notebook just by freezing some of the parameters. Refer to this notebook -> https://colab.research.google.com/github/google/generative-ai-docs/blob/main/site/en/gemma/docs/paligemma/fine-tuning-paligemma.ipynb#scrollTo=rv7w-cGuLj5o
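The idea of freezing parameters can be sketched without any ML framework: mark only a subset of parameter groups as trainable and leave the rest fixed, so gradients and optimizer state are needed only for that subset. The group names and sizes below are made-up placeholders, and training only the attention weights is an assumed policy for illustration; check the linked notebook for what it actually freezes.

```python
# Placeholder parameter groups (name -> number of values); not real PaliGemma shapes.
PARAMS = {
    "img/encoder": 400_000_000,
    "llm/embedder": 500_000_000,
    "llm/layers/attn": 300_000_000,
    "llm/layers/mlp": 1_800_000_000,
}


def is_trainable(name: str) -> bool:
    """Assumed policy: update only the language model's attention weights."""
    return name.startswith("llm/layers/attn")


def trainable_mask(params: dict[str, int]) -> dict[str, bool]:
    """Map each parameter group to True (train) or False (frozen)."""
    return {name: is_trainable(name) for name in params}


def trainable_fraction(params: dict[str, int]) -> float:
    """Share of all parameter values that would receive gradient updates."""
    mask = trainable_mask(params)
    trainable = sum(n for name, n in params.items() if mask[name])
    return trainable / sum(params.values())


mask = trainable_mask(PARAMS)
print(mask)
print(f"trainable fraction: {trainable_fraction(PARAMS):.1%}")
```

In a real training setup a mask like this would be handed to the optimizer (or used to switch off gradients for frozen groups), which is where the memory savings on a small GPU come from.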