Open GautamR-Samagra opened 3 weeks ago
Hi,
I've been working on this issue and recently completed the pre-processing part. Now I'm thinking of using "PaliGemma-3b-pt-224" for fine tuning, Now I don't have much access to GPUs personally is there a way in which I can work upon using atleast 8-12 GB of VRAM.
Hey @Saswatsusmoy, you can fine tune '224' model in google collab or kaggle notebook only just by freezing out some parameters. refer to this nb -> https://colab.research.google.com/github/google/generative-ai-docs/blob/main/site/en/gemma/docs/paligemma/fine-tuning-paligemma.ipynb#scrollTo=rv7w-cGuLj5o
Goal
Fine-tune the PaliGemma model to accurately recognize and extract issue dates from images of cheques using entity recognition techniques.
Description
The objective is to test PaliGemma model's capabilities to include entity recognition for the specific task of extracting issue dates from cheque images. A practical use case will be implemented by fine-tuning the model on a publicly available cheque dataset focusing exclusively on the issue date entity.
Implementation Details
It'll include the following:
Organization Name
SamagraX
Domain
Document Processing
Tech Skills Needed
Python, Machine Learning, Entity Recognition, Image Processing, Deep Learning
Category
Feature
Feature
Entity Recognition
Mentor(s)
@kartikbhtt7
Complexity
Simple