Closed isayahc closed 6 months ago
This PR is not finished. I Need to make it a draft again.
Still need to integrate the code to the existing codebase. @almutareb do i make a directory for the images, and have those image file locations as metadata. What other information should i put. Like in terms of the summary.
@isayahc yes, for now let's just use a folder, we can use an object store, e.g. S3 over minio, later. For the metadata we need to be able to construct a context for the generation, let's try the following: summary generated from the text referencing the image, summary and number of page it came from, section header, description. This will help provide context and align it with the generation.
@almutareb @vonderwoman i have notice there are instances where unrelated images are extracted (ie. logos, random desgin assests). We should use an image model to determine if the image is related to the text, and to help generate the image summary.
59 extract images
Description
Please include a brief description of the changes introduced by this PR.
Related Issue(s)
Changes Made
image_processing
module inside ofutils
caption_image
function ininnovation_pathfinder_ai/utils/image_processing/image_processing.py
Task in PRs
Checklist
Additional Notes
Reviewer(s)