Open danilojsl opened 1 month ago
This pull request introduces the BLIPForQuestionAnswering transformer, enabling enhanced image-based question-answering capabilities.
Usage Instructions To utilize this new transformer, a DataFrame with the following structure is required:
image
text
Enhance Spark NLP with visual transformer capabilities.
Description
This pull request introduces the BLIPForQuestionAnswering transformer, enabling enhanced image-based question-answering capabilities.
Usage Instructions To utilize this new transformer, a DataFrame with the following structure is required:
image
column: Contains the file paths for each image within the directory.text
column: Includes the specific question you would like to ask about each corresponding image.Motivation and Context
Enhance Spark NLP with visual transformer capabilities.
How Has This Been Tested?
Screenshots (if appropriate):
Types of changes
Checklist: