[SPARKNLP-1068] Introducing BLIPForQuestionAnswering transformer

Description

This pull request introduces the BLIPForQuestionAnswering transformer, which answers natural-language questions about images (visual question answering).

Usage Instructions

To use this transformer, provide a DataFrame with the following columns:

image column: Contains the file path of each image in the input directory.
text column: Contains the question to ask about the corresponding image.
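
As a rough illustration of the input layout described above, the following plain-Python sketch pairs each image file in a directory with a question, producing (image, text) rows. The helper name `build_vqa_rows` and the `IMAGE_SUFFIXES` filter are hypothetical and not part of this pull request; the sketch only prepares the rows and does not call the transformer itself.

```python
from pathlib import Path

# Assumption: which file extensions count as images; adjust as needed.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def build_vqa_rows(image_dir, question):
    """Return a list of (image_path, question) tuples, one per image file.

    Hypothetical helper: it mirrors the two-column layout (image, text)
    described above, asking the same question about every image.
    """
    rows = []
    # Sort for a deterministic row order across runs.
    for path in sorted(Path(image_dir).iterdir()):
        if path.is_file() and path.suffix.lower() in IMAGE_SUFFIXES:
            rows.append((str(path), question))
    return rows
```

Assuming an active SparkSession named `spark`, the rows could then be turned into a DataFrame with something like `spark.createDataFrame(rows, ["image", "text"])`. How the image column is subsequently loaded and passed to BLIPForQuestionAnswering depends on the pipeline configuration in this pull request and is not shown here.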

This change extends Spark NLP's visual transformer capabilities.

[ ] Bug fix (non-breaking change which fixes an issue)
[ ] Code improvements with no or little impact
[x] New feature (non-breaking change which adds functionality)
[ ] Breaking change (fix or feature that would cause existing functionality to change)