JohnSnowLabs / spark-nlp

State of the Art Natural Language Processing
https://sparknlp.org/
Apache License 2.0

[SPARKNLP-1068] Introducing BLIPForQuestionAnswering transformer #14422

Open danilojsl opened 1 month ago

danilojsl commented 1 month ago

Description

This pull request introduces the BLIPForQuestionAnswering transformer, which answers natural-language questions about an image (visual question answering).

Usage Instructions

To use this transformer, a DataFrame with the following structure is required:
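The structure itself is not reproduced in this description, so what follows is only a minimal sketch under stated assumptions, not the PR's own example. It assumes the input DataFrame pairs the `image` struct column produced by Spark's built-in image data source with a string column holding the question. The column name `text`, the output column names, the helper function names, and the use of the default `pretrained()` model are all assumptions made for illustration. Spark and Spark NLP imports are deferred into the functions so the module can be imported without a cluster.

```python
# Hypothetical sketch: column names and helper names are assumptions,
# not taken from the pull request.
IMAGE_COLUMN = "image"    # column produced by Spark's image data source
QUESTION_COLUMN = "text"  # assumed name of the question column


def expected_columns():
    """Return the input columns this sketch assumes the transformer needs."""
    return [IMAGE_COLUMN, QUESTION_COLUMN]


def load_images_with_question(spark, images_path, question):
    """Read images from a folder and attach the same question to every row."""
    from pyspark.sql.functions import lit

    image_df = (
        spark.read.format("image")
        .option("dropInvalid", True)  # skip files that are not valid images
        .load(images_path)
    )
    return image_df.withColumn(QUESTION_COLUMN, lit(question))


def build_vqa_pipeline():
    """Assemble an ImageAssembler -> BLIPForQuestionAnswering pipeline."""
    from pyspark.ml import Pipeline
    from sparknlp.base import ImageAssembler
    from sparknlp.annotator import BLIPForQuestionAnswering

    image_assembler = (
        ImageAssembler()
        .setInputCol(IMAGE_COLUMN)
        .setOutputCol("image_assembler")
    )
    # Assumes a default pretrained model is published for this annotator.
    vqa = (
        BLIPForQuestionAnswering.pretrained()
        .setInputCols("image_assembler")
        .setOutputCol("answer")
    )
    return Pipeline(stages=[image_assembler, vqa])
```

With a running session, usage would look roughly like `df = load_images_with_question(spark, "path/to/images", "What's this picture about?")` followed by `build_vqa_pipeline().fit(df).transform(df).select("answer.result")`; the path and question here are placeholders.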

Motivation and Context

Enhance Spark NLP with visual transformer capabilities.

How Has This Been Tested?

Screenshots (if appropriate):

Types of changes

Checklist: