explainers-by-googlers / prompt-api

A proposal for a web API for prompting browser-provided language models
Creative Commons Attribution 4.0 International

Add a multimodal API, such as using an image as part of the prompt #40

Open yaoyaoumbc opened 2 months ago

yaoyaoumbc commented 2 months ago

Gemini Nano XS is described as multimodal, but I could not find any corresponding API in Chrome on desktop. Could you add such an API? Thank you.