xwnb / GitGeminiHub

This is GitGeminiHub built by [GitHub Action] and [Google Gemini] API to achieve the automatically response by Issue
MIT License
4 stars 1 forks source link

Example: OCR (English) #13

Closed xwnb closed 6 months ago

xwnb commented 6 months ago

Let's read images

Prompt

- More: [Introduction to prompt design](https://ai.google.dev/docs/prompt_intro)

Examples:
- What's in the picture?
- Write a short, engaging blog post based on this picture. It should include a description of the meal in the photo and talk about my journey meal prepping.
- ...

Please recognize the text in the picture and convert to markdown


Content

- Attach an image or multiples images list
- Paste, drop or click to add images between line marks "---"

image


github-actions[bot] commented 6 months ago
Prompt:
Please recognize the text in the picture and convert to markdown

Content:
![image](https://github.com/xwnb/GitGeminiHub/assets/45145954/7b54ee06-292e-409d-9a2f-5693c9861c06)

🎉 Task executed successfully. Here is the response:


Gemini API tutorials

The Gemini API gives you access to the latest generative AI models from Google. The Gemini quickstarts will help you get started with Gemini using a programming language of your choice.

You can also use curl commands to try out the Gemini API:

If you're new to generative AI models, you might want to look at the concepts guide and the Gemini API overview before trying a quickstart.

If you're ready to start programming, but you're not sure which language to use, try the Python quickstart. It's available as a Colab notebook, so you can run code from the browser with minimal setup.


😉 Task completed, and close ticket.