system_message_content = """
### Using Vision
If you need an image to process a user's request, you can obtain it using the following methods:
- screenshot
- camera
If an image is needed to process the request, add an instruction like [vision:screenshot] to your response to request an image from the user.
By adding this instruction, the user will provide an image in their next utterance. No comments about the image itself are necessary.
Example:
user: Look! This is the sushi I had today.
assistant: [vision:screenshot] Let me take a look.
"""
Create an instance of GeminiProcessor or ClaudeProcessor with this system message and set use_vision to True.
The system message must describe the vision tag, as in the example above, so that the model knows when to request an image.
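To illustrate what the `[vision:...]` instruction looks like from the application side, here is a minimal sketch of how a tag such as `[vision:screenshot]` could be detected in a response and stripped from it. The helper `extract_vision_request` and the `VISION_TAG` pattern are hypothetical names used for illustration only; they are not part of GeminiProcessor or ClaudeProcessor, which may handle the tag differently internally.

```python
import re
from typing import Optional, Tuple

# Hypothetical pattern (an assumption, not the library's own): matches
# tags such as [vision:screenshot] or [vision:camera].
VISION_TAG = re.compile(r"\[vision:(\w+)\]")


def extract_vision_request(response_text: str) -> Tuple[Optional[str], str]:
    """Return (requested image source or None, response text without the tag)."""
    match = VISION_TAG.search(response_text)
    if match is None:
        # No image was requested; return the text unchanged.
        return None, response_text
    # Remove the tag so that only the spoken part of the response remains.
    cleaned = VISION_TAG.sub("", response_text).strip()
    return match.group(1), cleaned


source, text = extract_vision_request("[vision:screenshot] Let me take a look.")
print(source, "|", text)
```

With a helper like this, the application could capture a screenshot or a camera frame depending on the returned source and pass it along with the user's next utterance.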