gpt-4v Search Results - Githubissues

477 results
for gpt-4v


TheoKanning/openai-java #397

Add support for GPT-4V

Support chat completion content in the format messages=[ { "role": "user", "content": [ {"type": "text", "text": "What’s in this image…

wangbin77 updated 11 months ago
5
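
The message shape requested in this issue can be sketched as follows. This is a minimal illustration, not code from openai-java: the helper name `build_vision_message`, the placeholder image bytes, and the choice to embed the image as a base64 data URL are assumptions. The `image_url` part follows the OpenAI chat completions vision format; the issue snippet itself is truncated before that part.

```python
import base64
import json


def build_vision_message(prompt: str, image_bytes: bytes, mime: str = "image/png") -> list[dict]:
    """Return a one-message list holding a text part and an image part.

    Hypothetical helper for illustration only.
    """
    # Encode the raw image bytes so they can travel inside a JSON payload.
    encoded = base64.b64encode(image_bytes).decode("ascii")
    data_url = f"data:{mime};base64,{encoded}"
    return [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": prompt},
                {"type": "image_url", "image_url": {"url": data_url}},
            ],
        }
    ]


# Placeholder bytes stand in for a real image file read from disk.
messages = build_vision_message("What's in this image?", b"placeholder image bytes")
print(json.dumps(messages, indent=2))
```

The resulting list would be passed as the `messages` field of a chat completion request; a client library needs a content type that accepts a list of typed parts rather than a plain string.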
fulfulggg/Information-gathering #768

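Improving GUI Grounding via Iterative Narrowing

## Title: Improving GUI Grounding via Iterative Narrowing ## Link: https://arxiv.org/abs/2411.13591 ## Summary: GUI grounding is the task of identifying the precise location on an interface image from a natural-language query, and it is essential for improving the capabilities of vision-language model (VLM) agents. General-purpose VLMs such as GPT-4V perform well across a variety of tasks, but GUI…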

fulfulggg updated 1 week ago
2
vlf-silkie/VLFeedback #1

Impact of Including GPT-4V in LVLM Pool?

First and foremost, thank you for writing this paper; it was very intriguing and informative. I have a question that arose during my reading. What are the conceptual benefits when the supervisor mo…

Etelis updated 5 months ago
1
huggingface/trl #2136

[SFT VLM] Add support for Molmo models

### Feature request Extend the `sft_vlm.py` script to support the new Molmo models from AllenAI: https://huggingface.co/collections/allenai/molmo-66f379e6fe3b8ef090a8ca19 Paper: https://arxiv.org/…

lewtun updated 1 month ago
15
OthersideAI/self-operating-computer #207

[Question] If I want to use this project on Windows 10, what …

Found a bug? Please fill out the sections below. 👍 ### Describe the bug If I want to use this project on Windows 10, what Python version do I need to use? ### Steps to Reproduce 1…

xiaonan59 updated 1 month ago
3
matatonic/openedai-vision #14

Support video in MiniCPM-V 2.6

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video The claim is it performs very well for an 8 billion size model I am interested in learning what it takes to add suppor…

saket424 updated 2 months ago
10
continuedev/continue #1033

Adding Ctrl+v for vision models on Windows

### Validations - [ ] I believe this is a way to improve. I'll try to join the [Continue Discord](https://discord.gg/NWtdYexhMs) for questions - [X] I'm not able to find an [open issue](https://githu…

prefonty updated 1 week ago
3
yuecao0119/MMInstruct #2

How to use your data generation pipeline?

Thanks for your good work! Can you provide some guidance on how to use your data generation pipeline?

waltonfuture updated 3 weeks ago
1
poe-platform/fastapi_poe #89

How to use the API to call a multimodal model with a local image?

Hi, I'm using the poe api to call a multimodal model, like gpt-4v or claude3-opus. I refer to an example in the diagram, but I can't find the code on how to load the local image into the request. May …

HarryZhou-618 updated 6 days ago
12
leochen-g/wechat-assistant-pro #71

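Image recognition with Dify

## Using GPT-4V for image recognition ### Prerequisites 1. A membership on the 智能微秘书 platform 2. A token with gpt4 access ### How to enable GPT Chat Settings -> Custom Chat -> Add Configuration ![](https://img.aibotk.com/aibotk/help/7NbjFA20231213180652.png) ![](https://img.aibotk.c…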

leochen-g updated 3 months ago
4
