gpt-4v Search Results - Githubissues

475 results
for gpt-4v


scvready123/IterWeGO #1

Have you tried a multimodal LLM for this task?

ZihaoZheng98 updated 2 months ago
3
cogentapps/chat-with-gpt #201

Maintained fork: https://github.com/jp-ipu/chat-with-gpt

For those interested, I have forked chat-with-gpt and am trying to maintain a more up-to-date version at the following link: https://github.com/jp-ipu/chat-with-gpt. It's a low-priority task, but…

jp-ipu updated 10 months ago
2
jiayev/GPT4V-Image-Captioner #48

Clarify readme on Key requirements

At multiple points in the top-level README.md instructions, it says, "Enter the official OpenAI or third-party GPT-4V API Key and API Url at the top of the interface." I was put off using this c…

ppbrown updated 5 months ago
3
OthersideAI/self-operating-computer #177

[BUG] ModuleNotFoundError: No module named 'pkg_resources'

### Describe the bug When I launch “operate”, the console gives an error message: Traceback (most recent call last): File "<frozen runpy>", line 198, in _run_module_as_main File "<frozen runpy>", line 88, in _run_c…

lescratheurjar updated 6 months ago
4
Daisie-Bell/DataModels #3

Metamersion Bot Specification

**Datamodels needed:** OpenAI ElevenLab Text to Speech VLM - visual language model (OpenAI GPT-4V) Whisper Speech to Text Basis for bot behavior OpenAI GPT-4 phenomenological problem interviewer prom…

elacosse updated 10 months ago
2
TaskingAI/TaskingAI #75

Feature Request: Integration of Multimodal LLMs

I want to suggest a significant enhancement that could vastly expand the capabilities of TaskingAI - the integration of multimodal Large Language Models (LLMs), particularly those akin to GPT-4V, whic…

CaseyJordan897 updated 5 months ago
1
ishan0102/vimGPT #14

Open source models

Try to sub out GPT-4V for an open source model like CogVLM, Fuyu-8B, Qwen-VL, or LLaVA.

ishan0102 updated 12 months ago
1
Sunwood-ai-labs/Yukihiko #52

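Wolf: Captioning Everything with a World Summarization Framework

## Title: Wolf: Captioning Everything with a World Summarization Framework ## Link: https://arxiv.org/abs/2407.18908 ## Summary: We propose Wolf, a world summarization framework for accurate video captioning. Wolf is an automated captioning framework that adopts a mixture-of-experts approach, leveraging the complementary strengths of vision language models (VLMs)…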

yukihiko-fuyuki updated 4 months ago
2
eth-sri/lmql #266

Vision Support

Are there any plans to integrate image input into LMQL? With the new GPT-4V and open-source lightweight vision language models such as [MPlug-Owl](https://github.com/X-PLUG/mPLUG-Owl) it would be incr…

ambroser53 updated 1 year ago
1
mnotgod96/AppAgent #29

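After manually entering the workflow and typing stop, why does it report "Resource not found" right away while waiting for the model's learning process?

Taking the JD.com (京东) app as an example, the following error is reported: Waiting for GPT-4V to generate documentation for the element androidx.recyclerview.widget.RecyclerView_1080_363_NewAppcenter_android.widget.RelativeLayout_204_161_2 Resource not fou…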

who52023 updated 10 months ago
1
