multimodal-llm Search Results

1000+ results
for multimodal-llm

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

modelscope/ms-swift #1115

glm4v加载保存的checkpoint继续训练时报错

**Describe the bug** What the bug is, and how to reproduce, better with screenshots(描述bug以及复现过程，最好有截图) 如题，用swift进行glmv4的微调，训练一段时间后因OOM断掉了，因此想接着上一个checkpoint来继续训练，但会有报错信息。训练命令如下： `NPROC_PER_NODE…

Marcovaldon updated 4 months ago
3
ManifoldRG/MultiNet #83

Research VLA to VL mapping

Research (both literature review and architectural scoping) around utilizing VLA models for Vision-Language or sole Language fine-tuning and inference.

pranavguru updated 2 months ago
1
spring-projects/spring-ai #144

Add support for GPT-4 with Vision

Having experimented with OpenAI's `GPT-4 with Vision` API, it would be amazing if Spring AI adds support for image-based input data (e.g. photos). This API allows you to post: - One or more images …

ghillert updated 3 months ago
7
intel-analytics/ipex-llm #11750

Please provide a method to benchmark Multimodal InternVL-4B …

Model link：[OpenGVLab/InternVL2-4B · Hugging Face](https://huggingface.co/OpenGVLab/InternVL2-4B)

zhouzhaojing updated 2 months ago
6
camel-ai/camel #454

[Roadmap] Multimodal Agent Roadmap

### Required prerequisites - [X] I have searched the [Issue Tracker](https://github.com/camel-ai/camel/issues) and [Discussions](https://github.com/camel-ai/camel/discussions) that this hasn't alre…

zechengz updated 4 months ago
4
irthomasthomas/undecidability #725

Anthropic cookbook: using sub-agents with claude-3

- [ ] [anthropic-cookbook/multimodal/using_sub_agents.ipynb at main · anthropics/anthropic-cookbook](https://github.com/anthropics/anthropic-cookbook/blob/main/multimodal/using_sub_agents.ipynb?short_…

irthomasthomas updated 7 months ago
1
NVIDIA/NeMo #8898

The error in loading Llama pretrain checkpoint for NeVa(LLAV…

when I train the Neva model, I got following error >> [NeMo I 2024-04-12 03:38:58 neva_model:252] Loading LLM weights from checkpoint /home/nemo/llama_weights/vicuna-2-7b.nemo Loading distributed …

WeianMao updated 2 months ago
5
NVIDIA/TensorRT-LLM #1190

[BUG] close_ipc_memory Error Exception ignored in: <function…

### System Info - CPU: x86_64 - GPU: A30 - Container: nvcr.io/nvidia/tritonserver:24.01-trtllm-python-py3 - PyTorch: 2.2.1 - tensorrt_llm: 0.9.0.dev2024022700 - tensorrt: 9.2.0.post12.dev5 -…

DefTruth updated 3 months ago
5
vllm-project/vllm #6187

[Feature]: lazy import for VLM

### 🚀 The feature, motivation and pitch I used [vLLM 0.5.0.post1](https://github.com/vllm-project/vllm/releases/tag/v0.5.0.post1) for `Mixtral-8x7B-Instruct-v0.1` inference ```bash python3 -m vll…

zhyncs updated 3 months ago
2
GoogleCloudPlatform/generative-ai #688

[Bug]: Grounding with Gemini doesn't work, no metadata is re…

### File Name https://github.com/GoogleCloudPlatform/generative-ai/blob/main/gemini/grounding/intro-grounding-gemini.ipynb ### What happened? Grounding with Gemini doesn't work. Gemini model doesn'…

sanjanalreddy updated 2 months ago
4

上一页 1...68 69 70 71 72 73 74...100 下一页

1000+ results for multimodal-llm

1000+ results
for multimodal-llm