-
**Describe the bug**
What the bug is, and how to reproduce, better with screenshots(描述bug以及复现过程,最好有截图)
如题,用swift进行glmv4的微调,训练一段时间后因OOM断掉了,因此想接着上一个checkpoint来继续训练,但会有报错信息。
训练命令如下:
`NPROC_PER_NODE…
-
Research (both literature review and architectural scoping) around utilizing VLA models for Vision-Language or sole Language fine-tuning and inference.
-
Having experimented with OpenAI's `GPT-4 with Vision` API, it would be amazing if Spring AI adds support for image-based input data (e.g. photos). This API allows you to post:
- One or more images
…
-
Model link:[OpenGVLab/InternVL2-4B · Hugging Face](https://huggingface.co/OpenGVLab/InternVL2-4B)
-
### Required prerequisites
- [X] I have searched the [Issue Tracker](https://github.com/camel-ai/camel/issues) and [Discussions](https://github.com/camel-ai/camel/discussions) that this hasn't alre…
-
- [ ] [anthropic-cookbook/multimodal/using_sub_agents.ipynb at main · anthropics/anthropic-cookbook](https://github.com/anthropics/anthropic-cookbook/blob/main/multimodal/using_sub_agents.ipynb?short_…
-
when I train the Neva model, I got following error
>> [NeMo I 2024-04-12 03:38:58 neva_model:252] Loading LLM weights from checkpoint /home/nemo/llama_weights/vicuna-2-7b.nemo
Loading distributed …
-
### System Info
- CPU: x86_64
- GPU: A30
- Container: nvcr.io/nvidia/tritonserver:24.01-trtllm-python-py3
- PyTorch: 2.2.1
- tensorrt_llm: 0.9.0.dev2024022700
- tensorrt: 9.2.0.post12.dev5
-…
-
### 🚀 The feature, motivation and pitch
I used [vLLM 0.5.0.post1](https://github.com/vllm-project/vllm/releases/tag/v0.5.0.post1) for `Mixtral-8x7B-Instruct-v0.1` inference
```bash
python3 -m vll…
-
### File Name
https://github.com/GoogleCloudPlatform/generative-ai/blob/main/gemini/grounding/intro-grounding-gemini.ipynb
### What happened?
Grounding with Gemini doesn't work. Gemini model doesn'…