-
How can we fine-tune on text + image together?
-
Issue 1: When I run the command to view the demo: python demo/multi_modality_demo.py demo/data/kitti/000008.bin demo/data/kitti/000008.png demo/data/kitti/000008.pkl configs/mvxnet/mvxnet_fpn_dv…
-
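**What feature would you like, or what suggestions do you have?**
Support multimodal conversations: both the questions and the replies should support text, images, and audio.
Now that the official ChatGPT has launched multimodal features, I hope ChatGPT-Next-Web will plan to support multimodal conversation input and output in the future.
**Are there any comparable products for reference?**
The multimodal features of the official ChatGPT.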
-
Some models (e.g. InternVideo2 multi modality) depend on flash attention extensions. We would like to add additional outputs for:
fused_dense_lib: csrc/fused_dense_lib
layer_norm: csrc/layer_norm
-
Follow the guide to set up qwen-VL
ipex-llm version: 2.1.0b20240512
https://github.com/intel-analytics/ipex-llm/blob/main/python/llm/example/GPU/PyTorch-Models/Model/qwen-vl/chat.py
(qwen-vl) D:\…
-
## Description
This issue will be used to regroup most/all the relevant methods used for vertebral labeling.
## Awesome list
| Method name | Link | Open source | Task | Region | Modality | De…
-
Hello! I hope you're doing well. I'm interested in multi-ion radiotherapy (MIRT) and would like to know whether matRad can be used to create MIRT plans. Is there a way to implement this?
rega…
-
Thank you for your wonderful work!
Could you let me know how you obtained the final action scores (final top-1 accuracy) of the two- or four-stream network? As I have read in the …
-
### Prerequisite
- [X] I have searched [Issues](https://github.com/open-mmlab/mmdetection3d/issues) and [Discussions](https://github.com/open-mmlab/mmdetection3d/discussions) but cannot get the exp…
-
When I project the 3D bbox I read onto the image using the built-in method, there is an obvious misalignment. Why is this?
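
One way to narrow this down is to project the box corners by hand and compare the result with what the built-in method draws. The sketch below is only a minimal NumPy illustration, not the method of any particular library: the function name `project_points` and the toy matrix are hypothetical, and it assumes `proj_mat` is a 3x4 matrix that maps from the coordinate frame the corners are expressed in directly to pixels. If the corners are in a LiDAR frame but the matrix expects camera coordinates (or the reverse), a mismatch of this kind would be expected.

```python
import numpy as np


def project_points(points_3d, proj_mat):
    """Project N x 3 points to pixels with a 3 x 4 projection matrix.

    Assumption: proj_mat maps from the frame of points_3d to the image.
    Returns (uv, depth, valid); valid marks points in front of the camera.
    """
    points_3d = np.asarray(points_3d, dtype=float)
    proj_mat = np.asarray(proj_mat, dtype=float)

    # Append a column of ones to get homogeneous coordinates, shape (N, 4).
    ones = np.ones((points_3d.shape[0], 1))
    homo = np.hstack([points_3d, ones])

    # Apply the projection, shape (N, 3): (u * z, v * z, z).
    cam = homo @ proj_mat.T
    depth = cam[:, 2]
    valid = depth > 0

    # Divide by depth only where it is positive; other rows stay NaN.
    uv = np.full((points_3d.shape[0], 2), np.nan)
    uv[valid] = cam[valid, :2] / depth[valid, None]
    return uv, depth, valid


# Toy pinhole camera (hypothetical values): focal length 100, centre (50, 40).
K = np.array([[100.0, 0.0, 50.0],
              [0.0, 100.0, 40.0],
              [0.0, 0.0, 1.0]])
P = K @ np.hstack([np.eye(3), np.zeros((3, 1))])

uv, depth, valid = project_points([[1.0, 2.0, 10.0], [0.0, 0.0, 5.0]], P)
print(uv)
```

If the hand-projected corners land on the object but the built-in drawing does not, the two are likely using different frames or matrices; if both are off in the same way, the calibration or the box itself is the more likely suspect.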