multimodal-llm Search Results

883 results
for multimodal-llm

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

mlc-ai/mlc-llm #2646

[Question] How to optimize the scheduling of multimodal LLM …

## ❓ General Questions hi,all I'm trying to port Microsoft's Florence-2-large model to mlc recently. It seems to be able to run initially, but I have a problem. Multimodal LLM models usually have …

shifeiwen updated 1 month ago
5
google-research/magvit #13

SPAE code release

Great work! After reading your paper _SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs_ , I'm very interested in the implementation, especially how the image is reconstr…

Labmem009 updated 3 days ago
9
w3c/voiceinteraction #52

Improve description of Generative AI elements

The currently followed architecture of is still too closely bound to traditional NLU based voice interaction concepts. While it aimed at including LLM with speech, LLM with multimodality, ... it is po…

schnelle updated 1 month ago
3
ggerganov/llama.cpp #8010

server: Bring back multimodal support

Multimodal has been removed since https://github.com/ggerganov/llama.cpp/pull/5882 Depends on the refactoring of `llava`, we will be able to bring back the support: https://github.com/ggerganov/lla…

ngxson updated 6 days ago
5
run-llama/llama_index #15020

[Question]: Can we use `TokenCountingHandler` with different…

### Question Validation - [X] I have searched both the documentation and discord for an answer. ### Question Hello! Let's say I use a multimodal modal like `gpt-4o` and a text model like `gemin…

paulpalmieri updated 1 month ago
2
NVIDIA/TensorRT-LLM #1644

[Model Requests] Add support for CogVLM2

Following up on Cogvlm, CogVlm2 is here: https://github.com/THUDM/CogVLM2 Easily one of the best open-source multimodal model, that is competitive to GPT-4 and Gemini. https://github.com/THUDM/Co…

harry-stark updated 1 month ago
4
ncsoft/offsetbias #2

ValueError: Model architectures ['LlamaForSequenceClassifica…

Hello， I download the model(NCSOFT/Llama-3-OffsetBias-RM-8B) from hugginface。 and then run the code below: ``` pip install -r requirements.txt ``` and then ``` from module import VllmModule …

big-bao updated 1 month ago
1
triton-inference-server/tensorrtllm_backend #594

In multi-GPU mode, is passing the prompt_embedding_table par…

### System Info - NVIDIA A100 80G * 2 - Libraries - TensorRT-LLM: 0.11.0.dev2024052800 - Driver Version: 525.105.17 - CUDA Version: 12.4 ### Who can help? @byshiue @schetlur-nv ##…

vonchenplus updated 1 day ago
1
soilwise-he/soil-health-knowledge-graph #4

Convert tables and figures in the report to RDF with multimo…

The report contains a large number of tables and figures that contain much information not mentioned in the text. In this stage, we focus on converting the text to RDF, but these tables and figures al…

wbcbugfree updated 2 months ago
1
NVIDIA/TensorRT-LLM #1227

Batch inference using Llava ModelRunner is much slower than …

### System Info GPU: a10g ### Who can help? @kaiyux ### Information - [X] The official example scripts - [ ] My own modified scripts ### Tasks - [X] An officially supported task…

spoonbobo updated 2 months ago
1

上一页 1...3 4 5 6 7 8 9...89 下一页

883 results for multimodal-llm

883 results
for multimodal-llm