video-large-language-models Search Results

1000+ results
for video-large-language-models

DARIAH-ERIC/dariah-campus #789

[new resource]: Text to Video Prompt Engineering Intensive

### Title of the resource Text to Video Prompt Engineering Intensive ### Resource type None ### Authors, editors and contributors Emily Genatowski ### Topics (keywords) AI, Large Language Model…

emilykate updated 1 year ago
1
changh95/WeeklySpatialAI #8

2024.08.28 - #6 - Sapiens, GaussianOcc, FAST-LIVO2, SOLiD-AL…

# Papers - Sapiens: Foundation for Human Vision Models - A human foundation model from Meta (impressive) - 2D pose estimation, body-part segmentation, depth prediction and normal prediction in a single model …

changh95 updated 2 months ago
3
modelscope/ms-swift #1969

mplug-owl3-7b-chat fine-tuning document

Model: - ModelScope: https://www.modelscope.cn/models/iic/mPLUG-Owl3-7B-240728 - Huggingface: https://huggingface.co/mPLUG/mPLUG-Owl3-7B-240728 Usually, fine-tuning a multimodal large model invol…

Jintao-Huang updated 1 month ago
17
microsoft/RAG_Hack #131

Raghack: Revolutionizing Education with the help of LLM

### Project Name educAIte ### Description ## Project Overview EducAIte is a web application designed to simplify text extraction and document interaction, specifically for educational purposes. By…

anzilparviz29 updated 1 month ago
1
searxng/searxng #3684

[RFC] A Multilingual plugin for SearXNG for Seamless Informa…

**Is your feature request related to a problem? Please describe.** In today's interconnected world, language barriers significantly restrict access to global information. Users often miss out on va…

phablulo updated 3 months ago
6
jwkanggist/SSL-narratives-NLP-1 #4

[Week 3] Don’t Stop Pretraining: Adapt Language Models to Domai…

# Keywords RoBERTa, Language model, Domain-adaptive pretraining, Task-adaptive pretraining # TL;DR Multiphase adaptive pretraining with domain and task corpus offers large gains in task performance…

Dien-ES updated 2 years ago
1
SYSTRAN/faster-whisper #1030

Benchmark faster whisper turbo v3

#WIP ## Benchmark with [faster-whisper-large-v3-turbo-ct2](https://huggingface.co/deepdml/faster-whisper-large-v3-turbo-ct2) For reference, here's the time and memory usage that are required to tr…

asr-lord updated 4 weeks ago
29
microsoft/RAG_Hack #107

Project: PhotoRAG - Image search application

### Project Name PhotoRAG ### Description PhotoRAG is a fullstack Next.JS image search application that leverages Azure AI and infrastructure to implement a Retrieval-Augmented Generation (RAG) sys…

dubscode updated 1 month ago
2
ChatGPTNextWeb/ChatGPT-Next-Web #4030

[Feature] Plans to add model provider support

There have been many discussions in the community regarding support for multiple models. - ChatGPTNextWeb#3484 - ChatGPTNextWeb#3923 - ChatGPTNextWeb#960 - ChatGPTNextWeb#3431 - ChatGPTNextWeb#…

fred-bf updated 3 months ago
20
ZebangCheng/Emotion-LLaMA #26

modality encoder question

"In practice, to save GPU memory, we do not load all Encoders directly onto the GPU but instead load the extracted features" Does it mean we don't need modality encoder, we already have the llama inp…

FortuneBush updated 3 days ago
3
