-
I think that stage 1 learning, i.e. vision-language representation learning with the three objectives mentioned in the article, is not yet implemented. Am I right?
Not implemented `load_pre…
-
https://huggingface.co/OpenGVLab/InternVL-Chat-V1-5
We introduce InternVL 1.5, an open-source multimodal large language model (MLLM) to bridge the capability gap between open-source and proprietary…
-
demystify-rs is a Rust program for explaining pen-and-paper puzzles like Sudoku. (What makes something a 'pen and paper' puzzle? You could print it out and solve it with pen and paper :) )
It uses …
-
```
if "lang" in self.modality_scope:
latent_goal = self.language_goal(dataset_batch["lang"])
```
I found that part of the batch data is "vis" and part of it is "lang". Why is it set up this way?
…
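The snippet above branches on a modality key before encoding the goal. A minimal self-contained sketch of that per-modality dispatch pattern is below; the names (`GoalEncoder`, `visual_goal`, the string embeddings) are hypothetical placeholders for illustration, not the repository's actual classes:

```python
# Hypothetical sketch: a dataloader yields one sub-batch per modality,
# "lang" for language-annotated episodes and "vis" for visual-goal episodes,
# and the model picks the matching goal encoder for each sub-batch.

class GoalEncoder:
    def language_goal(self, lang_tokens):
        # placeholder for a language-goal network over tokenized instructions
        return [f"lang_emb({t})" for t in lang_tokens]

    def visual_goal(self, goal_images):
        # placeholder for a visual-goal network over goal images
        return [f"vis_emb({img})" for img in goal_images]

    def encode(self, modality_scope, dataset_batch):
        # dispatch on the modality key, mirroring the `if "lang" in ...` check
        if "lang" in modality_scope:
            return self.language_goal(dataset_batch["lang"])
        return self.visual_goal(dataset_batch["vis"])
```

The point of the split is that both kinds of episodes share the same policy; only the goal encoder differs per sub-batch.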
-
### Sources
- https://github.com/ZechengLi19/Awesome-Sign-Language
- https://github.com/Skye601/SLR
- https://www.sign-lang.uni-hamburg.de/lrec/project/asllrp.html
- https://www.semanticscholar.…
-
Hi,
I just read through your project ideas, and it seems like a really nice improvement in general. From your description, though, I am a bit unsure whether the context will be presented in the user's native lang…
-
# Summary
Existing VLP models were trained from scratch, but this makes the pre-training cost very high and makes it hard to leverage models that are already well trained (especially LLMs). The approach instead connects a frozen vision encoder and a frozen LLM through a Q-Former (Querying Transformer)…
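The frozen-encoder bridging idea above can be sketched numerically: a small set of learnable query vectors cross-attends to frozen image features, and only those few query outputs are handed to the frozen LLM as soft visual tokens. Everything here (shapes, single-head attention, the name `cross_attention`) is an illustrative assumption, not BLIP-2's actual implementation:

```python
import numpy as np

def cross_attention(queries, keys_values):
    # queries: (num_queries, d), keys_values: (num_patches, d)
    scores = queries @ keys_values.T / np.sqrt(queries.shape[1])
    weights = np.exp(scores - scores.max(axis=1, keepdims=True))
    weights /= weights.sum(axis=1, keepdims=True)  # softmax over patches
    return weights @ keys_values

rng = np.random.default_rng(0)
d = 16
image_patches = rng.normal(size=(196, d))  # frozen vision-encoder output
query_tokens = rng.normal(size=(32, d))    # the only new trainable state

# 196 patch features are compressed into 32 query outputs for the frozen LLM
soft_prompts = cross_attention(query_tokens, image_patches)
print(soft_prompts.shape)  # (32, 16)
```

The design choice being sketched: both large networks stay frozen, so only the small query/attention module needs gradients, which is what cuts the pre-training cost.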
-
**System Information (please complete the following information):**
- Model Builder or CLI Version: ML Model Builder (VS 2022). .NET: 8.0.100-preview.4.23260.5
- Visual Studio Version (if applicable…
-
Diffusion Deepfake
https://arxiv.org/abs/2404.01579
-
### Feature Name
LLaVA-NeXT-34B
### Feature Description
Research about LLaVA-NeXT-34B
### Research Findings
### LLaVA-NeXT-34B
**LLaVA-NeXT-34B** is a model in the LLaVA-NeXT series, which e…