-
Hi, I was wondering why I see the following log when using `stable_diffusion_2`. I didn't think the training code was supposed to load `openai--clip-vit-large-patch14` — is it?
```
mosaic/0 [0]:[INF…
-
Hey,
What are the pre-trained weights to use for stage 1 multi-modal training, if not the original Llama 2 weights?
My current understanding is the following:
> **inference** uses the checkpoints r…
-
# 🚀 Feature request
Currently `GenerationMixin.generate()` only accepts `input_ids` but not `inputs_embeds`, so this method cannot be used when custom input embeddings are required. In contra…
ymfa updated
10 months ago
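To illustrate the feature request above: `generate()` is tied to token ids because its decoding loop appends each sampled id and re-embeds it. The same loop can be driven by embeddings directly. The sketch below is a toy stand-in, not the `transformers` implementation — `toy_model`, `generate_from_embeds`, and the embedding table are all hypothetical names for illustration.

```python
# Toy sketch of an embeddings-driven greedy decoding loop.
# All names and shapes here are illustrative assumptions.
import random

random.seed(0)
VOCAB, DIM = 10, 4
# Toy embedding table: one DIM-dimensional vector per vocabulary id.
emb = [[random.gauss(0, 1) for _ in range(DIM)] for _ in range(VOCAB)]

def toy_model(inputs_embeds):
    """Stand-in for a decoder: mean-pool the sequence, then score
    every vocabulary entry by dot product (weight-tied output head)."""
    h = [sum(col) / len(inputs_embeds) for col in zip(*inputs_embeds)]
    return [sum(a * b for a, b in zip(row, h)) for row in emb]

def generate_from_embeds(inputs_embeds, max_new_tokens=3):
    """Greedy loop that starts from embeddings instead of input_ids:
    pick the argmax id, re-embed it, append, repeat."""
    ids, x = [], list(inputs_embeds)
    for _ in range(max_new_tokens):
        logits = toy_model(x)
        next_id = max(range(VOCAB), key=lambda i: logits[i])
        ids.append(next_id)
        x.append(emb[next_id])  # re-embed the sampled token
    return ids

prompt_embeds = [emb[1], emb[2]]  # "custom input embeddings"
print(generate_from_embeds(prompt_embeds))
```

The only id-dependent step is the re-embedding of sampled tokens, which the model's own embedding layer can do — so accepting `inputs_embeds` for the prompt is compatible with the rest of the loop.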
-
Hello, how is the input modality determined during inference? Is a classification network used before the unimodal expert transformer?
-
### Week 1 - Get to know the community
- [x] Join the communication channels
- [x] Open a GitHub issue (this one!)
- [x] Install the Ersilia Model Hub and test the simplest model
- [x] Write a motiva…
-
Hi @YIKUAN8, @HanyinWang, @yuanluo,
Is there any rough comparison of speed of extracting features, for [VisualBERT](https://github.com/uclanlp/visualbert/), [LXMERT](https://github.com/airsplay/lxmer…
-
### When did you clone our code?
I cloned the code base after 5/1/23
### Describe the issue
I manually downloaded the pre-trained models to my path, shown here, by clicking the download button for each.
![…
-
https://mp.weixin.qq.com/s/YnO9IeNfvqcJq4gcNWaxeA
ixxmu updated
6 months ago
-
After running the inference command on the DressCode dataset, the result using test_pairs_unpaired.txt shows some unexpected distortion on the body (picture attached), especially missing arms. Maybe I'm…
-
Dear author, first of all, thanks for your great work. After reading your paper, I would really like to know how to calculate the parameters and the runtime of adding Focals Conv to VoxelRCNN, as you mentioned in …
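On the parameter-count part of the question above, a common back-of-envelope check is to count a conv layer's weights directly. This is a minimal sketch with illustrative channel sizes, not the paper's actual VoxelRCNN + Focals Conv configuration; the `conv3d_params` helper is hypothetical.

```python
# Back-of-envelope parameter count for a single k x k x k 3D convolution,
# the kind of block a voxel backbone stacks. Channel sizes are made up.
def conv3d_params(in_ch, out_ch, k=3, bias=True):
    """Weights: out_ch * in_ch * k^3, plus out_ch bias terms if present."""
    return out_ch * (in_ch * k ** 3 + (1 if bias else 0))

# Example: a 16 -> 32 channel 3x3x3 conv
print(conv3d_params(16, 32))   # 32 * (16*27 + 1) = 13856
```

In practice, summing `p.numel()` over `model.parameters()` in PyTorch gives the total for a whole network, and runtime is usually measured by timing the forward pass (e.g. with `time.perf_counter()`) after a few warm-up iterations.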