invictus717 MetaTransformer issues

invictus717 / MetaTransformer

Meta-Transformer for Unified Multimodal Learning

https://arxiv.org/abs/2307.10802

Apache License 2.0

1.52k stars 114 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Issues about Image Classification

#73 Lelucermaire111 closed 2 weeks ago
6
about Modality-Agnostic Models

#72 regainOWO closed 2 months ago
3
Can Meta-Transformer perform similar information retrieval to ImageBind?

#71 duguyue100 closed 2 months ago
3
Audio

#70 bjut-chunxiwang closed 5 months ago
4
Pre-trained weights for Data2Seq

#69 vittoriopipoli opened 5 months ago
1
Question in concating the features

#68 memesoo99 opened 6 months ago
0
Data2Seq for point cloud modality

#67 yozoral closed 8 months ago
1
Data2Seq中的Vedio.py有问题

#66 li-pengcheng closed 8 months ago
1
meta-transformer能否进行三维CT或者核磁数据的分割和检测？

#65 li-pengcheng closed 9 months ago
2
图片格式转换(B,C,H,W)

#64 CQU1213 closed 10 months ago
1
使用metatransformer训练自己的数据集

#63 CQU1213 closed 10 months ago
1
下游任务的代码

#62 CQU1213 closed 11 months ago
1
Amount of embeddings

#61 vzapylikhin closed 11 months ago
1
I have a question in the learning process.

#60 YooSungHyun closed 11 months ago
1
requirements

#59 anas2908 closed 1 year ago
2
Hardware configurations for fine-tuning?

#58 tctco closed 1 year ago
1
fix python version setting in Image/environment.yaml for MultiScaleDeformableAttention package

#57 chuxiuhong closed 1 year ago
0
运行环境配置说明

#56 jinpeifei2015 closed 1 year ago
1
请问使用 Data2seq的 tokenizer 时输入数据的格式

#55 Zoew420 closed 1 year ago
1
Data2Seq > Hyper_Spectrum.py update from self.cls_tokens to self.cls_token

#54 jawhster closed 1 year ago
1
为什么训练图像会出现这种错误

#53 CQU1213 closed 1 year ago
4
Explain please

#52 vzapylikhin closed 1 year ago
2
Questions about experiments

#51 yangbang18 closed 1 year ago
1
video

#50 Darcy0218 closed 1 year ago
5
Question about the ``pretrain-finetune'' pipeline

#49 yangbang18 closed 1 year ago
5
Video

#48 Darcy0218 closed 1 year ago
2
关于text模态的使用

#47 1223haohao closed 1 year ago
1
video

#46 Darcy0218 closed 1 year ago
1
audio

#45 Darcy0218 closed 1 year ago
1
如何使用较大版本的模型。

#44 1223haohao closed 1 year ago
1
paper

#43 caodonghui426 closed 1 year ago
1
How to compute similarity score between different modalities?

#42 alparius closed 1 year ago
2
how to use it?

#41 vzapylikhin closed 1 year ago
4
Data2Seq Usage/Embedding Dim

#40 s4lome closed 1 year ago
5
Fixed the bug of model test.

#39 Lum1104 closed 1 year ago
0
data2seq

#38 LH019 closed 1 year ago
3
audio

#37 Darcy0218 closed 1 year ago
3
audio

#36 Darcy0218 closed 1 year ago
2
How to pretrain Unified Multimodal Model?

#35 bbbdbbb closed 1 year ago
1
Enquiry of the training code of the Unified Multimodal Model

#34 Lum1104 closed 1 year ago
1
audio问题

#33 Darcy0218 closed 1 year ago
3
Is the model that each task will have a corresponding downstream HEAD MLP?

#32 moonriver0922 closed 1 year ago
2
FileNotFoundError: [Errno 2] No such file or directory: 'Meta-Transformer_base_patch16_encoder.pth'

#31 Eachen11 closed 1 year ago
3
how to export to Onnx model for faster inference

#30 eisneim closed 1 year ago
2
Whether to support BBOX data？

#29 nanfengguli closed 1 year ago
5
Multiple modals

#28 HaibiaoXuan closed 1 year ago
3
audio模块代码报错

#27 Darcy0218 closed 1 year ago
2
demo use

#26 Zhudogsi closed 1 year ago
2
Replicating training?

#25 yhyu13 closed 1 year ago
1
what if it not use the LAION-2B dataset CLIP backbone

#24 Dongshengjiang closed 1 year ago
1