zhjohnchan / M3AE

[MICCAI-2022] This is the official implementation of Multi-Modal Masked Autoencoders for Medical Vision-and-Language Pre-Training.
111 stars 10 forks source link

HTTPError: 401 Client Error: Unauthorized for url: https://huggingface.co/downloaded/roberta-base/resolve/main/vocab.json #1

Closed 1112321sfdsaf closed 1 year ago

1112321sfdsaf commented 2 years ago

Hey, Thanks very much for the excellent work and repo. When I run 'bash run_scripts/pretrain_m3ae.sh', I got the above exception. I tried to click this link and found that the link is unavailable. So, I replaced ' tokenizer=downloaded/roberta-base' in 'pretrain_m3ae.sh' with ' tokenizer=roberta-base', and got a successful run. I would like to know if this operation is allowed. image

zhjohnchan commented 2 years ago

Hi there,

Thanks for your attention. The operation is allowed since the downloaded RoBERTa model is the same one.

Best, Zhihong

1112321sfdsaf commented 2 years ago

Thank you very much for your reply.

1112321sfdsaf commented 2 years ago

Hi again, I found that the default batch size per GPU is 32, and my own configuration seems far from that requirement, can you provide how much GPU memory is needed for 32 batches?

zhjohnchan commented 2 years ago

Hi,

I used A100-80G in my experiments.

Best, Zhihong

1112321sfdsaf commented 2 years ago

Thanks for your reply again.

1112321sfdsaf commented 2 years ago

Hi, when preparing the MELINDA dataset using the link introduced in the paper "MELINDA: A Multimodal Dataset for Biomedical Experiment Method Classification", I got a 404 on the page, which means the link does not exist. So, how can I download this dataset to follow your excellent work?

zhjohnchan commented 2 years ago

Hi,

I request the dataset from the authors. I think you can send an e-mail to the author.

Best, Zhihong

1112321sfdsaf commented 2 years ago

Thanks for your suggestion.

1112321sfdsaf commented 1 year ago

Hi, I'm back. 😅

When I download VQA-RAD on the official page, I found that the files are different from the description of this repo. image Moreover, reading the dataset paper, it seems just to divide the dataset into a training set and a test set. So, can you provide more details on the data splits including training, validation, and test sets?

Thanks in advance.