DLYuanGod / ArtGPT-4

Artistic Vision-Language Understanding with Adapter-enhanced MiniGPT-4
BSD 3-Clause "New" or "Revised" License
24 stars 4 forks source link

Request to publish the 3500-pair high-quality dataset of the second phase alignment #1

Open magicwang1111 opened 1 year ago

magicwang1111 commented 1 year ago

artgpt has a big boost to photo descriptions in art,I have seen that you create a small (3500 pairs in total) yet high-quality dataset,can you public this dataset?thx

magicwang1111 commented 1 year ago

I'm curious why the files you trained in the second stage have a size of 6G, and the files I trained are all 45MB,my dataset is 5000 pairs.

DLYuanGod commented 1 year ago

This dataset is the data of MiniGPT-4 in the second phase. We didn't make any changes. Thanks.

DLYuanGod commented 1 year ago

Sorry, we checked, and our paper also clearly states that it is 5000 pairs, 45M, and it may be incorrectly written on Github. Thank you for your feedback.

magicwang1111 commented 1 year ago

Sorry, we checked, and our paper also clearly states that it is 5000 pairs, 45M, and it may be incorrectly written on Github. Thank you for your feedback.

can you supply first pretraining stage model,my gpu is too low cant train.