DLYuanGod / ArtGPT-4

Artistic Vision-Language Understanding with Adapter-enhanced MiniGPT-4
BSD 3-Clause "New" or "Revised" License

Download of the Stage_I dataset #2

Open QQQfive opened 10 months ago

QQQfive commented 10 months ago

The paper says the first stage uses a 200GB dataset, but the code downloads a 2.3TB dataset. Why the difference?

DLYuanGod commented 10 months ago

For stage 1 we use Laion-aesthetic from the LAION-5B dataset. The first 302 tar files amount to approximately 200GB.
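
As a rough sketch of what restricting a download to the first 302 tar files could look like, the snippet below builds the list of shard names and estimates the average shard size from the figures in this thread. The zero-padded `00000.tar` naming pattern and both helper functions are assumptions made for illustration; they are not taken from the repository, so check the actual file names before relying on this.

```python
# Hypothetical helpers: the shard naming pattern is an assumption,
# not something confirmed by the ArtGPT-4 repository.

def first_shards(count, pattern="{:05d}.tar"):
    """Return the names of the first `count` shards, numbered from 0."""
    return [pattern.format(i) for i in range(count)]


def average_shard_size_gb(total_gb, count):
    """Average size per shard, given the total size and the shard count."""
    return total_gb / count


# Figures from the thread: about 200GB across the first 302 tar files.
shards = first_shards(302)
avg_gb = average_shard_size_gb(200, 302)

print(len(shards), shards[0], shards[-1])
print(round(avg_gb, 2))
```

Under these assumptions each tar file averages roughly two thirds of a gigabyte, which is a quick way to sanity-check a partial download against the 200GB figure.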