kuprel / min-dalle

min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch
MIT License
3.48k stars 255 forks source link

Size #98

Open Muqeeth99 opened 1 year ago

Muqeeth99 commented 1 year ago

What is the size of this model of min-dalle?

kartikpodugu commented 10 months ago

min-dalle uses three models in the pipeline to generate images from text.

  1. Dalle BART encoder
  2. Dalle BART decoder
  3. VQGAN Detokenizer

There are two variants of the pipeline

Variant Size Details
mini encoder - 386 MB, decoder - 448 MB, detokenizer - 178 MB
mega encoder - 2.06 GB, decoder - 2.75 GB, detokenizer - 178 MB

VQGAN Detokenizer is same in both variants.