brysonjones / mgen3d

mgen3d: A python library for generating 3D models and assets
0 stars 0 forks source link

Do research to find the best open-source, commercially viable image-captioning model #16

Open brysonjones opened 10 months ago

brysonjones commented 10 months ago

Popular VLMs for image captioning like blip2 and CogVLM are not true open source as either their weights or training data restrict commercial use

brysonjones commented 10 months ago

Going to delay this for now to focus on getting the rest of the pipeline working.

Will provide annotation at training time until this is found and integrated