mlfoundations / open_flamingo

An open-source framework for training large multimodal models.
MIT License
3.68k stars, 277 forks

Create extra samples with surplus images #272

Open isaac-chung opened 12 months ago

isaac-chung commented 12 months ago

Addresses issue https://github.com/mlfoundations/open_flamingo/issues/231

isaac-chung commented 12 months ago

A few questions before I make more changes:

anas-awadalla commented 12 months ago

Yeah, I think we can default to the mmc4 code for all of these.

For the first point, this should be based on the image size. I think we just defaulted to 224x224 because most vision encoders use that input size, but that isn't the right way to do it.

isaac-chung commented 11 months ago

@anas-awadalla here's a first complete draft. Please let me know what you think. Separately, I think a pre-commit hook would benefit development. I could raise this in a separate PR if needed.

anas-awadalla commented 11 months ago

Great, will check it tomorrow.

anas-awadalla commented 11 months ago

What would the hook contain? The code formatting?

isaac-chung commented 11 months ago

> What would the hook contain? The code formatting?

Yep, kind of like this or whatever else we want.

anas-awadalla commented 11 months ago

> > What would the hook contain? The code formatting?
>
> Yep, kind of like this or whatever else we want.

This would be awesome!

isaac-chung commented 11 months ago

@anas-awadalla hoping to get your feedback on this when you next get a chance 🙏

anas-awadalla commented 11 months ago

Sorry @isaac-chung, I got busy with a paper I am pushing. I will definitely review and merge this week, though!

isaac-chung commented 11 months ago

No worries, good luck with the paper!

isaac-chung commented 11 months ago

@anas-awadalla a gentle nudge to bubble this back up in your inbox. Hoping to close this soon 🙏