Add inference snippets for image-text-to-text

huggingface / huggingface.js

Utilities to use the Hugging Face Hub API

https://hf.co/docs/huggingface.js

MIT License

1.37k stars 213 forks source link

Add inference snippets for image-text-to-text #927

Closed Wauplin closed 4 days ago

Wauplin commented 1 week ago

This PR adds inference snippets for image-text-to-text models, say meta-llama/Llama-3.2-11B-Vision-Instruct for example :smile:

I've tested all three examples locally and they work as expected :)

Wauplin commented 4 days ago

@coyotte508 @mishig25 thanks for the feedback. I addressed above the comment to use "conversational" tag. Otherwise let's merge this and come back to it when multiple inference snippets can be provided. Can I have a final review before merging?

mishig25 commented 4 days ago

github is showing that model is indendted wrongly?

Wauplin commented 4 days ago

github is showing that model is indendted wrongly?

Well well well, looks like it yes. Addressed in https://github.com/huggingface/huggingface.js/pull/927/commits/0f8452c3f8199cb3d0e4c9fd6a21b81c0a86e39b

Wauplin commented 4 days ago

Thanks! Sorry about the back and forth :grimacing:

mishig25 commented 4 days ago

trigerred https://github.com/huggingface/huggingface.js/actions/runs/11107930206 so that we can get it in moon