Scripts for fine-tuning Meta Llama with composable FSDP and PEFT methods on single-node and multi-node GPU setups. Supports default and custom datasets for applications such as summarization and Q&A. Supports a number of inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Includes demo apps that showcase Meta Llama for WhatsApp and Messenger.
This is an example notebook on how to create a Llama 3.2 vision-powered agent that can interact with web browsers on your behalf. It includes a detailed explanation of every section and example use cases.
Features
Visual understanding of web pages through screenshots
Autonomous navigation and interaction
Natural language instructions for web tasks
Persistent browser session management
For example, you can ask the agent to:
Search for a product on Amazon
Find the cheapest flight to Tokyo
Buy tickets for the next Warriors game
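The features and example tasks above amount to an observe, decide, act loop: take a screenshot, ask the model for the next action, run that action in the same browser session, and repeat. The sketch below illustrates only that control flow and is not the notebook's implementation. `FakeBrowser`, `scripted_model`, `run_agent`, the JSON action format, and the selectors are all hypothetical stand-ins; a real agent would send the screenshot to a Llama 3.2 vision model and drive an actual browser.

```python
import json
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class FakeBrowser:
    """Hypothetical stand-in for a browser session; it only records requests."""
    url: str = "about:blank"
    log: List[str] = field(default_factory=list)

    def screenshot(self) -> bytes:
        # A real session would return image bytes of the current page.
        return f"screenshot of {self.url}".encode()

    def goto(self, url: str) -> None:
        self.url = url
        self.log.append(f"goto {url}")

    def type_text(self, selector: str, text: str) -> None:
        self.log.append(f"type {selector!r} {text!r}")

    def click(self, selector: str) -> None:
        self.log.append(f"click {selector!r}")


def scripted_model(task: str, screenshot: bytes, step: int) -> str:
    """Hypothetical model call: returns the next action as a JSON string.

    A real agent would pass the task and screenshot to a vision model here.
    """
    script = [
        {"action": "goto", "url": "https://www.amazon.com"},
        {"action": "type_text", "selector": "#search", "text": task},
        {"action": "click", "selector": "#submit"},
        {"action": "done"},
    ]
    return json.dumps(script[min(step, len(script) - 1)])


def run_agent(
    task: str,
    browser: FakeBrowser,
    model: Callable[[str, bytes, int], str],
    max_steps: int = 10,
) -> List[str]:
    """Run the observe/decide/act loop until the model says it is done."""
    # Explicit allowlist: the model can only trigger these three actions.
    handlers = {
        "goto": browser.goto,
        "type_text": browser.type_text,
        "click": browser.click,
    }
    for step in range(max_steps):
        reply = json.loads(model(task, browser.screenshot(), step))
        name = reply.pop("action")
        if name == "done":
            break
        if name not in handlers:
            raise ValueError(f"unsupported action: {name}")
        handlers[name](**reply)
    return browser.log


if __name__ == "__main__":
    for line in run_agent("wireless mouse", FakeBrowser(), scripted_model):
        print(line)
```

The same `browser` object is reused across every step, which is the sense in which the session persists. The `max_steps` cap and the action allowlist are design choices of this sketch that keep a misbehaving model from looping forever or calling arbitrary methods.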
Before submitting
[ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Browser Use Llama-Recipe
Thanks for contributing 🎉!