meta-llama / llama-recipes

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
15.29k stars 2.21k forks source link

Browser use with Llama 3.2 Vision Quickstart #799

Open miguelg719 opened 23 hours ago

miguelg719 commented 23 hours ago

Browser Use Llama-Recipe

This is an example notebook on how to create a Llama 3.2 vision-powered agent that can interact with web browsers on your behalf. It includes a detailed explanation of every section and example use cases.

Features

For example, you can ask the agent to:

Before submitting

Thanks for contributing 🎉!