In the WebLINX paper, we found that non-finetuned models do not perform as well as finetuned ones, so it is unlikely that Llama-3-8B-Instruct surpasses Llama-3-8B-Web. However, if you think the results are worth adding, it should be fairly straightforward to run the eval script with minor changes to the config: https://github.com/McGill-NLP/webllama/tree/main/modeling#run-llama-on-evaluation-splits
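For illustration, the change would likely amount to pointing the eval config at the non-finetuned checkpoint. Here is a minimal sketch, assuming a YAML config with a `model.name` field as described in the linked README; the exact key names and file layout may differ:

```yaml
# Hypothetical config edit for the evaluation splits (exact keys may differ;
# see the modeling README linked above). Swap the finetuned checkpoint for
# the non-finetuned instruct model:
model:
  name: meta-llama/Meta-Llama-3-8B-Instruct  # instead of McGill-NLP/Llama-3-8B-Web
```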
I see you compared your model with other models, but what about the base model it was derived from? Thanks