llm-edge / hal-9100

Edge full-stack LLM platform. Written in Rust
MIT License
362 stars 28 forks source link

code interpreter: pack an headless browser / browser mode or tool #43

Open louis030195 opened 6 months ago

louis030195 commented 6 months ago

basically LLM should be able to do this:

"yo here's a website https:abcd.com can you do x and y and get w and q from it plz?"

code interpreter would gen python code to do these stuff in headless browser step by step (including doing clicks, filling forms, etc. using playwright api for example) and then go back to the user and user happy

just gotta:

also i think code interpreter should have some options for ppl that dont want browser for example

(openai by default block scrapping in their code interpreter for security reason i assume)

louis030195 commented 6 months ago

it also would enable browsing like chatgpt ui does

anyone want this? or anyone want to contribute?

louis030195 commented 6 months ago

maybe it could be a different tool, not sure, also make sure not to diverge from openai api