buildingforalignment / .github

0 stars 0 forks source link

Create a proof of concept for an LLM based agent #1

Open kevinforalignment opened 1 month ago

kevinforalignment commented 1 month ago
kevinforalignment commented 1 month ago

Design environment:

kevinforalignment commented 1 month ago

Make it into a game that people can play?

kevinforalignment commented 1 month ago

Design environment:

kevinforalignment commented 1 month ago

Start proof of concept by using phi-3-mini model and scale up to a bigger model or swap to a different model class as needed. phi-3 is attractive because of it's small size, however, it may not be appropriate due to it's lack of pretraining on function calls. The initial goal is to set up a pipeline quickly and cheaply to experiment with and make adjustments from there.

kevinforalignment commented 1 month ago

Start a new repo for this issue: https://github.com/buildingforalignment/agent-experiments