Open kevinforalignment opened 1 month ago
Design environment:
Make it into a game that people can play?
Design environment:
Start proof of concept by using phi-3-mini model and scale up to a bigger model or swap to a different model class as needed. phi-3 is attractive because of it's small size, however, it may not be appropriate due to it's lack of pretraining on function calls. The initial goal is to set up a pipeline quickly and cheaply to experiment with and make adjustments from there.
Start a new repo for this issue: https://github.com/buildingforalignment/agent-experiments