Closed jamesbraza closed 1 week ago
This PR brings PaperQAEnvironment to support empty tool calls leading to the environment being done. This creates a back door for the agent to trigger a rollout to conclude.
PaperQAEnvironment
I ran some local testing and it seems if anything performance got better
This PR brings
PaperQAEnvironment
to support empty tool calls leading to the environment being done. This creates a back door for the agent to trigger a rollout to conclude.