Develop a UI interface for the app that enables the user to like or dislike the commands generated, so that we can use it for RLHF.
This could either be feedback on the entire chain-of-commands from the chat window, but it might might also consider individual command generations that have gone wrong. The latter seems more useful and practical for RLHF.
Develop a UI interface for the app that enables the user to like or dislike the commands generated, so that we can use it for RLHF.
This could either be feedback on the entire chain-of-commands from the chat window, but it might might also consider individual command generations that have gone wrong. The latter seems more useful and practical for RLHF.
This feature is only useful if #3 is set up.