Open brando90 opened 5 years ago
@ejgallego is there any point in having the tactic simpl.
be an action choice for the RL agent?
@ejgallego I remember in some previous discussion you suggested some tactic(s) that could have the RL agent potentially loop a proof forever. Was it symmetry.
? I forget which one it was but I thought it would be a good idea to try to give the agent some tactics it should learn to avoid repeating that cause this sort of problems.
@ejgallego is there any point in having the tactic
simpl.
be an action choice for the RL agent?
Oh indeed simpl
is quite complicated and can make a later rewrite
step for example succeed.
hat could have the RL agent potentially loop a proof forever. Was it
symmetry.
?
Yup, symmetry, and in general many rewrites, for example addnC n m : n + m = m + n
, rewriting with this will loop forever.
I guess the most reasonable choice will be to limit the depth at first; the agent will quickly learn that rewriting twice with a tactic is not a good thing to do :D
Discussion for design decisions for the AI agent.
The 1st agent will be a simple RL agent (conv net) acting on embeddings of Coq Terms.