Open troiwill opened 6 months ago
Thanks for the effort. Much better organized than before. Will take a pass soon. One thing - the CIs are failing. Could you fix this first? I just merged this PR that will make CI actions trigger when you update the source branch of this PR. Please update your branch with the latest changes in main.
I merged the changes from `main` into `ccpomcp`. Let me know if I need to update anything.
Great. Looks like the CIs passed except for pre-commit. I will review the code soon. ~I should also enable CIs for all branches.~ It should be enabled after you rebase/merge the latest changes in main. @troiwill
This pull request implements the cost-constrained POMCP (CC-POMCP) algorithm and its dependencies. The algorithm is in `algorithms/ccpomcp.p*`, while the dependencies are in the `framework/generalization.p*` and `utils/cvec.p*` files. This pull request also proposes a generic model, called a `ResponseModel`, and a corresponding output, called a `Response`. The name "response" comes from the notion of independent and dependent variables, where a response (reward, cost, etc.) depends on the interaction with the real or simulated environment. Thus, a response model is a wrapper for more specific models, such as reward and cost models (and any others that will follow in the future). By extension, a response is a wrapper for the reward, cost, etc.

The `framework/generalization.p*` files contain

The pull request has the following:
test_util_vector_ops.py
), and