JuliaPOMDP / TabularTDLearning.jl

Julia implementations of temporal difference Reinforcement Learning algorithms like Q-Learning and SARSA
Other
10 stars 6 forks source link

Add prioritizied sweeping #30

Closed NeroBlackstone closed 1 year ago

NeroBlackstone commented 1 year ago

Thanks for the code review, this is a prioritized sweeping solver for deterministic MDP.

codecov[bot] commented 1 year ago

Codecov Report

Patch coverage: 100.00% and no project coverage change.

Comparison is base (df9af59) 100.00% compared to head (b24039e) 100.00%.

:exclamation: Current head b24039e differs from pull request most recent head 043f0b1. Consider uploading reports for the commit 043f0b1 to get more accurate results

Additional details and impacted files ```diff @@ Coverage Diff @@ ## master #30 +/- ## ========================================= Coverage 100.00% 100.00% ========================================= Files 4 5 +1 Lines 152 220 +68 ========================================= + Hits 152 220 +68 ``` | [Impacted Files](https://app.codecov.io/gh/JuliaPOMDP/TabularTDLearning.jl/pull/30?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=JuliaPOMDP) | Coverage Δ | | |---|---|---| | [src/TabularTDLearning.jl](https://app.codecov.io/gh/JuliaPOMDP/TabularTDLearning.jl/pull/30?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=JuliaPOMDP#diff-c3JjL1RhYnVsYXJURExlYXJuaW5nLmps) | `100.00% <ø> (ø)` | | | [src/q\_learn.jl](https://app.codecov.io/gh/JuliaPOMDP/TabularTDLearning.jl/pull/30?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=JuliaPOMDP#diff-c3JjL3FfbGVhcm4uamw=) | `100.00% <ø> (ø)` | | | [src/sarsa.jl](https://app.codecov.io/gh/JuliaPOMDP/TabularTDLearning.jl/pull/30?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=JuliaPOMDP#diff-c3JjL3NhcnNhLmps) | `100.00% <ø> (ø)` | | | [src/sarsa\_lambda.jl](https://app.codecov.io/gh/JuliaPOMDP/TabularTDLearning.jl/pull/30?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=JuliaPOMDP#diff-c3JjL3NhcnNhX2xhbWJkYS5qbA==) | `100.00% <ø> (ø)` | | | [src/prioritized\_sweeping.jl](https://app.codecov.io/gh/JuliaPOMDP/TabularTDLearning.jl/pull/30?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=JuliaPOMDP#diff-c3JjL3ByaW9yaXRpemVkX3N3ZWVwaW5nLmps) | `100.00% <100.00%> (ø)` | |

:umbrella: View full report in Codecov by Sentry.
:loudspeaker: Do you have feedback about the report comment? Let us know in this issue.