SeanMcOwen closed 1 month ago
1, diff_ratio, reward_ratio
1, diff_ratio, log(reward_ratio)
where reward_ratio is r_qi / r_kquai
Add these as narrative arcs
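A minimal sketch of how the two feature sets could be computed for a single observation. It reads the leading `1` as a constant (intercept) term, which the issue does not state. `diff_ratio` is taken as an input because the issue does not define it, and the function name `feature_vectors` is hypothetical.

```python
import math


def feature_vectors(diff_ratio, r_qi, r_kquai):
    """Return both candidate feature vectors for one observation.

    Assumption: the leading 1 is a constant (intercept) term.
    diff_ratio is passed in as-is; the issue does not say how it is derived.
    """
    # Reward ratio as given in the issue: r_qi / r_kquai
    reward_ratio = r_qi / r_kquai

    # First set: 1, diff_ratio, reward_ratio
    linear = [1.0, diff_ratio, reward_ratio]

    # Second set: 1, diff_ratio, log(reward_ratio); needs reward_ratio > 0
    logged = [1.0, diff_ratio, math.log(reward_ratio)]

    return linear, logged
```

The log variant is only defined when both rewards are positive, so zero or negative rewards would need to be handled before calling it.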