-
# WIP: English version using Mermaid
## policy
- [ ] policy-based learning 基于策略函数的学习方法
- [ ] value-based learning 基于值函数的学习方法
- [x] 动态规划学习方法 (Dynamic programming learning)
-…
-
# Home | Jane's PS Blog
Jane의 PS 블로그
[https://janeljs.github.io/algorithms/programmers-%EB%B0%A9%EB%AC%B8-%EA%B8%B8%EC%9D%B4/](https://janeljs.github.io/algorithms/programmers-%EB%B0%A9%EB%AC%B8-%EA…
-
Problem: Process fails
catboost version: 1.2.7
Operating System: ??? (Kaggle)
CPU: ??? (Kaggle)
GPU: P100 / T4x2
Params:
{'learning_rate': 0.270640171567353, 'iterations': 1100, 'depth': 8, 'l…
-
Can _evaluation_loop use SyncDataCollector for non vectorized envs so that the evaluation is also parallel?
While running on Melting Pot envs, increasing n_envs_per_worker definitely improves execu…
-
**What happened**:
Many AKS maintained Pods are running with memory overcommitment, eg:
- omsagent (Daemonset; up to 375 MB)
- coredns (Deployment, up to 100MB)
- kube-proxy (Daemonset; unlimite…
-
In order to synchronize learning between EE agents, there needs to be a way to tell an agent about something that has already been learned by one of its parents. Consider using the following approach:…
-
I would like to get a slightly better understanding regarding the difference between the on-policy and off-policy as well as some clarifications regarding the formulas used to apply them. Namely, what…
-
`@click.command()
@click.option('--train_path',
help='Huggingface dataset name',
required=True
)
@click.option('--out_dir', default='../checkpoints/', help='Output directory'…
-
-
@nschneid wrote the other day regarding the analysis of reported speech in response to: @MagaliDuran, that "the policy was recently changed but not fully updated in the guidelines".
This took me b…