-
I have a largish Soar agent (120k productions). When I try to run fc, Soar essentially hangs (not a core dump, but the process pegs out the CPU and does not return after a long while -- have waited 30…
-
**Describe the bug**
Error running with Mamba: `'HookedMamba' object has no attribute 'W_E'`.
**Code example**
```
cfg = LanguageModelSAERunnerConfig(
model_name="state-spaces/mamba-2.8b"…
-
Dear Jorge,
My name is Peng Jian, and from mainland, China.
I am a teacher from College of Information and Intelligence, Hunan Agricultural University, and a translator of Tsinghua University P…
-
I want to outline the ideas and dreams I have for the `curriculum` module. Right now its features are very limited. We can only train a non-changing agent using a linear sequence of tasks that, becaus…
-
MultiAgent RL
## 문제 설정
- 협동 => chase
- 쫒는 애들은 MARL
- 도망치는 애들은 룰기반
- combat? 싸움 알고리즘?
- 평가? 룰기반 vs MARL 에이전트
잡는게 더 쉽다
축구는 적합한 상황이 아님. 패스 정도...
## 학습
방법론
- centralized
- de…
-
Running `zero_demo.py` crashes with the following error:
```
Traceback (most recent call last):
File "zero_demo.py", line 104, in
simulate_game(board_size, black_agent, c1, white_agent, c…
-
I'm trying SQL on a simple manipulator reaching task, the agent quickly learns to get to the vicinity of the goal but then the learning curve plateaus and the agent never quite get to the goal. Some o…
-
Hi guys!
I'm learning about Consul ACLs setup and found this great Docker POC, thank you for sharing this nice tutorial!
While bootstrapping the cluster I found an issue because of expired TLS …
-
Sometimes the starting point of the weights is so bad that every learning update has a reward of 0. The agent is not learning anything ... it's just wasting time. We already terminate eval early if th…
-
### Idea Contribution
- [X] I have read all the feature request issues.
- [X] I'm interested in working on this issue
- [X] I'm part of GSSOC organization
### Explain feature request
Adding proper …