-
The current simulation model has been successful in exploring basic agent dynamics, but to push the boundaries of emergent behavior and complexity, it would be valuable to introduce new agent types. T…
-
I am trying to fill in some gaps in the documentation (including source code comments), and I have seen in that file change or addition intro blurb with references to transformer smaller nets links.
…
-
I have a lot of ideas about the future of ActivityWatch that I haven't written down, and some of these are highly important since they concern the direction I want things to go in.
Communicating t…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Feature Description
The project aims to develop a reinforcement learning (RL) agent to optimize waste collecti…
-
# Bandits for Recommender Systems
Industry examples, exploration strategies, warm-starting, off-policy evaluation, and more.
[https://eugeneyan.com/writing/bandits/](https://eugeneyan.com/writing/ba…
-
Inferring strategies in repeated games: The French Defence
===========================================================
Background
----------
In a repeated game, players interact over a finite …
-
Hi, I have tried your linUCB disjoint implementation,
and I found that the lower I set alpha, the higher the CTR it returns.
When alpha = 0.01, the cumulative click rate almost converges to 0.9.
I …
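
For reference, here is a minimal sketch of where alpha enters disjoint LinUCB. It is not the implementation discussed in this issue: the names `DisjointLinUCBArm` and `choose_arm` are hypothetical, and it assumes each arm starts from an identity matrix and keeps its own inverse via a Sherman-Morrison update. Alpha only scales the exploration bonus added to each arm's estimated reward.

```python
import math


class DisjointLinUCBArm:
    """One arm of disjoint LinUCB (hypothetical helper, for illustration only)."""

    def __init__(self, dim):
        # A starts as the identity matrix, so its inverse is the identity too.
        self.A_inv = [[1.0 if i == j else 0.0 for j in range(dim)] for i in range(dim)]
        self.b = [0.0] * dim

    def _A_inv_times(self, v):
        # Matrix-vector product A^{-1} v.
        return [sum(a * vi for a, vi in zip(row, v)) for row in self.A_inv]

    def score(self, x, alpha):
        """Return (estimated reward, exploration bonus) for context x."""
        A_inv_x = self._A_inv_times(x)
        theta = self._A_inv_times(self.b)  # ridge-regression estimate
        mean = sum(t * xi for t, xi in zip(theta, x))
        bonus = alpha * math.sqrt(sum(xi * v for xi, v in zip(x, A_inv_x)))
        return mean, bonus

    def update(self, x, reward):
        # Sherman-Morrison rank-one update of A^{-1} after A += x x^T.
        # A^{-1} is symmetric, so A^{-1} x x^T A^{-1} is an outer product.
        A_inv_x = self._A_inv_times(x)
        denom = 1.0 + sum(xi * v for xi, v in zip(x, A_inv_x))
        n = len(x)
        for i in range(n):
            for j in range(n):
                self.A_inv[i][j] -= A_inv_x[i] * A_inv_x[j] / denom
        self.b = [bi + reward * xi for bi, xi in zip(self.b, x)]


def choose_arm(arms, x, alpha):
    """Pick the index of the arm with the highest upper confidence bound."""
    scores = [sum(arm.score(x, alpha)) for arm in arms]
    return max(range(len(arms)), key=scores.__getitem__)
```

In this sketch, a very small alpha makes `choose_arm` almost purely greedy on the estimated reward, while a larger alpha favours arms that have been tried less often in the direction of `x`. Whether that accounts for the click rates reported above depends on how the policy is evaluated, which this sketch does not cover.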
-
[Bayesian optimisation of functions on graphs](https://proceedings.neurips.cc/paper_files/paper/2023/hash/86419aba4e5eafd2b1009a2e3c540bb0-Abstract-Conference.html)
```bib
@article{wan2023bayesian,
…
-
Post questions here for [Xuechunzi Bai](https://www.xuechunzibai.com/) regarding her 5/2 talk **Multidimensional Stereotypes Emerge Spontaneously When Exploration is Costly**. Stereotypes of social gr…
-
![1](https://user-images.githubusercontent.com/65479151/226549749-49b2dbd0-a970-4d5a-9560-46c393fc1c6c.jpg)