-
Hello, sorry I don't actually know much about the mathematical formulas and whatnot behind rl and the algorithms, I've previously been just training stuff with SB3 PPO.
Anyway, I installed sheeprl an…
-
I am trying to modify the YOLOv8 backbone by adding the attention module inside. However, I keep getting key error. I modified the .yaml file, modules.py and tasks.py but still not working. Anyone can…
-
Hi,
I am interested in adding the implementation of [Twin Delayed Deep Deterministic Policy Gradients (TD3) ](https://arxiv.org/abs/1802.09477) to the Keras examples repository.
TD3 addresses th…
-
微博内容精选
-
### ❓ Question
Hi,
I have several questions:
# Entropy scheduler
In SB3 it is possible to define the weight for the entropy loss function that it is used for example in A2C. I would like to de…
-
### Search before asking
- [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and fou…
-
I modified `llama2_7b_full_wizardlm_e1_copy.py` with alpaca_dataset and added parameter `torch_dtype=torch.float16` in model loading, as following:
```
model = dict(
type=SupervisedFinetune,
…
-
## Keyword: differential privacy
### State-of-the-Art Approaches to Enhancing Privacy Preservation of Machine Learning Datasets: A Survey
- **Authors:** Chaoyu Zhang
- **Subjects:** Cryptography an…
-
## User story ##
In order to be able to ship the LAC/SAC pytorch implementation to the team we need to validate whether it gives the same results as the LAC/SAC tensorflow version.
## Consideratio…
-
The idea is to use transfer learning, at the most basic level as a first step, to using the trained agents in other environments:
- [x] Implement a perfect ring or circle in the `models.py` file.
…