-
**Describe the bug**
I try to finetune `llama3-8B` model with multi nodes but get an AtrributeError when finishing loading mcore format checkpoint and starting to build datasets, the error is below:
…
-
For reference PyTorch interface:
```Python
optimizer = optim.SGD(net.parameters(), lr=opt.learning_rate, momentum=0.9, weight_decay=1e-4)
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step…
-
Issue opened to collect info about possible future SPSA improvements.
### SPSA references
SPSA is a fairly simple algorithm to be used for local optimization (not global optimization).
The wiki h…
-
- Brainstorm different types of charts and visualizations.
- Research examples of interactive visualizations for inspiration.
-
Hi there!
I've tried to compile ethzasl_ptam but I got some errors in the process. I've followed the steps given in #39
and #48 , but without better results. I'm trying to build it in a Raspberry P…
-
This is not an issue but a question about how katago handles different board sizes.
Please feel free to move it or direct me to where to post the question if it can't stay here.
It seems that in t…
-
### System Info
Python 3.11.5
torch 2.3.0
transformers 4.41.1
accelerate 0.30.1
```
+----------------------------------…
-
## Introduction: How would you simulate gradients
If you want to simulate the gradient of a random function $Z$, it turns out that you simply need to take derivatives of the covariance funcion, as …
-
Hi,
i have a question about the code:
why all the training was done using SAC (i mean inside main.py file) and then in LongiControl_SHAP.ipynb a ddpg session was initialized? Is not possible to appl…
-
**Motivation**
MaskablePPO is great for large discrete action space that has many invalid actions at each step, while RecurrentPPO is useful for the agent to has a memory of previous observations and…