-
Hi!
Have a question about training.
After 16 hours of training, I still get average reward 0.
Will be happy if you can explain what can be wrong?
Maybe it's a problem with default setup parame…
-
-
Hi, I'm fine-tuning a fastchat-3b model with LoRA. The processes are getting killed at the `trainer.train()` step with the following log / error:
```
Loading extension module cpu_adam...
Time to lo…
-
Hello, I'm the developer of [Workflow Buddy - the missing utilities for Workflow Builder](https://github.com/happybara-io/WorkflowBuddy), which is being impacted by the deprecation of `Steps for Apps`…
-
I have trained catSeq model and its performance is as your reported. When I use
`python3 train.py -data data/kp20k/kp20k_separated/rl/ -vocab data/kp20k/kp20k_separated/rl/ -exp_path=exp -exp catSeq_…
-
# Title
## Understanding SHAP for Interpretable Machine Learning: A Tutorial and Hands-on Workshop
# Responsible person(s)
Nicolás Nieto (n.nieto@fz-juelich.de) 1,2,
Federico Raimondo (f.raimo…
-
Dear @lululxvi and comunity,
I'm using DEEPXDE to infer several unknow parameters in PDE and ODE. I used the callbacks to monitor the changes of these infered parameters during the training process.
…
ZPLai updated
2 years ago
-
Hi! I'm learning DDP method recently and also upvoted your brilliant implementation. It seems like you are using the MPC version of ilqr? I change it into normal version but it does not converged any …
-
There are some repository for reference. https://github.com/tkn-tub/ns3-gym/tree/master/scratch/rl-tcp.
There are some practical issue to apply reinforce learning for congestion control.
https://g…
-
### System Info
I'm trying to do the tutorials and there are many things that aren't correct. One that is a blocker preventing me from learning Retrieval is the fact that it seems like all document l…