q-learning Search Results

1000+ results for q-learning

samre12/deep-trading-agent #6

training issue

Hi! Have a question about training. After 16 hours of training, I still get average reward 0. Will be happy if you can explain what can be wrong? Maybe it's a problem with default setup parame…

philipshurpik updated 6 years ago
5
kjenkins5678/fitness-app #16

As a fitness website user, I want to get the searched inform…

kjenkins5678 updated 4 years ago
1
lm-sys/FastChat #1942

LoRA fine-tuning for fastchat-3b exits with return code = -7

Hi, I'm fine-tuning a fastchat-3b model with LoRA. The processes are getting killed at the `trainer.train()` step with the following log / error: ``` Loading extension module cpu_adam... Time to lo…

ht0rohit updated 1 year ago
1
slackapi/deno-slack-sdk #195

Questions on capabilities compared to the deprecated Steps …

Hello, I'm the developer of [Workflow Buddy - the missing utilities for Workflow Builder](https://github.com/happybara-io/WorkflowBuddy), which is being impacted by the deprecation of `Steps for Apps`…

I-Dont-Remember updated 1 year ago
2
kenchan0226/keyphrase-generation-rl #23

about training rl model

I have trained the catSeq model and its performance is as you reported. When I use `python3 train.py -data data/kp20k/kp20k_separated/rl/ -vocab data/kp20k/kp20k_separated/rl/ -exp_path=exp -exp catSeq_…

sjchasel updated 1 year ago
7
FZJ-INM1-BDA/HAICon2024-satellite-events #6

Understanding SHAP for Interpretable Machine Learning: A Tut…

# Title ## Understanding SHAP for Interpretable Machine Learning: A Tutorial and Hands-on Workshop # Responsible person(s) Nicolás Nieto (n.nieto@fz-juelich.de) 1,2, Federico Raimondo (f.raimo…

N-Nieto updated 5 months ago
1
lululxvi/deepxde #574

Fail to update the inferred unknown PDE parameters when using …

Dear @lululxvi and community, I'm using DEEPXDE to infer several unknown parameters in PDE and ODE. I used the callbacks to monitor the changes of these inferred parameters during the training process. …

ZPLai updated 2 years ago
4
neka-nat/ddp-gym #2

unconverge

Hi! I've been learning the DDP method recently and also upvoted your brilliant implementation. It seems like you are using the MPC version of ilqr? I changed it into the normal version, but it did not converge any …

TianrongChen updated 5 years ago
1
fwhdzh/qtcp-ns3 #1

some useful links

Here are some repositories for reference. https://github.com/tkn-tub/ns3-gym/tree/master/scratch/rl-tcp. There are some practical issues in applying reinforcement learning to congestion control. https://g…

SoonyangZhang updated 2 years ago
10
davidmigloz/langchain_dart #466

Mention which packages to import in documentation tutorials

### System Info I'm trying to do the tutorials and there are many things that aren't correct. One that is a blocker preventing me from learning Retrieval is the fact that it seems like all document l…

ScottS2017 updated 4 months ago
8
