-
Good thing I kept all my research work private; my deep Q-network code has already been stolen.
Feel free to contact me if you need help with the CloudSim scheduling and energy part; I have worked on reinforcement learnin…
-
Win10, Node 22.1.0, Go 1.23.3.
When I run `go run main.go`, the following error is raised:
..\..\sql\setup.go:17:2: no required module provides package github.com/delaneyj/realworld-datastar/sql/zz; to ad…
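Since the missing package lives under the module's own `sql/zz` path, it is most likely generated code that has not been produced yet. A minimal sketch of the usual first steps, assuming the repository drives its code generation through `go:generate` directives (check the project's README for the actual codegen command):

```
# Pull in any missing dependencies declared in go.mod.
go mod tidy

# Regenerate packages produced by go:generate directives, if any.
go generate ./...
```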
-
CUDA_VISIBLE_DEVICES=6,7 torchrun --nproc_per_node 2 \
-m FlagEmbedding.llm_reranker.finetune_for_layerwise.run \
--output_dir ./results/reranker/bge-reranker-v2-minicpm-layerwise \
--model_name_or…
-
Hello,
I would like to know what you think about having some standalone implementations as functions that take in the environment and other parameters and return the trained policy.
Here is an examp…
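The message is cut off, but a minimal sketch of the proposed shape (hypothetical names, assuming a discrete Gymnasium-style environment) could look like this: the function takes the environment plus hyperparameters and hands back the trained policy.

```python
import numpy as np

def train_q_learning(env, episodes=500, alpha=0.1, gamma=0.99, epsilon=0.1):
    """Standalone trainer: environment and hyperparameters in, policy out."""
    q = np.zeros((env.observation_space.n, env.action_space.n))
    for _ in range(episodes):
        state, _ = env.reset()
        done = False
        while not done:
            # Epsilon-greedy action selection.
            if np.random.rand() < epsilon:
                action = env.action_space.sample()
            else:
                action = int(np.argmax(q[state]))
            next_state, reward, terminated, truncated, _ = env.step(action)
            done = terminated or truncated
            # Standard Q-learning update.
            q[state, action] += alpha * (
                reward + gamma * np.max(q[next_state]) - q[state, action]
            )
            state = next_state
    # The returned policy is the greedy policy over the learned Q-table.
    return lambda s: int(np.argmax(q[s]))
```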
-
# Deep Q-Network (DQN) on LunarLander-v2 | Chan`s Jupyter
In this post, we will take a hands-on lab of a simple Deep Q-Network (DQN) on the OpenAI LunarLander-v2 environment. This is the coding exercise fr…
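As a rough illustration of the setup such a lab starts from (not the post's actual code; this assumes the classic Gym API, where `reset()` returns just the observation), the Q-network is a small MLP mapping the 8-dimensional LunarLander state to one Q-value per action:

```python
import gym
import torch
import torch.nn as nn

env = gym.make("LunarLander-v2")

# Small MLP: 8-dimensional state in, one Q-value per discrete action out.
q_net = nn.Sequential(
    nn.Linear(env.observation_space.shape[0], 64),
    nn.ReLU(),
    nn.Linear(64, 64),
    nn.ReLU(),
    nn.Linear(64, env.action_space.n),
)

state = env.reset()  # classic Gym (<0.26) API
with torch.no_grad():
    q_values = q_net(torch.as_tensor(state, dtype=torch.float32))
action = int(q_values.argmax())  # greedy action from the (untrained) net
```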
-
Excuse me! What does Q-V learning mean? The algorithm in `Q_V_Garbage.m` looks more like a combination of TD(0) for evaluating v_pi with Sarsa-style control than like Q-learning. Can you …
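For comparison, the textbook forms of the three update rules being contrasted (written here for reference, not quoted from the repository):

```latex
% TD(0) evaluation of v_\pi: updates a state-value estimate.
V(S_t) \leftarrow V(S_t) + \alpha \bigl[ R_{t+1} + \gamma V(S_{t+1}) - V(S_t) \bigr]

% Sarsa (on-policy control): bootstraps on the action actually taken next.
Q(S_t, A_t) \leftarrow Q(S_t, A_t) + \alpha \bigl[ R_{t+1} + \gamma Q(S_{t+1}, A_{t+1}) - Q(S_t, A_t) \bigr]

% Q-learning (off-policy control): bootstraps on the greedy next action.
Q(S_t, A_t) \leftarrow Q(S_t, A_t) + \alpha \bigl[ R_{t+1} + \gamma \max_a Q(S_{t+1}, a) - Q(S_t, A_t) \bigr]
```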
-
Hello, part issue, part question.
When configuring my LSP, I noticed that the configurations below show their **default** values, but do **not** show other valid values (variants).
I believe t…
-
# Title of the Talk: No Code SLM Finetuning with MonsterAPI
## Abstract of the Talk:
Dive into the world of no-code large language model (LLM) finetuning in this informative talk presented by Mons…
-
I see that you are using a zero vector for the rewards and only updating the entry that corresponds to the chosen action here:
https://github.com/AxiomaticUncertainty/Deep-Q-Learning-for-Tic-Tac-Toe/blob/c5c0…
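For context, the standard alternative to a zero target vector is to start the target from the network's own predictions and overwrite only the entry for the action actually taken, so the untaken actions contribute no gradient. A minimal sketch with illustrative names (not the repository's code):

```python
import numpy as np

def build_target(q_pred, action, reward, q_next, gamma=0.99, done=False):
    """Copy current predictions, then overwrite only the taken action."""
    target = q_pred.copy()
    bootstrap = 0.0 if done else gamma * np.max(q_next)
    target[action] = reward + bootstrap
    return target
```

With a zero vector as the starting point, every untaken action's Q-value would also be pushed toward zero on each update, which may be the concern behind this comment.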
-
Hi, first of all, thanks for the great tool. I am still learning how to use it, so apologies if any of this is trivial.
Inspecting the MULTIFACED_CARDS.txt file shows that it (1) is outdated and (2) con…