-
Welcome to the 'DSWP' Team, good to see you here.
This issue will help readers gain all the guidance one needs to know about Q-Learning: a tutorial on Q-Learning and how it's applied using sam…
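Since the tutorial itself is truncated here, a minimal sketch of tabular Q-Learning may help orient readers. The toy chain environment, hyperparameters, and episode count below are illustrative choices, not taken from the tutorial:

```python
import random

# Hypothetical toy setup: 5 states in a chain, 2 actions (0 = left, 1 = right),
# reward 1.0 only on reaching the last state.
N_STATES, N_ACTIONS = 5, 2
alpha, gamma, epsilon = 0.1, 0.9, 0.1

Q = [[0.0] * N_ACTIONS for _ in range(N_STATES)]

def step(state, action):
    # Toy dynamics: action 1 moves right, action 0 moves left.
    nxt = min(state + 1, N_STATES - 1) if action == 1 else max(state - 1, 0)
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done

for episode in range(500):
    state, done = 0, False
    while not done:
        # epsilon-greedy action selection
        if random.random() < epsilon:
            action = random.randrange(N_ACTIONS)
        else:
            action = max(range(N_ACTIONS), key=lambda a: Q[state][a])
        nxt, reward, done = step(state, action)
        # Q-Learning update: bootstrap from the greedy value of the next state
        target = reward + (0.0 if done else gamma * max(Q[nxt]))
        Q[state][action] += alpha * (target - Q[state][action])
        state = nxt
```

After training, moving right should look better than moving left from the start state, i.e. `Q[0][1] > Q[0][0]`.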
-
Good thing I kept all my research work private; my deep Q-network code was already stolen.
Feel free to contact me if you need help with CloudSim scheduling and the energy part; I have worked on reinforcement learnin…
-
Hi!
I’m trying to run SpaceInvaders, but I ran into a problem: "Game not found: Did you make sure to import the ROM?". Then I tried the solution by MaximusWudy, which involves renaming the files to .a26 (as a…
-
This seems to be a conceptual issue. In the pacman example the epsilon-greedy policy is annealed over time.
If the network is run for more than a few hours, epsilon eventually goes to 0 and the distributi…
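A common way to avoid epsilon collapsing to 0 during long runs is to anneal it toward a small floor instead. A sketch of such a schedule; the start value, floor, and step count below are illustrative, not taken from the pacman example:

```python
# Linear epsilon annealing with a floor, so exploration never vanishes
# entirely. Parameters are assumptions for illustration.
EPS_START, EPS_END, ANNEAL_STEPS = 1.0, 0.05, 100_000

def epsilon_at(step):
    # Fraction of annealing completed, clipped at 1.0 so epsilon
    # stays at the floor for the rest of training.
    frac = min(step / ANNEAL_STEPS, 1.0)
    return EPS_START + frac * (EPS_END - EPS_START)

print(epsilon_at(0))       # 1.0 at the start
print(epsilon_at(50_000))  # halfway through the schedule
print(epsilon_at(10**9))   # stays near the 0.05 floor, never reaches 0
```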
neale updated 7 years ago
-
**Is your feature request related to a problem? Please describe.**
We would like to devise a reinforcement learning approach that leverages progressive learning to improve its in-task predictions in mapping s…
-
### Discussed in https://github.com/GSSoC24/Contributor/discussions/511
Originally posted by **Aditi22Bansal** on July 21, 2024
@sanjay-kv
I made these two PRs in the repository ALL_INDIA_HACK…
-
This will be a precursor to the machine learning model we will use for detecting jammers and jammed signals.
For now, it will consist of a simple "on" or "off" sequence that the ML model will learn…
-
A question: I ran LoRA fine-tuning with the two quantized models qwen2-7B-instruct-AWQ and qwen2-7B-instruct-GPTQ-int4, and the loss does not converge in either case. The learning rate stops changing after a few steps. Adjusting the learning-rate and lora-rank did not help.
With the same data, LoRA fine-tuning on qwen2-7B-instruct converges normally.
-
Hi,
In most RL implementations, at the start of each episode the environment is reset to its initial state (in the SARSA code, for instance: state = env.reset()), i.e., the same start point and goals …
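Whether reset() returns the same fixed start state or a randomized one is a property of the environment, not of SARSA itself; the episode loop looks identical either way. A minimal sketch with a hypothetical ToyEnv supporting both behaviors:

```python
import random

# ToyEnv is a made-up 5-state chain environment for illustration.
# With random_start=False, reset() always returns state 0;
# with random_start=True, each episode begins at a random state.
class ToyEnv:
    def __init__(self, random_start=False):
        self.random_start = random_start
        self.state = 0

    def reset(self):
        self.state = random.randrange(5) if self.random_start else 0
        return self.state

    def step(self, action):
        # action 1 moves right; episode ends at the last state.
        self.state = min(self.state + action, 4)
        done = self.state == 4
        return self.state, (1.0 if done else 0.0), done

env = ToyEnv(random_start=False)
for episode in range(3):
    state = env.reset()   # called once per episode, as in the SARSA code
    done = False
    while not done:
        state, reward, done = env.step(1)
```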
-
Previously I was using all the actions for a single experience tuple, but it seems I should have optimized a single action per experience tuple.
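For a transition (s, a, r, s'), only the Q-value of the taken action a receives a TD target; the other actions keep their current predictions, so their error (and gradient) is zero. A sketch with illustrative numbers (the Q-values and reward below are made up):

```python
# Hypothetical network output for state s over 3 actions.
q_pred = [0.2, -0.1, 0.5]
action, reward, gamma = 1, 1.0, 0.9
q_next_max = 0.4   # assumed max_a' Q(s', a') from a target network

# Build the target vector: copy the predictions, then overwrite only
# the taken action's entry with the TD target r + gamma * max Q(s', .).
# The copied entries yield zero error, so only the chosen action is
# optimized for this experience tuple.
target = list(q_pred)
target[action] = reward + gamma * q_next_max

errors = [t - p for t, p in zip(target, q_pred)]
print(errors)  # nonzero only at the taken action's index
```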