q-learning Search Results

1000+ results for q-learning

Best match


dennybritz/reinforcement-learning #217

Deep Q Learning, neither works with tensorflow 1.x nor with …

After the release of TensorFlow 2.0, several enhancements have been made to both versions. Some functions were removed from 1.x, and some were deprecated and replaced in TensorFlow 2…

azharsalman updated 3 years ago
1
learningequality/kolibri #12733

Remove "Activity" tab from learner view

## Overview As part of the information architecture refactor, we need to do a little bit more follow up on the new learner overview/report page - `ReportsLearnerReportPage`. Please review https://…

marcellamaki updated 2 weeks ago
3
Engineer1999/Double-Deep-Q-Learning-for-Resource-Allocation #5

Question about the meaning of constant “3”

Hi, thank you very much for sharing the code. It is very helpful. I have a question about the meaning of constant "3". In many places of the codes, "3" is directly used to define the parameters. s…

szgtvt updated 4 years ago
3
ZhengyiLuo/PHC #98

better manually clean up gpu memory when loading motions

CUDA out-of-memory errors often occur while evaluating the model (which is called periodically after 1500 iterations of training). In motion_lib_real.py line 199 we load the motions in memory and …

luoye2333 updated 1 week ago
2
hackforla/marketing #62

Create project and CoP marketing epics

### Dependency - #60 ### Overview We need to create epic issues for each project and CoP so that we can manage their marketing issues #### Details Management includes - Identifying missing is…

ExperimentsInHonesty updated 2 weeks ago
4
huggingface/deep-rl-class #394

Translating to Russian

Hi! Let's bring the reinforcement learning course to all the Russian-speaking community 🌏 Would you want to translate? Please follow the 🤗 [TRANSLATING guide](https://github.com/huggingface/tran…

artyomboyko updated 10 months ago
53
unslothai/unsloth #1037

Fine tune and infer llama3 with cpu

import logging import os import json import torch from datasets import load_from_disk from transformers import TrainingArguments from trl import SFTTrainer from unsloth import FastLanguageModel…

SidneyLann updated 1 week ago
18
unslothai/unsloth #1291

Extremely long context finetuning

Hi all, I am trying to fine-tune models in extremely long contexts. I've tested the training setup below, and I managed to finetune: - llama3.1-1B with a max_sequence_length of 128 * 1024 tokens …

GianlucaDeStefano updated 2 weeks ago
2
shanirub/ecommerce #24

Passwords should be hashed

Cool (learning?) project, I guess! ## Underlying problem A little note I've found: As far as I know/see, passwords are not being hashed, are they? https://github.com/search?q=repo%3Ashanirub%…

rugk updated 2 months ago
1
enricoande/reinforcement_learning_examples #1

Possible bugs?

@enricoande [1] https://github.com/enricoande/reinforcement_learning_examples/blob/95627db2a323535153e711a23f5519ecf7409f38/invertedpendulum/Sarsa/episodeFA.m#L35 It appears that here `phi` cor…

amneetb updated 2 years ago
1
