q-learning Search Results

1000+ results for q-learning

Best match


hackforla/marketing #62

Create project and CoP marketing epics

### Dependency - #60 ### Overview We need to create epic issues for each project and CoP so that we can manage their marketing issues #### Details Management includes - Identifying missing is…

ExperimentsInHonesty updated 3 weeks ago · 4 comments

huggingface/deep-rl-class #394

Translating to Russian

Hi! Let's bring the reinforcement learning course to all the Russian-speaking community 🌏 Would you want to translate? Please follow the 🤗 [TRANSLATING guide](https://github.com/huggingface/tran…

artyomboyko updated 10 months ago · 53 comments

unslothai/unsloth #1037

Fine tune and infer llama3 with cpu

import logging import os import json import torch from datasets import load_from_disk from transformers import TrainingArguments from trl import SFTTrainer from unsloth import FastLanguageModel…

SidneyLann updated 1 week ago · 18 comments

unslothai/unsloth #1291

Extremely long context finetuning

Hi all, I am trying to fine-tune models in extremely long contexts. I've tested the training setup below, and I managed to finetune: - llama3.1-1B with a max_sequence_length of 128 * 1024 tokens …

GianlucaDeStefano updated 2 weeks ago · 2 comments

GalileoBlues/Gallium #5

Add this layout to EPKL?

Hiya! Kudos for a great layout design! I'd like to add Gallium to my EPKL program, to stand beside Graphite (and Sturdy, etc) as an example of a good and popular newer layout. I think I like the ro…

DreymaR updated 1 month ago · 8 comments

enricoande/reinforcement_learning_examples #1

Possible bugs?

@enricoande [1] https://github.com/enricoande/reinforcement_learning_examples/blob/95627db2a323535153e711a23f5519ecf7409f38/invertedpendulum/Sarsa/episodeFA.m#L35 It appears that here `phi` cor…

amneetb updated 2 years ago · 1 comment

shanirub/ecommerce #24

Passwords should be hashed

Cool (learning?) project, I guess! ## Underlying problem A little note I've found: As far as I know/see, passwords are not being hashed, are they? https://github.com/search?q=repo%3Ashanirub%…

rugk updated 2 months ago · 1 comment

FlagOpen/FlagEmbedding #1193

A second question about reproducing bge-en-icl

Hello authors, thanks for your quick responses on my previous issues! I'm making a new issue to ask whether these are the right hyperparameters for training the `bge-en-icl`. I'm finding that I ca…

greeneggsandyaml updated 3 weeks ago · 4 comments

foundation-model-stack/fms-acceleration #50

Quantized Peft Benchmark Experiments Run Out of Memory with …

## Description **Update**: Previously it was reported that the OOM was only for BNB, but now it is observed for Quantized Peft in general even for GPTQ. See #106 Outliers ![image](https://gith…

achew010 updated 2 weeks ago · 1 comment

facebookresearch/pytorch3d #1864

How to customize the output path for generated .obj files wi…

## ❓ Questions on how to use PyTorch3D NOTE: Please look at the existing list of Issues tagged with the label ['question`](https://github.com/facebookresearch/pytorch3d/issues?q=label%3Aquest…

chenchunhao9125 updated 2 months ago · 1 comment

