-
### Dependency
- #60
### Overview
We need to create epic issues for each project and CoP so that we can manage their marketing issues.
#### Details
Management includes
- Identifying missing is…
-
Hi!
Let's bring the reinforcement learning course to the entire Russian-speaking community 🌏
Would you want to translate? Please follow the 🤗 [TRANSLATING guide](https://github.com/huggingface/tran…
-
import logging
import os
import json
import torch
from datasets import load_from_disk
from transformers import TrainingArguments
from trl import SFTTrainer
from unsloth import FastLanguageModel…
-
Hi all, I am trying to fine-tune models on extremely long contexts.
I've tested the training setup below, and I managed to finetune:
- llama3.1-1B with a max_sequence_length of 128 * 1024 tokens
…
-
Hiya! Kudos for a great layout design!
I'd like to add Gallium to my EPKL program, to stand beside Graphite (and Sturdy, etc) as an example of a good and popular newer layout. I think I like the ro…
-
@enricoande
[1] https://github.com/enricoande/reinforcement_learning_examples/blob/95627db2a323535153e711a23f5519ecf7409f38/invertedpendulum/Sarsa/episodeFA.m#L35
It appears that here `phi` cor…
-
Cool (learning?) project, I guess!
## Underlying problem
A little note I've found: As far as I know/see, passwords are not being hashed, are they?
https://github.com/search?q=repo%3Ashanirub%…
-
Hello authors, thanks for your quick responses on my previous issues!
I'm making a new issue to ask whether these are the right hyperparameters for training the `bge-en-icl`. I'm finding that I ca…
-
## Description
**Update**: Previously it was reported that the OOM was only for BNB, but now it is observed for Quantized Peft in general even for GPTQ. See #106
Outliers
![image](https://gith…
-
## ❓ Questions on how to use PyTorch3D
NOTE: Please look at the existing list of Issues tagged with the label [`question`](https://github.com/facebookresearch/pytorch3d/issues?q=label%3Aquest…