issues
search
foundation-model-stack
/
fms-hf-tuning
🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.
Apache License 2.0
28
stars
48
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
ci: add a github workflow to label pull requests based on their title
#298
HarikrishnanBalagopal
closed
2 months ago
1
feat: Enable JSON dataset compatibility
#297
willmj
closed
3 months ago
1
feat: Example log controller yaml with training state
#296
seshapad
closed
3 months ago
0
Rename all fixtures with correct .jsonl extension
#295
willmj
closed
3 months ago
2
FIX: Metrics file epoch indexing starting from 0
#294
Abhishek-TAMU
closed
3 months ago
5
feat: Added additional events such as on_step_begin, on_optimizer_step, on_substep_end
#293
seshapad
closed
3 months ago
2
bug: certain versions of TRL as not available in some environments
#292
HarikrishnanBalagopal
closed
3 months ago
1
feat: add save_model_dir flag where final checkpoint saved
#291
anhuong
closed
3 months ago
2
Ensure additional metadata to trackers don't throw error in happy case.
#290
dushyantbehl
closed
3 months ago
1
bug: tracker params in happy case can cause exception.
#289
dushyantbehl
closed
3 months ago
0
Always update setuptools to latest
#288
jbusche
closed
3 months ago
1
Add functionality to free disk space from Github Actions
#287
willmj
closed
3 months ago
1
fix: bug where the logger was not being used properly
#286
HarikrishnanBalagopal
closed
3 months ago
0
bug: use the logger to log
#285
HarikrishnanBalagopal
closed
3 months ago
0
feat: install fms-acceleration to enable qlora
#284
anhuong
closed
2 months ago
1
[Draft] Add dev stage to dockerfile
#283
alex-jw-brooks
opened
3 months ago
0
feat: [Trainer controller] Configuration to set logging level for trigger log in the trainer controller
#282
seshapad
closed
3 months ago
1
Add unit test to verify target_modules defaults correctly
#281
willmj
closed
3 months ago
2
feat: Add DataClass Arguments to Activate Padding-Free and MultiPack Plugin and FastKernels
#280
achew010
closed
2 months ago
4
fix: do not add special tokens for custom tokenizer
#279
kmehant
closed
3 months ago
0
Enabling tests for prompt tuning
#278
Abhishek-TAMU
closed
3 months ago
0
bug: multiple runs being listed in the AIM dashboard when using multiple GPUs with `accelerate launch`
#277
HarikrishnanBalagopal
closed
3 months ago
0
Releasev1.1.0
#276
jbusche
closed
4 months ago
0
Revert "limit peft deps until investigate (#274)"
#275
anhuong
closed
4 months ago
0
deps: limit peft deps
#274
anhuong
closed
4 months ago
0
fix run evaluation to get base model path
#273
anhuong
closed
3 months ago
1
feat: Support pretokenized
#272
kmehant
closed
3 months ago
1
feat: Need a way to execute some cleanup calls before the program exits or crashes.
#271
dushyantbehl
opened
4 months ago
1
Fix: Removal of transformers logger and addition of python native logger
#270
Abhishek-TAMU
closed
3 months ago
4
Set default value of target_modules to be None in LoraConfig
#269
willmj
closed
3 months ago
3
fix: multiple runid creation bug with distributed training
#268
dushyantbehl
closed
3 months ago
2
fix: logic for getting tracker config
#267
HarikrishnanBalagopal
closed
4 months ago
0
bug: if `tracker_configs` is `None` then the variable `config` is not defined
#266
HarikrishnanBalagopal
closed
4 months ago
0
docs: fix the instructions for running with LORA
#265
HarikrishnanBalagopal
closed
4 months ago
0
feat: logging control operation
#264
seshapad
closed
3 months ago
2
feat: All metric handling changes
#263
seshapad
closed
4 months ago
1
Add config_utils tests
#262
aluu317
closed
4 months ago
2
feat: Add a dockerfile argument to enable aimstack
#261
dushyantbehl
closed
3 months ago
11
Data custom collator
#260
Ssukriti
closed
4 months ago
4
refactor code to preprocess datasets
#259
Ssukriti
closed
4 months ago
0
fix: remove lm_head for granite with llama arch models
#258
Ssukriti
closed
4 months ago
0
docs: Add documentation on experiment tracking.
#257
dushyantbehl
closed
3 months ago
3
bug: On save event added to callback
#256
seshapad
closed
4 months ago
1
Refactor formats
#255
Ssukriti
closed
4 months ago
1
fix: Added correct link in main readme for the trainer-controller readme
#254
seshapad
closed
4 months ago
1
doc: Broken link to trainer controller readme
#253
seshapad
closed
4 months ago
0
V100rc1 release
#252
olson-ibm
closed
4 months ago
1
Replace shutil.copytree() to fix permission error
#251
olson-ibm
closed
4 months ago
0
deps: Update transformers lower bound version
#250
Abhishek-TAMU
closed
4 months ago
1
Move default operations and metrics to variables
#249
alex-jw-brooks
closed
4 months ago
0
Previous
Next