neelsjain NEFTune issues - Githubissues

neelsjain / NEFTune

Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning

MIT License

387 stars 20 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Update README.md

#18 jlhe2000 closed 6 months ago
1
Update README.md

#17 LiChao-cy opened 11 months ago
0
Update README to announce integration with Ludwig

#16 arnavgarg1 closed 1 year ago
0
quesiton about the noise injection location

#15 zhhao1 closed 1 year ago
1
how to add multi-turns function?

#14 Dhaizei closed 1 year ago
1
can you suggest about chat tuning？

#13 Dhaizei closed 1 year ago
0
No eval_scoring.py file in the repo

#12 cuongtran-uva closed 1 year ago
3
how to add it to transformers‘s mdoel??

#11 Dhaizei closed 1 year ago
2
Output 0 of ViewBackward0 is a view

#10 clechristophe closed 1 year ago
3
Unable to evaluate the model

#9 sglucas closed 11 months ago
9
i think "return model" should be within the scope of the NEFTune function, not outside it

#8 Kayce001 closed 1 year ago
1
Question about output embedding from noised tokens

#7 isamu-isozaki closed 1 year ago
1
More benchmarks

#6 eyuansu62 closed 1 year ago
1
{RecursionError}maximum recursion depth exceeded while calling a Python object

#5 NormXU closed 1 year ago
3
Fix typo in utils.py

#4 eltociear opened 1 year ago
0
QLoRA implementation

#3 ghost closed 1 year ago
1
RuntimeError: ``sharded_state_dict`` can only be used when parameters are flatten and sharded.

#2 Sniper970119 closed 1 year ago
9
[Reimplementation] Unable to reproduce results -- Training loss curves are similar

#1 maximegmd closed 11 months ago
11