-
Transformers Lib while getting the data loader (for single gpu training) adds a wrapper collator over the original collator if the dataset is not of class HF Dataset, essentially for the cases where t…
-
Are there any available tools that can convert the original .pth model files downloaded from Meta into a format usable by stack, or convert them to .safetensors format? I tried the tool from https://g…
-
After getting same error while trying the 2d Matryoshka loss, I ran the exact
[2d_matryoshka_nli](https://github.com/UKPLab/sentence-transformers/blob/master/examples/training/matryoshka/2d_matryosh…
-
issue find in stackoverflow
[Wrong decode() from redstone_mapper about observable object in web app](http://stackoverflow.com/questions/30855297/wrong-decode-from-redstone-mapper-about-observable-obj…
-
A new plugin type to be able to add new more complex / non existent transformations.
As an alternative, a arbitrary javascript transformer would do this too.
Original discussion: #38540
-
Hi, friend, maybe you can try to use the llama architecture instead of the original Transformer?(You can refer to llama architecture in llama2.c)
-
### Describe your use-case.
Read through [here](https://medium.com/@zhiwangshi28/why-flux-lora-so-hard-to-train-and-how-to-overcome-it-a0c70bc59eaf).
Its a distilled model that allows better fin…
-
### Issue Type
Documentation Bug
### Source
source
### Keras Version
2.14
### Custom Code
Yes
### OS Platform and Distribution
Ubuntu 22.04
### Python version
3.10
…
-
Sorry for the lack of template, but this not so much a feature request or help request, but rather a request for "clarification" and a possible config/implementation bug with Qwen 2.5.
The Qwen 2.5…
-
Amazing project, thank you so much for `presidio` and all of the work here.
I am noticing with a few different Hugging Face transformer models, that some of the listed entities associated with the …