original-transformer Search Results

1000+ results
for original-transformer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

foundation-model-stack/fms-acceleration #87

Support for Iterable Datasets

Transformers Lib while getting the data loader (for single gpu training) adds a wrapper collator over the original collator if the dataset is not of class HF Dataset, essentially for the cases where t…

kmehant updated 1 week ago
1
meta-llama/llama-stack #191

Are there any available tools that can convert the original …

Are there any available tools that can convert the original .pth model files downloaded from Meta into a format usable by stack, or convert them to .safetensors format? I tried the tool from https://g…

Itime-ren updated 5 days ago
2
UKPLab/sentence-transformers #2970

TypeError: 'DataParallel' object is not subscriptable when r…

After getting same error while trying the 2d Matryoshka loss, I ran the exact [2d_matryoshka_nli](https://github.com/UKPLab/sentence-transformers/blob/master/examples/training/matryoshka/2d_matryosh…

abedkhooli updated 5 days ago
2
dart-archive/observe #73

transformer generated setter missing original metadata

issue find in stackoverflow [Wrong decode() from redstone_mapper about observable object in web app](http://stackoverflow.com/questions/30855297/wrong-decode-from-redstone-mapper-about-observable-obj…

Ticore updated 9 years ago
1
grafana/grafana #93433

Transformation: New plugin type and/or new javascript transf…

A new plugin type to be able to add new more complex / non existent transformations. As an alternative, a arbitrary javascript transformer would do this too. Original discussion: #38540

thojo0 updated 1 week ago
1
JangYeongSil/JettaRLLLM #1

give you an advice

Hi, friend, maybe you can try to use the llama architecture instead of the original Transformer?(You can refer to llama architecture in llama2.c)

win10ogod updated 2 weeks ago
1
Nerogar/OneTrainer #501

[Feat]: Flux-Dev2Pro Support

### Describe your use-case. Read through [here](https://medium.com/@zhiwangshi28/why-flux-lora-so-hard-to-train-and-how-to-overcome-it-a0c70bc59eaf). Its a distilled model that allows better fin…

Vigilence updated 2 days ago
3
keras-team/keras-io #1907

Error in Vision Transformer examples

### Issue Type Documentation Bug ### Source source ### Keras Version 2.14 ### Custom Code Yes ### OS Platform and Distribution Ubuntu 22.04 ### Python version 3.10 …

angelo-ml updated 3 weeks ago
1
huggingface/transformers #33783

[Feature Ambiguity] How exactly do I activate YaRN for Qwen …

Sorry for the lack of template, but this not so much a feature request or help request, but rather a request for "clarification" and a possible config/implementation bug with Qwen 2.5. The Qwen 2.5…

Downtown-Case updated 4 days ago
5
microsoft/presidio #1463

Transformer model may be ignoring general entity types

Amazing project, thank you so much for `presidio` and all of the work here. I am noticing with a few different Hugging Face transformer models, that some of the listed entities associated with the …

michhar updated 1 day ago
5

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for original-transformer

1000+ results
for original-transformer