-
https://huggingface.co/docs/transformers/v4.38.2/perf_train_gpu_one#gradient-accumulation
In the `TrainingArguments` passed to `SFTTrainer`, we can likely reduce the total GPU memory required to tr…
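The linked docs describe gradient accumulation: keep the per-device micro-batch small and step the optimizer only once every N micro-batches, trading a little compute time for a large memory saving. A minimal sketch of the arithmetic (the helper function is ours; the commented `per_device_train_batch_size` and `gradient_accumulation_steps` names are the actual `TrainingArguments` fields from the linked page):

```python
# Gradient accumulation trades steps for memory: the optimizer steps
# once per `accum_steps` micro-batches, so training sees the larger
# effective batch without the larger batch's activation memory.

def effective_batch_size(per_device_batch: int, accum_steps: int,
                         num_devices: int = 1) -> int:
    """Effective global batch size under gradient accumulation."""
    return per_device_batch * accum_steps * num_devices

# e.g. TrainingArguments(per_device_train_batch_size=1,
#                        gradient_accumulation_steps=8) on one GPU:
print(effective_batch_size(1, 8))  # 8
```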
-
How can the model be evaluated on GLEU tasks? The tasks are text-only, but the paper says: “Similar to PLM, when the prefix image is none, this task degenerates into a “text-to-image generation” task, f…
-
The system environment is Windows + torch 2.4.1 + CUDA 12.4. The error message is as follows:
LayerUtility: JoyCaption2
apply_chat_template requires jinja2>=3.1.0 to be installed. Your version is 3.0.3.
2024-10-12 01:11:09,476 - root - INFO - got prom…
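Given the log, upgrading jinja2 past the minimum (e.g. `pip install -U "jinja2>=3.1.0"`) should clear this error. A quick sketch of the version comparison `apply_chat_template` is effectively doing (the helper name is ours, not from transformers):

```python
# Compare a dotted version string against the required minimum.
def meets_minimum(ver: str, minimum=(3, 1, 0)) -> bool:
    """True if `ver` is at least `minimum` (major, minor, patch)."""
    parts = tuple(int(p) for p in ver.split(".")[:3])
    return parts >= minimum

print(meets_minimum("3.0.3"))  # False: the installed version is too old
print(meets_minimum("3.1.4"))  # True: would satisfy apply_chat_template
```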
-
Currently, LinkML's language definition includes constructs that aren't fully supported by all generators.
Identifying these gaps often requires trial and error (i.e., finding out the hard way), creat…
-
Hello!
I did some research (using llama.cpp) and found that quantizing the input and embedding tensors to f16 and the other tensors to q5_k or q6_k gives excellent results, almost indistinguisha…
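For reference, a mixed-precision quantization like this can be expressed with llama.cpp's quantize tool via its per-tensor overrides; this is a hypothetical invocation with placeholder file names, so verify the flag names against your build's `--help`:

```shell
# Override the token-embedding and output tensors to f16 while the
# trailing argument sets the default type (q6_k) for everything else.
./llama-quantize \
  --token-embedding-type f16 \
  --output-tensor-type f16 \
  model-f16.gguf model-mixed-q6_k.gguf q6_k
```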
-
### Prerequisites
- [X] I am running the latest code. Mention the version if possible as well.
- [X] I carefully followed the [README.md](https://github.com/ggerganov/llama.cpp/blob/master/README.md)…
-
Hi!
I'm not sure if this is a problem that can be solved, or needs to be solved. Basically, we want to make a kind of hybrid tokenizer, in which we add a whole bunch of whole words to a tokenizer, …
-
E.g. in a pure Clojure/Cljs/Datomic stack this might not be necessary. For others, metosin/spec-tools transformers might be a better option.
-
Subscribe to this issue and stay notified about new [daily trending repos in PureScript](https://github.com/trending/purescript?since=daily).
-
### Description
I am getting the following error when I load the model in `predict` mode; it works perfectly in `eval` mode.
```
ValueError: Incompatible shapes for matmul arguments: (8, 1, 64) and (2…