-
Hi, @Haiyang-W! You have done very interesting work. However, I encountered a problem when calculating the FLOPs of the GiT model. When I run `python tools/analysis_tools/get_flops.py`, it outputs 0 F…
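For what it's worth, one way to cross-check is a hook-based counter such as fvcore: if it reports non-zero FLOPs, the problem likely lies in how `get_flops.py` traces GiT's forward path. A minimal sketch, assuming a hypothetical model builder and input shape (neither is the repo's actual script):

```python
import torch
from fvcore.nn import FlopCountAnalysis

model = build_git_model()            # hypothetical builder; substitute the repo's config/model
model.eval()
dummy = torch.randn(1, 3, 224, 224)  # assumed input shape

# fvcore traces the forward pass and sums per-op FLOP counts
flops = FlopCountAnalysis(model, dummy)
print(f"{flops.total() / 1e9:.2f} GFLOPs")
```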
-
Issue when saving `unsloth/mistral-7b-instruct-v0.3-bnb-4bit` after training, both in Kaggle and in [gguf-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo)
I have tried converting…
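For reference, the export path I would expect to work is roughly the following sketch (the model name is from the issue; the output directory and `quantization_method` are assumptions):

```python
from unsloth import FastLanguageModel

# Load the 4-bit model named in the issue
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/mistral-7b-instruct-v0.3-bnb-4bit",
    load_in_4bit=True,
)
# ... fine-tuning happens here ...

# Export merged weights to GGUF (assumed quantization method)
model.save_pretrained_gguf("mistral-gguf", tokenizer, quantization_method="q4_k_m")
```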
-
Trying to run the image encoder model, I followed the `extract_feature.sh` example: loading the pretrained .pth checkpoint `sapiens_1b_epoch_173_clean.pth` and the config `pretrain/configs/sapiens_mae/humans_300m_test…
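A quick sanity check worth running first is inspecting the checkpoint's keys against the config's architecture; a sketch (the checkpoint path is from the issue, everything else is generic PyTorch):

```python
import torch

ckpt = torch.load("sapiens_1b_epoch_173_clean.pth", map_location="cpu")
state = ckpt.get("state_dict", ckpt)  # mmengine-style checkpoints nest weights under "state_dict"
print(len(state))                     # number of parameter tensors
print(list(state)[:5])                # first few keys; these should match the config's model
```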
-
Unsloth: You have 1 CPUs. Using `safe_serialization` is 10x slower.
We shall switch to Pytorch saving, which will take 3 minutes and not 30 minutes.
To force `safe_serialization`, set it to `None` i…
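In other words, per this message, passing `safe_serialization=None` to whichever save call produced it forces safetensors output despite the single-CPU warning. A sketch (the output directory is an assumption):

```python
# Force safetensors saving even on a single-CPU machine (slow, per the warning above)
model.save_pretrained("outputs/merged", safe_serialization=None)
tokenizer.save_pretrained("outputs/merged")
```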
-
I am using Google Colab and downloaded the "Llama3-8B-1.58-100B-tokens" model,
but when I run: !python utils/convert-hf-to-gguf-bitnet.py models/Llama3-8B-1.58-100B-tokens --outtype f32
initially it star…
-
### What happened?
Hi,
When I use llama.cpp to deploy a pruned llama3.1-8b model, an unbearable performance degradation appears:
We used a structured pruning method (LLM-Pruner) to prune llama3.1-8b, w…
-
Hi, I am interested in your work, but I have a question about your medium_model.py: it seems that in your SpecformerMedium class, you didn't apply
```
mha_eig = self.mha_norm(eig)
…
```
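For context, the pre-norm residual pattern the question seems to refer to looks roughly like this sketch (attribute names assumed from the quoted line, not copied from medium_model.py):

```python
# Pre-norm self-attention over eigenvalue encodings, with a residual connection
mha_eig = self.mha_norm(eig)                      # LayerNorm applied before attention
mha_eig, _ = self.mha(mha_eig, mha_eig, mha_eig)  # multi-head self-attention
eig = eig + self.mha_dropout(mha_eig)             # residual add back onto the input
```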
-
Once everyone has migrated to nFONLL, we should remove all the branching for aFONLL and support a single FONLL implementation.
E.g., the following should keep only the `if` branch
https://g…
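Purely as an illustration of the intended cleanup (the names here are hypothetical, not the actual code at the link):

```python
# Before: branch on the FONLL variant
if variant == "nFONLL":
    theory = build_nfonll(theory_card)
else:  # aFONLL fallback, to be deleted
    theory = build_afonll(theory_card)

# After: only the nFONLL path survives
theory = build_nfonll(theory_card)
```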
-
### System Info
- `transformers` version: 4.44.2
- Platform: macOS-15.1-arm64-arm-64bit
- Python version: 3.10.14
- Huggingface_hub version: 0.23.3
- Safetensors version: 0.4.3
- Accelerate vers…
-
In [Ludicrously Fast Neural Machine Translation](https://aclanthology.org/D19-5632.pdf), they test a variety of decoder configurations for faster models.
In #174 @eu9ene showed that a larger de…