-
I tried using
`model = AutoModelForCausalLM.from_pretrained(args.model_path, device_map="auto", trust_remote_code=True, low_cpu_mem_usage=True)`
but it raises this error:
Traceback (most recent call last):
File "/home/ubu…
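Without the full traceback this is only a guess, but a common cause of errors with `device_map="auto"` and `low_cpu_mem_usage=True` is that the `accelerate` package (which both options rely on) is not installed. A minimal sketch for checking that before loading:

```python
import importlib.util

def has_package(name: str) -> bool:
    # device_map="auto" and low_cpu_mem_usage both depend on `accelerate`;
    # returns True if the package is importable in this environment
    return importlib.util.find_spec(name) is not None

if not has_package("accelerate"):
    print("`accelerate` is missing; run: pip install accelerate")
```

If `accelerate` is present, posting the complete traceback would help narrow the error down further.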
-
For now, if it can be added even via copy-paste, I'd like to add it.
-
I ran the model on my modified cnt dataset and got some errors.
D:\ProgramData\anaconda3\envs\labram\python.exe E:\lab\DL\LaBraM-main\run_class_finetuning.py
Not using distributed mode
…
-
Some commits cause BitNet to produce incorrect output.
To reproduce:
```bash
python ./integration/BitNet/eval_correctness.py
```
The output is abnormal.
```
Replacing module layers.25.mlp.dow…
-
### 🚀 The feature, motivation and pitch
I just stumbled upon https://twitter.com/DrJimFan/status/1615018393601716224; there is https://github.com/NVlabs/tiny-cuda-nn, which fuses small MLPs for fast tra…
-
@LTH14
Hello Bro
I noticed the VAE in MAR is KL-16, so the latent dimension is [B, 16, 16, 16]; when using KL-8, the latent dimension is [B, 4, 32, 32].
I have a question: if I use the SD model or other …
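The two latent shapes quoted above follow directly from the autoencoder's downsampling factor: a KL-f VAE shrinks each spatial dimension by f, and the channel counts (16 for KL-16, 4 for KL-8) match the shapes in the question. A small sketch, assuming a 256x256 input:

```python
def latent_shape(batch, height, width, factor, latent_channels):
    # a KL-f autoencoder divides each spatial dimension by its factor f;
    # latent_channels is a property of the specific pretrained VAE
    return (batch, latent_channels, height // factor, width // factor)

# 256x256 input, KL-16 -> (1, 16, 16, 16); KL-8 -> (1, 4, 32, 32)
print(latent_shape(1, 256, 256, 16, 16))
print(latent_shape(1, 256, 256, 8, 4))
```

So swapping in a different autoencoder changes both the spatial grid and the channel count the downstream model must consume.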
-
Trying and testing base learners for the model pipeline with GPU support. See the `fit_base_learner` function (R/base_learner.R); the `learner` parameter determines the model type.
- [ ] mlp
- [ ] xgb
- [ ] lgb
…
-
I have a finetuned LoRA-Llama3-8b model. Since I have many prompts, I would like to write a script that generates outputs for all prompts without repeatedly loading the model, as the CLI script does.
The…
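A minimal sketch of such a script, assuming the LoRA weights have already been merged into the base model; the path and file names (`MODEL_PATH`, `prompts.txt`) are placeholders, and the `transformers` calls follow their documented API but are untested here:

```python
def read_prompts(path):
    # one prompt per line, skipping blank lines
    with open(path) as f:
        return [line.strip() for line in f if line.strip()]

def main():
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    MODEL_PATH = "path/to/merged-lora-llama3-8b"  # placeholder path
    tok = AutoTokenizer.from_pretrained(MODEL_PATH)
    # load the model once, then reuse it for every prompt
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_PATH, torch_dtype=torch.bfloat16, device_map="auto"
    )
    for prompt in read_prompts("prompts.txt"):
        inputs = tok(prompt, return_tensors="pt").to(model.device)
        out = model.generate(**inputs, max_new_tokens=256)
        print(tok.decode(out[0], skip_special_tokens=True))
```

Call `main()` to run; batching several prompts per `generate` call would speed this up further at the cost of padding handling.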
-
@casper-hansen Hi, I have a question about the AWQ-quantized model on Hugging Face: https://huggingface.co/TheBloke/Llama-2-7B-AWQ/tree/main?show_file_info=model.safetensors.
The shapes o…
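For context on why AWQ tensor shapes can look surprising: with 4-bit weights, 8 values are packed into each int32, so the stored tensors are much smaller than the dense fp16 weight. A sketch of the shapes one would expect under the AutoAWQ GEMM layout (treat the exact layout as an assumption, not a spec):

```python
def awq_shapes(in_features, out_features, group_size=128, pack=8):
    # 4-bit weights: `pack` (=8) values per int32 column;
    # zeros/scales are stored per quantization group along in_features
    return {
        "qweight": (in_features, out_features // pack),          # int32
        "qzeros": (in_features // group_size, out_features // pack),  # int32
        "scales": (in_features // group_size, out_features),     # fp16
    }

# e.g. a Llama-2-7B q_proj layer, Linear(4096, 4096)
print(awq_shapes(4096, 4096))
```

Comparing these against the shapes shown in the safetensors file viewer should make clear which axis is packed and which is grouped.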
-
I'm trying to deploy and run the demo on a cluster with 4 A6000 GPUs, but the runtime seems to freeze without raising any exceptions... What could be causing this? Sorry for asking a naive question and thanks for…