linear-models Search Results

1000+ results
for linear-models

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

PKU-YuanGroup/Open-Sora-Plan #57

Try using linear passthrough to train a model in dit?

Try using linear passthrough to train a model in dit? `One of the key ideas is that it works as if it was like "an online passthrough", by applying a loop on a module SuperClass, that groups layers…

win10ogod updated 6 months ago
1
vllm-project/llm-compressor #106

[Bug]: Index Error tuple out of range

**Describe the bug** I'm trying to apply "W4A16" quantisation to the qwen2-7B model. In particular "cognitivecomputations/dolphin-2.9.2-qwen2-7b" though I've tried with other qwen2 models and had the…

SeanIsYoung updated 2 weeks ago
2
net-titech/gnn-models #9

Simplifying Graph Convolutional Networks

Venue: ICML 2019 Summary: Proposes a simplified linear graph neural network architecture (GCN with non-linearity layers removed). New architecture is significantly faster than the state of the art mo…

zarina-aniraz updated 5 years ago
1
flow123d/flow123d #1342

reference stress field

Add support for the reference (initial stress field) that may be necessary for some nonlinear models e.g. contacts with friction. For the linear elasticity we have balance of forces: ``` div( add…

jbrezmorf updated 2 years ago
1
k2-fsa/snowfall #142

RuntimeError in ctc_att_transformer_train.py

See below (using the latest master) ``` 2021-03-29 07:34:23,835 INFO [common.py:270] ================================================================================ 2021-03-29 07:3…

csukuangfj updated 3 years ago
5
rikdz/GraphWriter #7

CUDA out of memory.

~/桌面/GraphWriter-master$ python3.6 train.py -save res Save File Exists, OverWrite? for no Loading Data from data/preprocessed.train.tsv building vocab done Sorting training data by len ds size…

sjzabc updated 4 years ago
2
philschmid/sagemaker-huggingface-llama-2-samples #3

RuntimeError: mat1 and mat2 shapes cannot be multiplied (409…

Ran all the cells of Notebook to funetune LLama2 got this error. | 2023-07-20T16:08:06.067+05:30 | return forward_call(*args, **kwargs) File "/opt/conda/lib/python3.10/site-packages/accelerat…

monuminu updated 1 year ago
1
szaghi/OFF #7

New code roadmap

### Desiderata features + Compressible, multi-fluid, multi-phase, Navier-Stokes equations: + Preconditioned equations to efficient handling incompressibile, compressible, cavitating and multi-ph…

szaghi updated 5 years ago
1
lme4/lme4 #710

Add Correlation Parameter for Partially Crossed Random Effec…

Hello lme4 team, I am using lme4 and the Julia MixedModels code to estimate non-nested partially crossed person and firm earnings models. An example formula is shown below: earnings ~ 1 + experi…

mckman updated 5 months ago
2
pytorch/pytorch #91165

[FSDP] FSDP with CPU offload consumes `1.65X` more GPU memor…

### 🐛 Describe the bug Context: We have more and more situations where a large part of the model that's being trained is frozen. As these are very large LLMs, we want to leverage FSDP with CPU offl…

pacman100 updated 5 months ago
20

上一页 1...94 95 96 97 98 99 100...100 下一页

1000+ results for linear-models

1000+ results
for linear-models