-
Please pose thoughtful questions for our speaker by Wednesday midnight, and upvote five of them by Thursday @ 10am, an hour before our session together. The associated papers are:
-
An `attn_mask` dtype error occurred, as shown below.
```
$ python3 scripts/txt2img.py --prompt "a professional photograph of an astronaut riding a horse" --ckpt "./768-v-ema.ckpt" --config configs/stable-dif…
```
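The traceback is cut off above. For context, this class of error usually originates in `torch.nn.functional.scaled_dot_product_attention`, which requires `attn_mask` to be boolean or to match the query dtype; whether that is the actual code path inside txt2img.py here is an assumption. A minimal standalone sketch of the mismatch and the cast that avoids it:

```python
import torch
import torch.nn.functional as F

# Half-precision attention inputs, shape (batch, heads, seq, head_dim); needs CUDA.
q = k = v = torch.randn(1, 8, 16, 64, dtype=torch.half, device="cuda")

# SDPA wants attn_mask to be bool or to match q's dtype; a plain float32
# mask next to fp16 queries raises a dtype error like the one reported.
mask = torch.zeros(16, 16, device="cuda")  # float32: mismatched with q
out = F.scaled_dot_product_attention(q, k, v, attn_mask=mask.to(q.dtype))
```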
-
I'm getting a `RuntimeError: CUDA error: an illegal memory access was encountered`
using FlashAttention with a GPT-NeoX-esque model. I can reproduce it with:
```
from transformers import AutoConfig
import torch
from…
```
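The repro script is cut off above. As a hedged aside: flash-attn's fused kernels are strict about input layout, and violating those constraints is a common source of illegal-memory-access errors. A minimal sketch of a well-formed call, assuming the flash-attn 2.x `flash_attn_func` interface:

```python
import torch
from flash_attn import flash_attn_func

# flash-attn expects (batch, seqlen, nheads, headdim) tensors on a CUDA
# device in fp16/bf16; wrongly shaped, typed, or non-contiguous inputs
# can surface as illegal memory accesses inside the kernel.
q = torch.randn(2, 128, 12, 64, dtype=torch.float16, device="cuda")
k = torch.randn(2, 128, 12, 64, dtype=torch.float16, device="cuda")
v = torch.randn(2, 128, 12, 64, dtype=torch.float16, device="cuda")

out = flash_attn_func(q, k, v, causal=True)  # -> (2, 128, 12, 64)
```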
-
When I try to fine-tune wizard-2 7b, I get the error: `TypeError: MistralForCausalLM.forward() got an unexpected keyword argument 'causal_mask'`.
Full stack trace follows:
```
model loading
==((====))=…
```
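The trace is cut off here. For what it's worth, a `TypeError` like this usually means the training wrapper and the installed `transformers` disagree on the `forward()` signature, and the real fix is aligning the two versions. A hedged workaround sketch (a hypothetical helper, not from either library) that drops kwargs the target `forward()` does not declare:

```python
import inspect

def forward_compat(model, **kwargs):
    # Hypothetical shim: keep only the kwargs that model.forward() declares,
    # silently dropping stragglers like 'causal_mask' from an older caller.
    accepted = inspect.signature(model.forward).parameters
    return model(**{k: v for k, v in kwargs.items() if k in accepted})
```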
-
## Slides
Current sizes of the slide sets (cap at 25, except Week 1). Revise the readings, practice sessions, and exercises, and include screenshots of videos where relevant.
- [x] 1. 37 -- OK, cap at ~ 40…
-
### What is the issue?
After upgrading Ollama from 0.20 to 0.27, it runs Gemma 2 9B very slowly. I don't think the machine is out of VRAM, since Gemma 2 only takes 6.8 GB of VRAM (q_4_0) while my lapto…
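The report is cut off here. As a hedged diagnostic (Ollama CLI commands, assuming a recent version), the CPU/GPU split and the actual token rate can be checked with:

```
$ ollama ps                       # PROCESSOR column: "100% GPU" vs a CPU/GPU split
$ ollama run gemma2:9b --verbose  # prints timing stats, including eval rate (tokens/s)
```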
-
### System Info
```Shell
- `Accelerate` version: 0.33.0
- Platform: Linux-5.15.133+-x86_64-with-glibc2.35
- `accelerate` bash location: /opt/conda/bin/accelerate
- Python version: 3.10.14
- Numpy…
```
-
**Describe the bug**
FlashAttention in both implementations, the original [one](https://github.com/HazyResearch/flash-attention/blob/main/flash_attn/flash_attention.py#L11-L71) and the torch.nn.f…
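The report is truncated at this point. For context, discrepancies in fused attention kernels are typically demonstrated with a parity check against a plain fp32 reference; a sketch of that pattern (not the reporter's actual repro), using `torch.nn.functional.scaled_dot_product_attention` as the fused path:

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
q = k = v = torch.randn(2, 8, 128, 64, device="cuda", dtype=torch.half)

# Fused kernel (dispatches to a flash backend when eligible)
out_fused = F.scaled_dot_product_attention(q, k, v, is_causal=True)

# Plain fp32 reference attention for comparison
scores = (q.float() @ k.float().transpose(-2, -1)) / 64 ** 0.5
causal = torch.triu(torch.ones(128, 128, device="cuda", dtype=torch.bool), 1)
scores = scores.masked_fill(causal, float("-inf"))
out_ref = (scores.softmax(-1) @ v.float()).half()

print((out_fused - out_ref).abs().max())  # should sit within fp16 tolerance
```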
-
### System info
- `transformers` version: 4.43.0.dev0
- Platform: Linux-5.10.0-30-cloud-amd64-x86_64-with-glibc2.29
- Python version: 3.8.10
- Huggingface_hub version: 0.23.4
- Safetensors vers…
-
![class](https://cloud.githubusercontent.com/assets/1461453/21132332/ae2df6a2-c113-11e6-9cac-4a7a20151df9.png)
This structure will help us keep all the different distributions separate and th…
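To make the diagram concrete, here is a minimal sketch of the kind of hierarchy it suggests: an abstract base class carrying the shared interface, with each distribution kept separate in its own subclass (illustrative names, not the final design):

```python
import math
import random
from abc import ABC, abstractmethod

class Distribution(ABC):
    """Shared interface; each concrete distribution stays self-contained."""

    @abstractmethod
    def sample(self, n: int):
        """Draw n samples from the distribution."""

    @abstractmethod
    def log_prob(self, value: float) -> float:
        """Log-density (or log-mass) at `value`."""

class Normal(Distribution):
    def __init__(self, loc: float, scale: float):
        self.loc, self.scale = loc, scale

    def sample(self, n: int):
        return [random.gauss(self.loc, self.scale) for _ in range(n)]

    def log_prob(self, value: float) -> float:
        var = self.scale ** 2
        return -0.5 * ((value - self.loc) ** 2 / var + math.log(2 * math.pi * var))
```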