causal-models Search Results

1000+ results
for causal-models

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ollama/ollama #5143

AMD iGPU works in docker with override but not on host

### What is the issue? Ollama is failing to run on GPU instead it uses CPU. If I force it using `HSA_OVERRIDE_GFX_VERSION=9.0.0` then I get `Error: llama runner process has terminated: signal: abo…

smellouk updated 1 week ago
17
huggingface/text-generation-inference #1861

TGI-2.0.2 encounter "CUDA is not available"

### System Info ```shell torch install path ............... ['/home/chatgpt/.local/lib/python3.10/site-packages/torch'] torch version .................... 2.1.2+cu121 deepspeed install path ..…

Cucunnber updated 1 month ago
1
AkihikoWatanabe/paper_notes #804

Understanding Social Reasoning in Language Models with Langu…

# URL - https://arxiv.org/abs/2306.15448 # Affiliations - Kanishk Gandhi, N/A - Jan-Philipp Fränken, N/A - Tobias Gerstenberg, N/A - Noah D. Goodman, N/A # Abstract - As Large Language Model…

AkihikoWatanabe updated 8 months ago
1
NVIDIA/TensorRT-LLM #1037

Can TensorRT-LLM support the modified QWenAttention

As the title describes, I slightly modified the QWenAttention. Before: ![image](https://github.com/NVIDIA/TensorRT-LLM/assets/16505966/9ee6d300-5f92-489d-8022-cdf467f9acb2) After: ![image](https:/…

Hukongtao updated 1 month ago
3
ollama/ollama #4151

High RAM usage causes yo-yoing memory pressure on Mac, slow …

### What is the issue? This is possibly related to the fix for #4028. I updated to the 0.1.33 release and pulled the latest `mixtral:8x22b-instruct-v0.1-q4_0` (`6a0910fa6dc1`), so I'm running an 80…

joliss updated 1 month ago
2
triton-lang/triton #4310

Latest nightly triton causes my custom fused attention kerne…

Hello, guys. Thank you for all your great work on this awesome project! I am currently building a new deep learning acceleration framework with it. But I have some problems with it now. Hope you cou…

chengzeyi updated 1 day ago
3
allenai/OLMo #622

Cant use LORA

### 🐛 Describe the bug ValueError: Target modules {'v_proj', 'up_proj', 'o_proj', 'down_proj', 'k_proj', 'q_proj', 'gate_proj'} not found in the base model. Please check the target modules and try …

bdytx5 updated 3 weeks ago
6
paperswithcode/galai #83

problem while inferencing the model

Hi All, I'm trying to do inference using galactica-6.7B model but errors have been popping up after inferencing few examples, and I'm not sure what to do. Can anyone look at them and tell? followin…

ra-MANUJ-an updated 9 months ago
3
huggingface/transformers #20179

🌐 [i18n-KO] Translating docs to Korean

Hi! Let's bring the documentation to all the Korean-speaking community 🌏 (currently 9 out of 77 complete) Would you want to translate? Please follow the 🤗 [TRANSLATING guide](https://github.com…

wonhyeongseo updated 7 months ago
15
jasp-stats/jasp-issues #73

Feature Request: Weight Case Variables

Hell there. I was wondering if weighting was a feature that is coming out, or if it is even on your radar. I currently use a couple different free stat tools to run frequencies and cross tabs for …

bazar0ff updated 1 month ago
26

上一页 1...91 92 93 94 95 96 97...100 下一页

1000+ results for causal-models

1000+ results
for causal-models