hidden-causal Search Results

1000+ results
for hidden-causal

Best match

opea-project/GenAIExamples #625

TGI latest cpu version doesn't work with some models

After updating the TGI version to ghcr.io/huggingface/text-generation-inference:latest-intel-cpu, the codegen test failed with the following 2 models: ise-uiuc/Magicoder-S-DS-6.7B m-a-p/OpenCodeInterpr…

yongfengdu updated 2 weeks ago
3
meedstrom/eva #4

Guess activity

For background, see the [README](https://github.com/meedstrom/eva) for all the theory. Current questions on the stats theory ### Re. the model for realtime guesses: - [ ] What kind of model can it b…

meedstrom updated 3 years ago
1
Dao-AILab/flash-attention #941

Fewer matrix multiplications, same results, should we consid…

In the inner loop of FlashAttention-2, each computation of O requires a computation of V. I adopted a different implementation approach. For each block Q, after calculating the complete attention scor…

pandaupc updated 3 months ago
23
Dao-AILab/flash-attention #679

The problem of dtype.

# Prob and some fix I'm using flash_attn==2.3.3 to load my finetuned LLaMa2 model (13B), but get an error when using the Flash_attn. In /flash_attn/bert_padding.py#L41 there is an error : IndexError:…

bxrjmfh updated 9 months ago
1
huggingface/transformers #31468

Attention dropout causing problem in attention score distrib…

### System Info Transformers version 4.41.2 Platform: Ubuntu 22.04.4 LTS Python: 3.10.14 ### Who can help? @younesbelkada @ArthurZucker ### Information - [ ] The official example s…

RicRicci22 updated 1 month ago
6
tensorflow/tensorflow #74972

Tensorflow/Keras_nlp bug

### Issue type Bug ### Have you reproduced the bug with TensorFlow Nightly? Yes ### Source binary ### TensorFlow version tf 2.17.0 ### Custom code No ### OS platform and distribution Ubuntu…

Humbulani1234 updated 13 hours ago
1
amazon-science/chronos-forecasting #33

Use efficient implementation of attention

I am wondering what's the best way to use efficient implementations of attention. PyTorch provides the experimental [`torch.nn.functional.scaled_dot_product_attention`](https://pytorch.org/docs/stable…

abdulfatir updated 2 months ago
1
KaihuaTang/Scene-Graph-Benchmark.pytorch #200

Model fails (does not start) to classify custom image

## ❓ Questions and Help Here's my system: docker image with gpu support ubuntu 18.04 ``` (base) root@43a59b70d445:/app/scene-graph-benchmark# nvidia-smi Thu Sep 21 11:57:45 2023 +-------…

BlueVelvetSackOfGoldPotatoes updated 7 months ago
16
espnet/espnet #5065

when I run egs2/librimix/tse1/run.sh, the loss=0.000e+00 all…

I want to train a Target Speaker Extraction model on the Librimix dataset, but I found that snr_loss and the final loss (equal to snr_loss) are always 0.000e+00. Here is my train log: node7:0/6] 2023-03-27 16:…

mubingshen updated 9 months ago
16
py-why/dowhy #464

Algorithms for efficient adjustment sets

Hello DoWhy team. Congrats on the great work on this package! I wonder if you would be interested in a contribution to the package. First, a brief intro. In a series of papers with co-authors ([…

esmucler updated 2 years ago
4
