-
(This is running on an NVIDIA 4090 GPU, with jax '0.4.31'.)
What I got is something like the example below. Here, the depth-wise convolution wants the input to be transposed from [batch, sequence…
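Below is a minimal, illustrative sketch (not the original code) of the kind of transposition being described: a depth-wise 1D convolution via `jax.lax.conv_general_dilated` that expects [batch, features, sequence], so the [batch, sequence, features] activations are transposed on the way in and back out. All shapes and names here are assumptions for illustration.
```
import jax
import jax.numpy as jnp

def depthwise_conv1d(x_bsf, kernel, pad="SAME"):
    """x_bsf: [batch, seq, features]; kernel: [features, 1, width] (depth-wise)."""
    features = x_bsf.shape[-1]
    x_bfs = jnp.transpose(x_bsf, (0, 2, 1))          # -> [batch, features, seq]
    y_bfs = jax.lax.conv_general_dilated(
        x_bfs, kernel,
        window_strides=(1,), padding=pad,
        dimension_numbers=("NCH", "OIH", "NCH"),
        feature_group_count=features,                # one group per channel = depth-wise
    )
    return jnp.transpose(y_bfs, (0, 2, 1))           # back to [batch, seq, features]

x = jnp.ones((2, 16, 8))                             # [batch, sequence, features]
w = jnp.ones((8, 1, 3)) / 3.0                        # [features, 1, kernel_width]
print(depthwise_conv1d(x, w).shape)                  # (2, 16, 8)
```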
-
Hello! Following your code, I adapted the Attentioner Manager originally used for GPT2 to Llama and obtained saliency scores; each layer's score is [1, 1, seq_len, seq_len], and some of the concrete values are shown below.
I would like to know what the saliency score of each layer actually means.
My code is as follows:
```
class LlamaAttentionManager(AttentionerManagerBase):
…
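# (Added sketch, not part of the original snippet.) Under the usual attention-saliency
# analysis, each layer's [1, 1, seq_len, seq_len] matrix is the head-reduced
# |A * dL/dA|: entry (i, j) measures how much the attention flowing from position i
# to position j in that layer contributes to the loss. A rough stand-alone PyTorch
# version, with hypothetical names such as `attn_maps`:
import torch

def layer_saliency(attn_maps, loss):
    """attn_maps: list of [1, n_heads, seq_len, seq_len] attention tensors kept from the forward pass."""
    grads = torch.autograd.grad(loss, attn_maps, retain_graph=True)
    # elementwise product of attention and its gradient, reduced over the head dimension
    return [(a * g).abs().sum(dim=1, keepdim=True) for a, g in zip(attn_maps, grads)]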
-
Epoch [1/3]
---------------------------------------------------------------------------
KeyError Traceback (most recent call last)
File :21, in _fwd_kernel(Q, K, V,…
-
**Summary**
I'm hitting a NaN loss issue when I use the TransformerLayer in place of a PyTorch transformer layer I wrote.
**Details**
I'm using the nvcr.io/nvidia/pytorch:24.04-py3 docker cont…
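For what it's worth, a generic, library-agnostic debugging sketch for this kind of drop-in swap is to hook every module and flag the first non-finite output, plus PyTorch's anomaly detection for the backward pass; nothing here is specific to TransformerLayer or to this model:
```
import torch

def install_nan_hooks(model):
    """Raise as soon as any module in `model` produces a NaN/Inf output."""
    def check(name):
        def hook(module, inputs, output):
            outs = output if isinstance(output, (tuple, list)) else (output,)
            for o in outs:
                if torch.is_tensor(o) and not torch.isfinite(o).all():
                    raise RuntimeError(f"non-finite values first seen in {name}")
        return hook
    for name, module in model.named_modules():
        module.register_forward_hook(check(name))

# torch.autograd.set_detect_anomaly(True)  # reports the op that produced NaN gradients
```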
-
python export_qwen2_1.5.py -m /media/yanxiao/机械硬盘1/LLM/Qwen2-7B-Instruct -o ./
WARNING:root:*** Note: please apply modications to model before conversion:
modication 1: in Qwen2ForCausalLM.forwar…
-
```
obj = SAM_handson(num_hidden_generator=200, num_hidden_discriminator=200, train_epochs=100, test_epochs=30, batchsize=10, dagloss=True, verbose=True, nruns=1)
output = obj.predict(data, graph=sk…
-
# ❓ Questions & Help
Existing examples in session-based/sequential recommendations only use item-level, sequence-based features.
However, in many real-world scenarios, we do have access to either …
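As a rough illustration of what such a setup could look like (independent of any particular library, with all names and sizes made up): embed the item sequence and the non-sequential, user-level categorical feature separately, broadcast the user embedding over the time dimension, and concatenate before the sequence encoder.
```
import torch
import torch.nn as nn

class SessionModelWithUserFeature(nn.Module):
    def __init__(self, n_items, n_user_groups, d_item=64, d_user=16, d_hidden=128):
        super().__init__()
        self.item_emb = nn.Embedding(n_items, d_item, padding_idx=0)
        self.user_emb = nn.Embedding(n_user_groups, d_user)
        self.encoder = nn.GRU(d_item + d_user, d_hidden, batch_first=True)
        self.head = nn.Linear(d_hidden, n_items)

    def forward(self, item_ids, user_group):
        # item_ids: [batch, seq_len], user_group: [batch]
        items = self.item_emb(item_ids)                         # [B, T, d_item]
        user = self.user_emb(user_group)                        # [B, d_user]
        user = user.unsqueeze(1).expand(-1, items.size(1), -1)  # broadcast over time
        x = torch.cat([items, user], dim=-1)
        _, h = self.encoder(x)                                  # h: [1, B, d_hidden]
        return self.head(h.squeeze(0))                          # next-item logits
```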
-
Hi, I would like to ask why the attention mask is not used in the prefill stage.
I want to output the attention scores matrix in the prefill stage. Is the code below correct?
```
if spec: # s…
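# (Illustrative sketch added here, not the repository's code.) During prefill the whole
# prompt is scored at once, so a causal mask is still needed if the per-position attention
# probabilities are supposed to match decode-time behaviour. A plain PyTorch version of
# the score matrix one would typically dump, with hypothetical q/k shapes:
import math
import torch

def prefill_attention_scores(q, k):
    """q, k: [batch, n_heads, seq_len, head_dim]; returns [batch, n_heads, seq_len, seq_len]."""
    seq_len = q.shape[-2]
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.shape[-1])
    causal = torch.triu(torch.ones(seq_len, seq_len, dtype=torch.bool, device=q.device), 1)
    scores = scores.masked_fill(causal, float("-inf"))  # block attention to future tokens
    return torch.softmax(scores, dim=-1)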
-
**Choose Topics for Presentation**
- [x] Q-learning
- [x] Deep Neural Network
- [x] Artificial General Intelligence
- [x] Artificial Quantum Intelligence
- [x] Cognitive Science
- [ ] Quantum Co…
-
Hi, I ran a test compiling the UNet (torch.float16) component of StableDiffusionXLPipeline on an inf2.8xlarge (Inferentia2) instance, and it failed.
When the UNet's latent size is (64, 64), it did not fai…
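For reference, a rough sketch (not a verified repro) of the usual torch_neuronx tracing pattern for the SDXL UNet: wrap it so that every input is a plain tensor, then trace with example inputs at the latent resolution you want to compile. The shapes below assume a 1024x1024 image, i.e. 128x128 latents; the model id and all sizes are illustrative assumptions, and (64, 64) can be swapped in to compare against the case that compiled.
```
import torch
import torch_neuronx
from diffusers import StableDiffusionXLPipeline

class UNetWrapper(torch.nn.Module):
    """Flatten the SDXL UNet inputs into plain tensors so they can be traced."""
    def __init__(self, unet):
        super().__init__()
        self.unet = unet

    def forward(self, sample, timestep, encoder_hidden_states, text_embeds, time_ids):
        out = self.unet(sample, timestep, encoder_hidden_states,
                        added_cond_kwargs={"text_embeds": text_embeds, "time_ids": time_ids},
                        return_dict=False)
        return out[0]

pipe = StableDiffusionXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.float16)
wrapper = UNetWrapper(pipe.unet)
example = (torch.randn(2, 4, 128, 128, dtype=torch.float16),  # latents
           torch.tensor(999.0),                                # timestep
           torch.randn(2, 77, 2048, dtype=torch.float16),      # text encoder states
           torch.randn(2, 1280, dtype=torch.float16),          # pooled text embeds
           torch.randn(2, 6, dtype=torch.float16))             # SDXL time ids
traced = torch_neuronx.trace(wrapper, example)
```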