-
## Bug Description
I get an error when converting a Conformer transducer encoder to TensorRT (ASR task).
## To Reproduce
[requirenments.txt](https://github.com/pytorch/TensorRT/files/123430…
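For context, here is a minimal sketch of the kind of conversion involved, assuming a generic PyTorch encoder and the `torch_tensorrt.compile` API; the module, shapes, and precision below are illustrative stand-ins, not taken from the actual report:

```python
import torch
import torch_tensorrt

# hypothetical stand-in for the conformer transducer encoder
encoder = torch.nn.TransformerEncoder(
    torch.nn.TransformerEncoderLayer(d_model=256, nhead=4, batch_first=True),
    num_layers=2,
).eval().cuda()

# compile to TensorRT; input shape and precision are assumptions for illustration
trt_encoder = torch_tensorrt.compile(
    encoder,
    inputs=[torch_tensorrt.Input((1, 100, 256), dtype=torch.float32)],
    enabled_precisions={torch.float32},
)

out = trt_encoder(torch.randn(1, 100, 256, device="cuda"))
```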
-
### System Info
```shell
+-----------------------------------------------------------------------------+
| HL-SMI Version: hl-1.17.0-fw-51.1.0 |
| Driver Ver…
```
-
### Description
It is currently found that `DistilBert` using [`torch.Tensor`](https://github.com/huggingface/transformers/blob/main/src/transformers/models/distilbert/modeling_distilbert.py#L246) ge…
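As a hedged illustration of the kind of problem `torch.Tensor` construction inside a model can cause (the module and values below are hypothetical, not DistilBert's actual code): a tensor created inside `forward` defaults to CPU and float32, which can mismatch a CUDA or fp16 input during tracing or mixed-precision runs.

```python
import torch
import torch.nn as nn

class ScaledAttention(nn.Module):  # hypothetical module, for illustration only
    def forward(self, q: torch.Tensor) -> torch.Tensor:
        # torch.tensor(...) here lives on CPU in float32, which can clash
        # with a CUDA/fp16 input
        scale = torch.tensor(8.0)
        # casting to the input's device/dtype sidesteps the mismatch
        return q / scale.to(device=q.device, dtype=q.dtype)
```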
-
**Description**
We found that the performance of Triton + TensorRT differs significantly between stable QPS and uneven QPS, as follows:
- uneven QPS
(1) QPS
![image](https://github.com/triton-inference-se…
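For reference, a minimal sketch of what "stable" versus "uneven" QPS means as a load pattern, using a hypothetical `send_request()` standing in for one inference call (Triton's own `perf_analyzer` can generate both patterns; this is only an illustration):

```python
import random
import time

def send_request():
    """Hypothetical stand-in for one Triton inference call."""
    pass

def run_load(qps: float, seconds: float, uneven: bool) -> None:
    # stable load: fixed inter-arrival gap of 1/qps
    # uneven load: exponential gaps (Poisson arrivals) with the same mean rate
    deadline = time.monotonic() + seconds
    while time.monotonic() < deadline:
        send_request()
        gap = random.expovariate(qps) if uneven else 1.0 / qps
        time.sleep(gap)

run_load(qps=50, seconds=10, uneven=True)   # uneven QPS
run_load(qps=50, seconds=10, uneven=False)  # stable QPS
```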
-
### System Info
```
(zt) root@autodl-container-7071118252-7032359d:~/test/PiPPy/examples/llama# transformers-cli env
Copy-and-paste the text below in your GitHub issue and FILL OUT the two last p…
```
-
torch version: 2.5.0.dev20240616+cu121
python version: 3.8
I run the llama example with `torchrun --nproc-per-node 2 pippy_llama.py`, and it fails with an error:
```
Loading checkpoint shards: 100%|███…
```
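As a side note, a minimal sketch for isolating whether the `torchrun` setup itself works, independent of the PiPPy example (the file name is hypothetical):

```python
# save as check_dist.py (hypothetical name) and run:
#   torchrun --nproc-per-node 2 check_dist.py
import torch
import torch.distributed as dist

dist.init_process_group(backend="gloo")  # gloo avoids needing two GPUs
print(f"rank {dist.get_rank()}/{dist.get_world_size()}, torch {torch.__version__}")
dist.destroy_process_group()
```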
-
## 🐛 Bug
When I deploy my own 2B model using MLC on Android, the model interface initializes successfully and displays the "Ready to chat" prompt after opening. However, the app crashes after sendi…
-
Hi,
I encounter the following error message when trying to enable flash attention with the command below. Is flash attention supported?
command: `./main -m $model -n 128 --prompt …`
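For what it's worth, a minimal sketch of requesting flash attention through the llama-cpp-python bindings instead of the CLI; this assumes the `flash_attn` constructor flag available in recent llama-cpp-python releases (whether it takes effect depends on the build and backend), and the model path is hypothetical:

```python
from llama_cpp import Llama  # pip install llama-cpp-python

llm = Llama(
    model_path="model.gguf",  # hypothetical path
    flash_attn=True,          # request flash attention (assumes a recent build)
)
print(llm("Hello", max_tokens=16)["choices"][0]["text"])
```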
-
## Description
When I use your demo/Diffusion/demo_txt2img_xl.py for INT8 inference, it reports an error:
Invoked with: %338 : Tensor = onnx::Constant(), scope: transformers.models.clip…
-
This is pretty weird.
The attention alignment plot is also blank:
![alignment_009k](https://user-images.githubusercontent.com/2422433/40480115-ce19c76e-5f4d-11e8-8fa7-afe41d0a3bbf.png)
I r…
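A quick hedged check that often distinguishes a rendering problem from a genuinely empty alignment matrix (the file path below is hypothetical):

```python
import numpy as np
import matplotlib.pyplot as plt

# hypothetical path to the saved alignment matrix
align = np.load("alignment.npy")

# a "blank" plot usually means all zeros or NaNs rather than a plotting bug
print("min:", align.min(), "max:", align.max(), "NaNs:", np.isnan(align).any())

plt.imshow(align.T, aspect="auto", origin="lower", interpolation="none")
plt.xlabel("decoder step")
plt.ylabel("encoder step")
plt.savefig("alignment_check.png")
```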