autotune Search Results

1000+ results
for autotune

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

facebookresearch/TensorComprehensions #211

Support for CPU computations and add is_cuda check

When submitting a bug report, please include the following information (where relevant): - OS: ubuntu 16.04 - How you installed TC (docker, conda, source): conda - Python version: 3.6 - CUDA/cuDNN…

arogozhnikov updated 6 years ago
2
TauLabs/TauLabs #1789

Windows development setup on wiki needs update

It would be worth updating the Makefile to import all of the tools required for compilation as it once did and as OP still does. This would make it a little easier for others to contribute to the codi…

gke updated 9 years ago
3
hiyouga/LLaMA-Factory #4785

FSDP-QLora w/ DeepSeek-v2-lite dones't work on 4 GPUs

### Reminder - [X] I have read the README and searched the existing issues. ### System Info [2024-07-12 02:22:28,334] [INFO] [real_accelerator.py:203:get_accelerator] Setting ds_accelerator to cuda…

Jiayi-Pan updated 2 weeks ago
4
magenta/ddsp-vst #16

Untuned pitch detection

I'm a little frustrated that when I play the instrument with my theremin, it snaps to the correct pitch rather than interpreting the pitch of the theremin directly. I can't play microtonal scales or …

hargisss updated 9 months ago
4
dmlc/MXNet.jl #261

Thrice as much memory for AlexNet on Caltech256 in Julia tha…

I am having a severe problem with training AlexNet (see [alexnet.jl](https://gist.github.com/hesseltuinhof/01d52e5ba64546bf6b806b3ffcc10c3f#file-alexnet-jl)) in Julia (0.5.2) on my GPU (12gb mem). …

hesseltuinhof updated 7 years ago
3
tusen-ai/simpledet #361

Segmentation fault: 11

**Describe the bug** A clear and concise description of what the bug is. On executing the below command I am facing segmentation fault and I know that there is the other issue similar to this but …

gopikrishnabs updated 2 years ago
1
alashworth/test-issue-import #90

Generalize auto-tuning in ADVI to grid search function, for …

**Issue by [dustinvtran](https://github.com/dustinvtran)** _Sunday Feb 28, 2016 at 23:09 GMT_ _Originally opened as https://github.com/stan-dev/stan/issues/1780_ ---- (There are a number of things …

alashworth updated 5 years ago
5
tensorflow/profiler #8

What's kernel launch time?

The performance summary shows that my model spend ~50% time in the "kernel launch" step. I find other items easy to understand, but I have no idea what "kernel launch" is, and how I can reduce its ti…

leasunhy updated 2 years ago
30
DeepVAC/deepvac #96

tensorrt转换器报错

当打开tensorrt转换器开关后，转换逻辑报错。

gemfield updated 3 years ago
3
triton-lang/triton #4020

[RFC] "autotuner deja-vu" save and restore autotuner cache p…

### Motivation In our experiments and applications, the triton autotuner is key to achieve competitive or best performance (e.g. for [flash attention in vLLM](https://github.com/vllm-project/vllm/i…

bringlein updated 1 month ago
7

上一页 1...94 95 96 97 98 99 100...100 下一页

1000+ results for autotune

1000+ results
for autotune