gpu-optimization Search Results

1000+ results
for gpu-optimization

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ultralytics/yolov5 #13399

why different optimizer train get different result

### Search before asking - [X] I have searched the YOLOv5 [issues](https://github.com/ultralytics/yolov5/issues) and [discussions](https://github.com/ultralytics/yolov5/discussions) and found no simi…

tank1530532 updated 3 days ago
2
ctarver/NR_GPU_LDPC #1

Question about CPU time

Hi @ctarver I finally got some time to look into this code before my thesis defense. Sorry for the delay. I took a quick look and collected initial profile data. If I understand it correct, the …

Jokeren updated 5 months ago
1
comfyanonymous/ComfyUI #1860

ComfyUI misdetects vram as shared when running on intel macs…

I got an mac pro with Radeon RX 5700 gpu, so I tried to conduct tests. ComfyUI misdetects vram as shared when running on intel macs with dedicated gpu. Pytorch is nightly version installed with `pip3…

RarogCmex updated 1 month ago
2
jax-ml/jax #23410

Improve CuSolver errors diagnostics

### TLDR: Often when writing scientific algorithms we have to use some routines from cuSolver, like svd/eigh/qr. Those routines sometimes fail with unclear error messages that are not easy to unders…

PhilipVinc updated 2 months ago
8
maxwang967/MetaTTE #1

Unable to multiprocess using multiple GPUs

Hi @maxwang967 , I am unable to multiprocess my training epoch using multiple GPUs. Can you please help here ? Regards, Saikat

saihow1999 updated 1 week ago
2
JLi69/voxelworld #7

Optimizations

Currently each voxel vertex consists of 40 bits: 8 bits for x (though only 6 are used) 8 bits for y (though only 6 are used) 8 bits for z (though only 6 are used) 8 bits for "block id" (which is d…

JLi69 updated 1 month ago
4
colmap/colmap #2643

CUDA-accelerated BA / sparse solvers

**Is your feature request related to a problem? Please describe.** An early experiment showed that there was large speed-up using _dense_ solvers https://github.com/colmap/colmap/pull/2161 The goa…

pwais updated 3 months ago
5
microsoft/vscode #232220

augmented_images = data_aug(img)

Type: Bug Auto-generated text from notebook cell performance. The duration for the renderer, VS Code Builtin Notebook Output Renderer, is slower than expected. Execution Time: 42ms Renderer Duration…

Debasish1134 updated 2 weeks ago
1
msr-fiddle/pipedream #12

optimizer speed for resnet101

Hi, I'm trying to compare resnet101 with model parallelism and your pipeline parallelism using a nvprof. For this one, I'm trying to make an optimization code to launch. I launched the python co…

jeageun updated 5 years ago
3
pytorch/pytorch #113590

Implement FlashFFTConv algorithm

### 🚀 The feature, motivation and pitch * FlashFFTConv is a faster version of FFTConvolution similar to how FlashAttention is a faster version of attention (on GPUs). * We should upstream this op…

Skylion007 updated 11 months ago
4

上一页 1...18 19 20 21 22 23 24...100 下一页

1000+ results for gpu-optimization

1000+ results
for gpu-optimization