workload-optimization Search Results

1000+ results for workload-optimization


AUTOMATIC1111/stable-diffusion-webui #13496

[Bug]: Random freezing occurs when user interrupting the gen…

### Is there an existing issue for this? - [X] I have searched the existing issues and checked the recent builds/commits ### What happened? In the event that a user decides to abort an unfinished i…

Aeka0 updated 4 months ago
1
pytorch/torchtune #1228

[FEATURE REQUEST] Compile model+loss_fn together

Right now when `compile=True`, only the model is compiled https://github.com/pytorch/torchtune/blob/e10142016798cf84f2e5c638a985014384f400a7/recipes/lora_finetune_single_device.py#L383-L386 We c…

gau-nernst updated 1 month ago
25
huixiangufl/aparapi #54

Fall back with CodeGenException goto -> 0362

What steps will reproduce the problem? I've developed an algorithm (to solve polyominoes) for which the GPU code could be generated. What is the expected output? I expected the algorithm to gen…

GoogleCodeExporter updated 8 years ago
6
RobRich999/Chromium_Clang #26

Chromium Build Discussion

Discussion regarding Chromium builds and related topics.

RobRich999 updated 2 hours ago
422
NCEAS/metadig-engine #122

Kubernetes performance testing

In order to check the feasibility of using Kubernetes (k8s) for use with metadig engine, representative workloads will be run on the k8s cluster (docker-ucsb-1.test.dataone.org, docker-ucsb-2).

gothub updated 1 year ago
4
rui314/mold #250

Consider doing bootstrap (PGO bootstrap)

Similar to GCC, `mold` can easily bootstrap (link `mold` using already built `mold`). Plus you can squeeze some extra performance from PGO (`-fprofile-generate` and `-fprofile-use`), where linking o…

marxin updated 1 year ago
20
root-project/root #15778

Evaluate using Profile-Guided Optimization (PGO) for optimiz…

### Explain what you would like to see improved and how. I checked various compiler optimizations like Profile-Guided Optimization (PGO) on many projects - all the results are available at https://gi…

zamazan4ik updated 2 months ago
13
b4rtaz/distributed-llama #69

[Feature Suggest] From All-Reduce to Ring-All-Reduce

Dear author, ### Challenge and solution This repository has implemented Tensor Parallel, which facilitates the system by distributing the **computation workload** evenly to each node, achieving ne…

zhengpeirong updated 1 month ago
3
tdeneau/aparapi #54

Fall back with CodeGenException goto -> 0362

What steps will reproduce the problem? I've developed an algorithm (to solve polyominoes) for which the GPU code could be generated. What is the expected output? I expected the algorithm to gen…

GoogleCodeExporter updated 9 years ago
6
Xtra-Computing/briskstream #5

yahoo streaming benchmark on briskstream ?

Is there any implementation of YSB on briskstream? Or is there any hint for me to implement a new benchmark?

chenzongxiong updated 4 years ago
10
