-
### Is there an existing issue for this?
- [X] I have searched the existing issues and checked the recent builds/commits
### What happened?
In the event that a user decides to abort an unfinished i…
Aeka0 updated
4 months ago
-
Right now when `compile=True`, only the model is compiled
https://github.com/pytorch/torchtune/blob/e10142016798cf84f2e5c638a985014384f400a7/recipes/lora_finetune_single_device.py#L383-L386
We c…
-
```
What steps will reproduce the problem?
I've developed an algorithm (to solve polyominoes) for which the GPU code could
be generated.
What is the expected output?
I expected the algorithm to gen…
-
Discussion regarding Chromium builds and related topics.
-
To check the feasibility of using Kubernetes (k8s) with metadig engine, representative workloads will be run on the k8s cluster (docker-ucsb-1.test.dataone.org, docker-ucsb-2).
-
Similar to GCC, `mold` can easily bootstrap (link `mold` using an already built `mold`).
Plus you can squeeze some extra performance from PGO (`-fprofile-generate` and `-fprofile-use`), where
linking o…
-
### Explain what you would like to see improved and how.
I checked various compiler optimizations like Profile-Guided Optimization (PGO) on many projects - all the results are available at https://gi…
-
Dear author,
### Challenge and solution
This repository has implemented Tensor Parallel, which facilitates the system by distributing the **computation workload** evenly to each node, achieving ne…
-
```
What steps will reproduce the problem?
I've developed an algorithm (to solve polyominoes) for which the GPU code could
be generated.
What is the expected output?
I expected the algorithm to gen…
-
Is there any implementation of YSB on briskstream?
Or is there any hint for me to implement a new benchmark?