-
### 🐛 Describe the bug
using deepspeed zero_stage3 cpu offload sft Qwen1.5-72B-Chat model,the zero config is
```
"zero_optimization": {
"stage": 3,
"offload_optimizer": {
…
-
> This paper presents the application of the Swept rule for two-dimensional grids on heterogeneous architectures.
The Swept rule was introduced in 2016 (ref. [1]), extended to two-dimensional grids i…
-
### Description
As a node operator and miner, I would like objective data to know:
* If my machines' configurations are not optimal,
* or if my network connections are insufficient for reliabl…
-
Hello,
I am encountering an issue with the GPT-NeoX library. When I set either `pipe_parallel_size` or `model_parallel_size` to 2, I get the following assertion error:
```
[2024-07-11 06:28:59,…
-
It seems there is a 16kb limit on the amount of user_data that can be passed to an EC2 instance when starting up.
We serialize the local Dask config and pass it to the scheduler and workers via the…
-
### Feature description
@ma921 has developed some great ideas e.g. as described in [this paper](https://www.sciencedirect.com/science/article/pii/S2405896323014763) and more recently [SOBER](https://…
-
### Checklist
- [ ] The issue exists after disabling all extensions
- [ ] The issue exists on a clean installation of webui
- [ ] The issue is caused by an extension, but I believe it is caused by a …
-
# Open Grant: Decentralized Edge Compute Cloud
*deMEC Cloud Computing*
**integration-adoption**
**Proposer:** `fruitbox12`
**Do you agree to open source all work you do on behalf of this …
-
### Team Name:
QUANTIFY
### Project Description:
The diagrammatic approach to quantum computations pioneered in [1,2] has been extended to quantum circuit compilation and optimisation [3].…
-
On eight m4.2xlarges I created the following dataset
```python
N = 1e8
beta = np.array([-1, 0, 1, 2])
M = 4
chunks = 1e6
seed = 20009
X = da.random.random((N, M), chunks=(chunks, M))
z0 =…