-
## Description
Hi, I am trying to use sparse embeddings to train a recommendation model on the Criteo dataset.
Here is my network:
```python
class CtrDnn(nn.HybridBlock):
    def __init__(self, spars…
```
-
-
Author: @vqd8a
-
**Concisely describe the proposed feature**
I would like to serialize and deserialize sparse data structures to and from a buffer or a file. This would benefit online transmission and data logging/replay.
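To make the request concrete, here is a minimal sketch of the kind of round trip meant here. It is not this library's API: the function names (`dump_coo`, `load_coo`, `to_bytes`, `from_bytes`), the dictionary-based COO representation, and the binary layout are all assumptions chosen for illustration, using only the Python standard library.

```python
import io
import struct

# Hypothetical binary layout (an assumption, not any library's format):
#   header: rows, cols, nnz as little-endian uint64
#   body:   nnz records of (row: uint64, col: uint64, value: float64)
_HEADER = struct.Struct("<QQQ")
_ENTRY = struct.Struct("<QQd")


def dump_coo(shape, entries, stream):
    """Write a sparse matrix given as {(row, col): value} to a binary stream."""
    rows, cols = shape
    stream.write(_HEADER.pack(rows, cols, len(entries)))
    # Sort so that the same matrix always produces the same bytes.
    for (r, c), v in sorted(entries.items()):
        stream.write(_ENTRY.pack(r, c, v))


def load_coo(stream):
    """Read back (shape, entries) as written by dump_coo."""
    rows, cols, nnz = _HEADER.unpack(stream.read(_HEADER.size))
    entries = {}
    for _ in range(nnz):
        r, c, v = _ENTRY.unpack(stream.read(_ENTRY.size))
        entries[(r, c)] = v
    return (rows, cols), entries


def to_bytes(shape, entries):
    """Serialize to an in-memory buffer, e.g. for network transmission."""
    buf = io.BytesIO()
    dump_coo(shape, entries, buf)
    return buf.getvalue()


def from_bytes(data):
    """Deserialize from a bytes object produced by to_bytes."""
    return load_coo(io.BytesIO(data))
```

Because `dump_coo` and `load_coo` take any binary stream, the same pair would cover both use cases: an `io.BytesIO` buffer for transmission, and a file opened in `"wb"` / `"rb"` mode for logging and replay.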
*…
-
Gyroaverages may be crucial for providing a physical cut-off to the wavenumber spectrum required to be resolved in the moment_kinetics model.
Initial experiments implementing an ion gyroaverage is …
-
Is it needed? If so, how should this operation be performed on the count matrix: based on the expression of negative probes, or on something else?
-
# 🚀 Feature
I was sad to see how many things in `python -m xformers.info` weren't enabled on Windows, so I set out to do something about it.
Literally all that needs to be done is an expansion of…
-
### 🐛 Describe the bug
```python
import torch
import torch.distributed.elastic.multiprocessing
@torch.distributed.elastic.multiprocessing.errors.record
def Main():
    torch.distributed.…
```
-
-
The setup is two Linux machines, each with two A100 40G GPUs: A100 (40G) * 2
The training commands are as follows. The master node command is CUDA_VISIBLE_DEVICES=0,1 NNODES=2 NODE_RANK=0 NPROC_PER_NODE=2 MASTER_ADDR=127.0.0.1 swift sft --model_type qwen1half-7b-chat --model_id_or_path /…