-
[Local collaborative autoencoders](https://sci-hub.ru/https://dl.acm.org/doi/abs/10.1145/3437963.3441808)
[Local latent space models for top-n recommendation](https://sci-hub.ru/https://dl.acm.org/do…
-
Is the following the right way to partition a dataset for PyTorch DDP?
I did not generate my dataset with Petastorm, so I am using make_batch_reader().
Ref: https://github.com/PyTorchLightning/pytorch-lightnin…
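For context on the question above, here is a minimal sketch of one common way to partition data across DDP processes: each rank takes a disjoint, strided slice of the item indices. The helper name `shard_for_rank` is hypothetical and not taken from the issue. In a real job the rank and world size would typically come from `torch.distributed.get_rank()` and `torch.distributed.get_world_size()`, and Petastorm's `make_batch_reader` accepts `cur_shard` and `shard_count` arguments for the same purpose.

```python
def shard_for_rank(num_items, rank, world_size):
    """Return the item indices assigned to one DDP process.

    Hypothetical helper for illustration: items are dealt out
    round-robin, so shards are disjoint and together cover every item.
    """
    if world_size <= 0:
        raise ValueError("world_size must be positive")
    if not 0 <= rank < world_size:
        raise ValueError("rank must satisfy 0 <= rank < world_size")
    # Rank r takes items r, r + world_size, r + 2 * world_size, ...
    return list(range(rank, num_items, world_size))


if __name__ == "__main__":
    # Example: 10 row groups split across 4 processes.
    for r in range(4):
        print(r, shard_for_rank(10, r, 4))
```

Note that strided sharding can leave ranks with different item counts when `num_items` is not divisible by `world_size`; whether that matters depends on how the training loop handles uneven inputs.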
-
Problem Description: Horovod with Spark - Job Not Distributing Across Worker Nodes
**Environment:**
Cluster Setup: 1 Master Node, 2 Worker Nodes
Software Versions:
Horovod: >= 0.19.0
TensorFl…
-
Hello,
I am trying to run your uploaded code with all the required prerequisites installed.
INFO:root:Loading dataset
Traceback (most recent call last):
File "G:\Implementation of the Project\Learning to …
-
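When fine-tuning with LoRA, I found a 32 GB file, mp_rank_00_model_states.pt, in the save directory, along with a 900 MB file, bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt. I am a bit confused, because LoRA should only save the parameters it actually trains.
My finetune_lora.sh is as follows: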
```shell
#!/bin/bash
expo…
-
https://pdfs.semanticscholar.org/5ef7/34327c0f4cb7a2898bbdb1cc5a65e34d09cb.pdf
-
**Describe the bug**
DeepSpeed loads the whole model onto every GPU.
When running Llama2-13b in full precision:
**To Reproduce**
I followed the tutorial in https://www.deepspeed.ai/tutorials/…
yunoJ updated
4 weeks ago
-
[Reduced-rank regression](https://www.sciencedirect.com/science/article/pii/0047259X75900421) seems like a simple and fairly well-established technique. It could be implemented minimally by adding a `…
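To make the request concrete, here is a minimal NumPy sketch of classical reduced-rank regression with identity weighting: fit ordinary least squares, then project the coefficients onto the top right singular vectors of the fitted values. The function name `reduced_rank_regression` and its signature are assumptions for illustration, not an existing API of the project.

```python
import numpy as np


def reduced_rank_regression(X, Y, rank):
    """Least-squares coefficients constrained to a given rank.

    Illustrative sketch (identity weighting, no intercept, no
    regularization): X is (n, p), Y is (n, q), result is (p, q).
    """
    # Unconstrained least-squares solution.
    B_ols, *_ = np.linalg.lstsq(X, Y, rcond=None)
    # Principal directions of the fitted values in response space.
    Y_hat = X @ B_ols
    _, _, Vt = np.linalg.svd(Y_hat, full_matrices=False)
    V_r = Vt[:rank].T  # (q, rank)
    # Project the OLS coefficients onto the leading `rank` directions.
    return B_ols @ V_r @ V_r.T


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(50, 4))
    Y = rng.normal(size=(50, 3))
    B = reduced_rank_regression(X, Y, rank=1)
    print(B.shape, np.linalg.matrix_rank(B))
```

When `rank` equals the number of response columns, the projection is the identity and the result reduces to the ordinary least-squares fit.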
-
hello,
I'm learning the PageRank algorithm and happened to come across your code.
I think you may have forgotten to use the parameter beta from the original PageRank algorithm.
new_rank_vector[child] += (initial_ran…
Tau-J updated
3 years ago
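For reference, a minimal sketch of PageRank power iteration that includes beta, assuming beta denotes the damping factor (the probability of following an outgoing link rather than teleporting). The function name, the graph representation (a dict mapping each node to its list of children), and the dangling-node handling are assumptions for illustration and are not taken from the repository's code.

```python
def pagerank(graph, beta=0.85, iterations=100):
    """Power-iteration PageRank with damping factor beta.

    Illustrative sketch: `graph` maps every node to a list of child
    nodes, and every child must also appear as a key.
    """
    nodes = list(graph)
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Teleportation: every node receives an equal (1 - beta) / n share.
        new_rank = {node: (1.0 - beta) / n for node in nodes}
        for parent, children in graph.items():
            if children:
                # A parent splits beta times its rank among its children.
                share = beta * rank[parent] / len(children)
                for child in children:
                    new_rank[child] += share
            else:
                # Dangling node: spread its damped rank over all nodes.
                share = beta * rank[parent] / n
                for node in nodes:
                    new_rank[node] += share
        rank = new_rank
    return rank


if __name__ == "__main__":
    print(pagerank({"a": ["b", "c"], "b": ["c"], "c": ["a"]}))
```

Without the beta term, rank can get trapped in dead ends or closed cycles; the teleportation share keeps the total rank summing to 1 across all nodes.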
-
Hello,
I greatly enjoyed your paper, and am interested in understanding the rank-based learning approach described in section 4.2. I see code here for the reconstruction of the negative labels …