-
Hi,
I'm new in github and MPI (mpiexec) usages, so I try to run a process that can run in more than one thread. So, I used hwthreads. But, the problem is that hwthread is just limited to one node, …
-
我基于twitter-2010数据集测试PageRank算法。在相同配置的机器上,[Gemini论文实现](https://github.com/thu-pacman/GeminiGraph)比Plato的实现(0.1.1 Release)快三倍。请问可能是什么原因?官方是否可以分享一下Benchmark数据?
测试结果显示,Gemini实现和论文中的描述性能接近。而Plato实现性能差很多…
-
全参微调loss可以下降到0.03,效果相对较好;但是lora微调loss在1.2-1.5波动,效果也不好。
`deepspeed --hostfile=$hostfile fine-tune.py --report_to tensorboard --data_path "data/ysx_25588.json" --model_name_or_path ".…
-
大佬们好,我在用openi数据集(大概6500条数据)对VisualGLM进行微调之后,检测模型的推理能力的时候,出现以下情况
![c413986eb8372045aa10b401b63884c](https://github.com/THUDM/VisualGLM-6B/assets/56297762/5ea46e26-b074-43b8-ac01-c2a3935a0bac)
是…
-
### Cloudberry Database version
1.5.3
### What happened
When initialising a new database there are errors thrown during locale checking - the database is initialized correctly so these errors appea…
-
For generating global file identifier (gfid), unifyfs uses a full pathname including the mountpoint. This work fine when the file system is mounted under a consistent mountpoint (e.g., /unifyfs) acros…
-
with open(default_hoster, "r+") as hostfile:
PermissionError: [Errno 13] Permission denied: 'C:\\Windows\\System32\\drivers\\etc\\hosts'
Can you please help me solve this error. I tried to run…
-
### System Info
```shell
deepspeed 0.14.4+hpu.synapse.v1.18.0
optimum-habana 1.14.0
docker image: vault.habana.ai/gaudi-docker/1.18.0/ubuntu22.04/habanalabs/pytorch-ins…
-
I'm encountering an issue where the ncmpi_create function appears to stall when running my application with a high number of MPI processes. Specifically, the program hangs at the ncmpi_create call whe…
-
Ubuntu 20.04
NCCL test suite commit: 7130fa6096466f80b0c310b9a3070b6556c0e158
MPI: either mpich 3.3.2 or openmpi 4.1.2 (doesn't matter which)
NCCL: Either commit 014407367347d9a14fff072c6fb9a4d55e6…