oneccl Search Results - Githubissues

199 results
for oneccl

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

oneapi-src/oneCCL #121

device_dir hard code to /dev/dri/by-path/

Hi, developers, I found the variable `device_dir` is hard coded to `/dev/dri/by-path/` (see code [here](https://github.com/oneapi-src/oneCCL/blob/master/src/common/global/ze/ze_fd_manager.cpp#L151)), …

zhouyu5 updated 3 weeks ago
4
oneapi-src/oneCCL #109

Allreduce cpu example fails with CCL_WORKER_COUNT > 1

I started playing with allreduce example from the main repository https://github.com/oneapi-src/oneCCL/blob/master/examples/cpu/cpu_allreduce_test.cpp . I modified it slightly by increasing the buf…

piotrchmiel updated 4 months ago
3
uxlfoundation/open-source-working-group #102

oneTBB documented decision making

Define processes for decision making within the oneTBB project. This should include a RFC process for new design and feature proposals (see oneDNN for example - PR tagged RFC that links to the proposa…

rodburns updated 2 weeks ago
3
intel/llm-on-ray #127

Add ipex extra in pyproject.toml to use restricted transform…

IPEX has restriction on transformers version, but llm-on-ray doesn't have. To verify IPEX and other llm-on-ray functions in parallel in CI, we can add a new ipex extra in pyproject.toml with right tra…

jiafuzha updated 6 months ago
1
oneapi-src/SYCLomatic #1225

Migration of ncclCommInitAll is not supported ?

Please see the example https://docs.nvidia.com/deeplearning/nccl/user-guide/docs/examples.html#example-1-single-process-single-thread-multiple-devices Thanks.

jinz2014 updated 1 year ago
1
oneapi-src/SYCLomatic #1226

Migration of ncclGroupStart and ncclGroupEnd not supported ?

Please see the example https://docs.nvidia.com/deeplearning/nccl/user-guide/docs/examples.html#example-1-single-process-single-thread-multiple-devices

jinz2014 updated 1 year ago
1
microsoft/DeepSpeed #5313

[BUG] Failed for using cpu for pipeline based training acro…

**Describe the bug** I have two ubuntu machines, and with 10Gb/s erthnet cable connected and I want to use deepspeed to use these two machines to run a model training with pipeline parallel, and …

xuanhua updated 2 months ago
10
oneapi-src/oneCCL #13

[Improvement] Allow multiple CCL inits from same process but…

Currently we can initialize multiple XGBoost Rabit instances from same process but from different thread. In Spark, its possible to have multiple tasks run on same executor. A executor is single JVM p…

umamaheswararao updated 3 years ago
3
VincyZhang/intel-extension-for-transformers #17

failed to create the serving

I tried to create the serving on my system, but failed with the below error: (emon_analyzer) [root@SPR-1 emon_data_analyzer]# neuralchat_server start --config_file ./config/neuralchat.yaml 2024-03-1…

VincyZhang updated 5 months ago
5
intel/intel-extension-for-pytorch #599

Communication and compute on separate Streams do not overlap

### Describe the bug Communication and computation do not appear to overlap when launching kernels in different `xpu.Stream`s (on Intel GPU Max 1550s). Being able to overlap communication and commun…

garrett361 updated 3 weeks ago
11

上一页 1...1 2 3 4 5 6 7...20 下一页

199 results for oneccl

199 results
for oneccl