-
rpc_server.cc:112] Init RDMA failed!Create rdma server failed!
Describe your problem
---------------------
A clear and concise description of what your problem is. It might be a bug,
a feat…
-
### Describe the feature
Valkey support now RDMA. This commit was merged and will be included as part of Valkey-8 https://github.com/valkey-io/valkey/pull/477
### Use Case
The required su…
-
```python benchmark.py --block-size 32 --size 128 --rdma```
when batch is big (> 4096), it takes around 1x ms. (gcc -O0)
this is too slow:
rdma_write:
mm->allocate add batch allocate, if it coul…
-
Describe your problem
---------------------
Support for specifying RDMA devices if there exist more than one RDMA device.
-
**Describe the bug (描述bug)**
应用场景可以简化为client发起rpc请求,server收到请求后取回数据,并通过response attachment返回,追踪日志看:
1. fuse->rpc->worker 完成
2. worker收到响应->take data-> append to attachment完成
3. fuse侧hang住,直到报错
…
-
**Describe the bug (描述bug)**
```
W0528 17:36:56.793694 510 external/brpc/src/brpc/input_messenger.cpp:240] Close Socket{id=1248 fd=1395 addr=10.156.8.12:24452:24141} (0x55c9fa744b40) due to unknow…
-
I train distributed on 2 nodes 8 H100 use repo llama-factory
scripts:
Node 1:
export NCCL_IB_GID_INDEX=4
export NCCL_IB_HCA=mlx5
export NCCL_IB_DISABLE=0
export NCCL_DEBUG=INFO
export NCCL_DEBUG_SUBS…
-
Hi all,
I have question about sw_app (rdma_service)
1. How wback works ? - (used in function bThread::checkCompleted)
2. How to get ctid ? - (used in function Constructor bThread)
-
![image](https://github.com/user-attachments/assets/fc0b29fd-2a87-4a09-8147-e51500b8ac63)
"ping -I source_rdma_ip dst_rdma_ip" worked
What's the reason
-
Describe your problem
---------------------
A clear and concise description of what your problem is. It might be a bug,
a feature request, or just a problem that need support from the vineyar…