microsoft / Freeflow

High performance container overlay networks on Linux. Enabling RDMA (on both InfiniBand and RoCE) and accelerating TCP to bare metal performance. Freeflow requires zero modification on application code/binary.
MIT License
597 stars 88 forks source link

Floating point exception(core dump) while running ib_send_bw #14

Open UntaggedRui opened 4 years ago

UntaggedRui commented 4 years ago

Problem

I'm reproducing freeflow on my own machine, and when I run ib_send_bw in two containers located at the same physical machine, Floating point exception(core dump) occured. The error is also in two containers located at two machines while these two machines can run ib_send_bw correctely. I am strictly following the quick start of github. Here is my environment and how I run freeflow.

Environment

host IPs and virtual IP to host IP mapping

I have two machines, 192.168.2.203 and 192.168.2.206. They are connected by weave overlay.I have modied host IPs and virtual IP to host IP mapping in my code. In ffrouter.h#L76,

const char HOST_LIST[HOST_NUM][16] = {
    "192.168.2.13",
    "192.168.2.15"
};

In ffrouter.cpp#L215,

    this->vip_map["10.47.128.0"] = "192.168.2.203";
    this->vip_map["10.47.0.5"] = "192.168.2.206";

Implementation

At host 203, enter freeflow router container and excute ./router router1, and run a container named node1, whose ip is 10.47.128.0, and run ib_send_bw. At host 206, enter freeflow router container and excute ./router router1, and run a container named node2, whose ip is 10.47.0.8, and run ib_send_bw 10.47.128.0. Then the error happened. In container which run ib_send_bw, log is error In container which run ib_send_bw 10.47.128.0, log is error2 How should I solve it? And can you tell me your gcc version ?

sglee0323 commented 3 years ago

I have the same problem... It has been solved? Can I get the solution?

wwwzrb commented 2 years ago

Hi, we encountered a similar problem when trying to reproduce Freeflow with the MT27710 family [ConnectX-4 ]. Do you have any suggestions for this problem?

image