Mellanox / libvma

Linux user space library for network socket acceleration based on RDMA compatible network adaptors
https://www.mellanox.com/products/software/accelerator-software/vma?mtag=vma
Other
582 stars 153 forks source link

VMA does not support more then 1 epfd listed #816

Open highfly22 opened 5 years ago

highfly22 commented 5 years ago

I got the following log when running a distributed Tensorflow training job. In socket_fd_api::add_epoll_context function, there is a comment "Currently VMA does not support more then 1 epfd listed". Is there any plan to support that feature? Thanks!

 Pid: 29071 Tid: 29218 VMA DEBUG: epfd_info:263:add_fd() epoll_ctl: fd=34 is already registered with another epoll instance 30, cannot register to epoll 32 (errno=12 Cannot allocate memory)
liranoz12 commented 5 years ago

Hi,

Multiple epfd's support has been tested in the past but caused performance degradation. Hence, there are no plans to support this feature.

Thanks, Liran