oVirt / vdsm

The Virtual Desktop Server Manager
GNU General Public License v2.0
161 stars 201 forks source link

[feature] Add support infiniband nuc and RDMA subsystem #122

Closed slavonnet closed 2 years ago

slavonnet commented 2 years ago

Lets try ....

1; Add IB to Setup Host interface for assign static IP 2, Add IB network to claster for "Migration network" type select

  1. Add support RDMA for:
    • Migrations (simple use rdma://)
    • nvme over fabric , nvme target, nvme-fc (simple modprobe and few cmds). Add blk dev to nameT and all host have access with latancy smaller than sas!
    • NFS over DRMA (simple flag/mount param)
    • iSer (iSCSI Over rdma) (simple modprobe)
  2. Add option for emulate RDMA using rdma_rxe (software rdma. Simple add bind to any eth. thats all)
  3. Add support for VMA (accelerate tcp/udp connect over rdma layer) and SPDK (many devices, drivers, agoritms and more) accelerators
  4. Add qemu vhost rdma/ring for fast over VM internal fabrics support. Also OVN support RDMA backplain switch (vxlan and more)
  5. Add support for RDMA Paravirtulisation device (assign RDMA dev to host/ring and every VM can emulate rdma nic)
  6. Look to rdma multicast for locks, events, quorum and etc tasks
  7. Loks to PCI assign device to any VM in ANY host!

few days work for half of this list and will get great profit!

P.S.

VDSM while RR jobs/cheksthis is most baddly (OSCAR prize!) architectute that i see.... Seriosly... sync tasks, big latancy between queue/tasks, error paths from begin in 80% times, reset hosts in 80% times bug in code or minor fails.... At most cases if vdsm not found hosted-engine its remount storages, restart services, restart hosts even other VM is UP. Any trys to migrate/pause/save/snapshot VMs? Any trys to detect errors from events, logs? Look to log! 90% work in vdsm is overhead and big latancy queue!

Change VDSSM ARCH to

Please think about latancy, bottleneck, perfomance and TIME!

Thanks for watching and hacking! Beast regalst, best im, best you, with love and peace!

sandrobonazzola commented 2 years ago

@nirs @bennyz @ahadas can you please review?

nirs commented 2 years ago

Thanks @slavonnet but I don't think this issue is helpful. Adding infiniband support is not something that can be added in vdsm, this affects the entire system (engine, hosted engine, libvirt?, qemu?).

I suggest you start a discussion in the ovirt devel mailing list. The most important thing in this discussion why and how infiniband support is useful to the oVirt community.

The second part about vdsm architecture is not clear to me. Are you sure you understand what is the purpose of vdsm and how it operates? If will be more productive if you can identify specific issues in vdsm.

mz-pdm commented 2 years ago

As for RDMA migrations, AFAIK they had been considered by oVirt several years ago but were abandoned later. I think they are still not considered very reliable on the platform level.

nirs commented 2 years ago

We got no additional info from @slavonnet and I don't think the current issue is useful.

Feel free to reopen or create more specific and detailed issues instead.