Critical path towards DHT efficiency and performance

raulk commented 5 years ago

This issue outlines the critical issues to solve on the road towards a solid, robust and performant DHT implementation.

[x] Critical: termination criteria for DHT queries (avoid backtracking unless necessary): #290; there's a WIP PR here: #291.
[x] Critical: provider record saturation. Nodes close to popular content get flooded with provider records. Related: #316 and https://github.com/libp2p/specs/issues/163. The solution proposed in the latter has the property that the more popular the content gets, the more widely it is advertised on the DHT, which makes queries faster.
[x] Critical: dissociating the routing table from the connection table. Currently when a peer disconnects, we drop it from the routing table. This is not proper Kademlia. #283
[x] Critical: participate in the DHT in client-only mode until we receive our first inbound connection via a non-relay transport. This will be tricky, but will help emergent stability and responsiveness. #216
[ ] Important: persisting and seeding the routing table via a Snapshotter and Seeder, using bootstrap peers only as a fallback. #254 #295; WIP PR here: #315.
[x] Important: routing table membership based on peer quality and failure counting. Eject peers who misbehave, present high latency, or fail frequently.
[ ] Cool: an alternative data structure for the routing table.

We could potentially use the brand new libp2p testlab to continuously measure the impact of the changes we make.

If you're willing to help in pushing the DHT to new heights, please comment below ;-)

aarshkshah1992 commented 5 years ago

@raulk I've only just started contributing to the DHT and am currently working on correctly implementing DHT bootstrapping. That task, combined with reading the DHT paper and meandering around the codebase, should give me a fairly good idea of the DHT codebase. Please count me in as and when we start focusing on this epic.

rgrover commented 5 years ago

The routing table is currently a linear collection of kbuckets with increasing common-prefix-lengths, potentially resulting in significant contention for the first few k-buckets.

Assuming that bits is an array-view over the bits of our own PeerId, the first kbucket in the routing table corresponds to the prefix ~bits[0] (first bit flipped), the second to bits[0] ~bits[1], and so on. Bucket[0] will be contended by around half of all available peers.
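To make the bucket mapping concrete, here is a minimal sketch of how a peer lands in a bucket of a linear table. It assumes Kademlia IDs are SHA-256 digests of some peer identifier; `commonPrefixLen` and `bucketIndex` are hypothetical helpers written for this comment, not the library's API.

```go
package main

import (
	"crypto/sha256"
	"fmt"
	"math/bits"
)

// commonPrefixLen returns the number of leading bits that a and b share.
// (Hypothetical helper for illustration.)
func commonPrefixLen(a, b []byte) int {
	n := len(a)
	if len(b) < n {
		n = len(b)
	}
	for i := 0; i < n; i++ {
		if x := a[i] ^ b[i]; x != 0 {
			// The first differing byte decides the prefix length.
			return i*8 + bits.LeadingZeros8(x)
		}
	}
	return n * 8
}

// bucketIndex maps a peer to a bucket in a linear table of numBuckets
// buckets (numBuckets must be at least 1). Peers sharing a longer prefix
// than the table can express all fall into the last bucket.
func bucketIndex(self, peer []byte, numBuckets int) int {
	cpl := commonPrefixLen(self, peer)
	if cpl >= numBuckets {
		return numBuckets - 1
	}
	return cpl
}

func main() {
	// Assumption: IDs are SHA-256 digests, so they are roughly uniform.
	self := sha256.Sum256([]byte("self"))
	counts := make([]int, 4)
	for i := 0; i < 1000; i++ {
		p := sha256.Sum256([]byte(fmt.Sprintf("peer-%d", i)))
		counts[bucketIndex(self[:], p[:], len(counts))]++
	}
	// With uniform IDs, about half of the peers should land in counts[0].
	fmt.Println(counts)
}
```

Because the first bit of a uniformly distributed ID differs from ours with probability 1/2, roughly half of all peers compete for the k slots in Bucket[0].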

The original Kademlia paper recommends an LRU eviction policy for buckets filled to capacity, but libp2p-kad-dht only ever evicts disconnected or lost peers. With k set to 20, this means that once we have learned about roughly 40 peers and Bucket[0] is full, nearly half of all new peers are unable to enter the routing table.

Possible solutions would be to either a) maintain a replacement list alongside each kbucket for nodes which are waiting for entry to the kbucket, b) allow the first few k-buckets to have capacity larger than k, c) evict peers from kbuckets based on some policy (such as LRU or latency), or d) use a recursive tree data-structure (instead of an array) and allow splits (up to a certain depth b) for kbuckets not on the prefix-path of own-PeerId, as suggested in the Kademlia paper.
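Options a) and c) can be combined. The following is a rough sketch, not the current libp2p-kad-dht behaviour: a bucket that keeps its peers in least-recently-seen order and holds overflow candidates in a replacement list, promoting one when a peer is evicted. All type and method names here are hypothetical.

```go
package main

import "fmt"

// bucket is a hypothetical k-bucket with a replacement list.
type bucket struct {
	k            int
	peers        []string // least recently seen first
	replacements []string // waiting candidates, oldest first
}

func newBucket(k int) *bucket { return &bucket{k: k} }

func indexOf(list []string, id string) int {
	for i, v := range list {
		if v == id {
			return i
		}
	}
	return -1
}

func remove(list []string, i int) []string {
	return append(list[:i], list[i+1:]...)
}

// seen records contact with a peer and reports whether the peer is in
// the main list afterwards.
func (b *bucket) seen(id string) bool {
	if i := indexOf(b.peers, id); i >= 0 {
		// Known peer: move it to the most-recently-seen end.
		b.peers = append(remove(b.peers, i), id)
		return true
	}
	if len(b.peers) < b.k {
		b.peers = append(b.peers, id)
		return true
	}
	// Bucket full: remember the peer as a replacement candidate.
	if i := indexOf(b.replacements, id); i >= 0 {
		b.replacements = remove(b.replacements, i)
	}
	b.replacements = append(b.replacements, id)
	if len(b.replacements) > b.k {
		b.replacements = b.replacements[1:] // drop the oldest candidate
	}
	return false
}

// evict removes a peer (for example after a failed liveness check) and
// promotes the most recently seen replacement, if there is one.
func (b *bucket) evict(id string) (string, bool) {
	i := indexOf(b.peers, id)
	if i < 0 {
		return "", false
	}
	b.peers = remove(b.peers, i)
	if n := len(b.replacements); n > 0 {
		promoted := b.replacements[n-1]
		b.replacements = b.replacements[:n-1]
		b.peers = append(b.peers, promoted)
		return promoted, true
	}
	return "", false
}

func main() {
	b := newBucket(2)
	b.seen("a")
	b.seen("b")
	b.seen("c") // bucket is full, so "c" waits in the replacement list
	promoted, ok := b.evict("a")
	fmt.Println(promoted, ok, b.peers)
}
```

The eviction trigger is left to the caller on purpose: it could be a failed ping of the least recently seen peer, as in the Kademlia paper, or a latency-based policy.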

It would also help to periodically prune unreachable peers out of the routing table in a proactive manner.

raulk commented 5 years ago

@aarshkshah1992 – hey, thanks for reaching out! https://github.com/libp2p/go-libp2p-kad-dht/pull/315 (PR for persisting the routing table) is heading in a good direction and would benefit from somebody pushing it over the finish line. Is this something that catches your attention?

aarshkshah1992 commented 5 years ago

@raulk Perfect! Could you please assign it to me? I'll get back with my comments as soon as I've gone through the existing comments and code.

aarshkshah1992 commented 5 years ago

@raulk What's next for us here? Is there anything I can help with?