Currently the bandwidth manager enforces a rate limit per pod, and all flows in a pod share the same queue. It uses a tail-drop policy with a 2-second threshold, which can cause bufferbloat and up to 2 seconds of queuing latency when there are many TCP connections.

Here we introduce ECN marking to solve the issue. By default, the marking threshold is set to 1ms.

To test, we ran a pod with a 100Mbps egress limit and 128 TCP connections inside the pod as background traffic, and compared the TCP_RR latency with and without ECN marking.
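The marking decision can be sketched as follows. This is a minimal illustration only, assuming an EDT-style (earliest-departure-time) scheduler where each packet carries a projected departure timestamp; the constant names and function are hypothetical, not the actual eBPF datapath code:

```python
# Hedged sketch (not the real datapath): how an EDT-based rate limiter
# could choose between forwarding, ECN-marking, and dropping a packet
# based on its projected queuing delay. All names are illustrative.

DROP_HORIZON_NS = 2_000_000_000  # tail-drop threshold: 2 s
ECN_HORIZON_NS = 1_000_000       # ECN marking threshold: 1 ms (default)

def schedule_packet(next_departure_ns: int, now_ns: int) -> str:
    """Return the action for a packet whose earliest departure time,
    per the pod's rate limit, is next_departure_ns."""
    delay_ns = next_departure_ns - now_ns
    if delay_ns > DROP_HORIZON_NS:
        return "drop"      # queue already ~2 s deep: tail drop
    if delay_ns > ECN_HORIZON_NS:
        return "mark-ce"   # signal congestion early via the ECN CE bit
    return "forward"       # under the marking threshold, send as-is
```

The idea is that a packet facing, say, 5 ms of projected queuing gets CE-marked instead of sitting silently in the queue, so ECN-capable senders back off long before the 2-second drop horizon is reached.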
- [x] All code is covered by unit and/or runtime tests where feasible.
- [x] All commits contain a well written commit description including a title, description and a `Fixes: #XXX` line if the commit addresses a particular GitHub issue.
- [ ] If your commit description contains a `Fixes: <commit-id>` tag, then please add the commit author[s] as reviewer[s] to this issue.
| Method | Avg Latency |
| --- | --- |
| with-ECN | 3.1ms |
| without-ECN | 2247.3ms |
Fixes: #29083