Closed aarshkshah1992 closed 1 year ago
One note on your A/B output: it completed in around 5 minutes. That may not be enough to reach steady state. With the membership debounce at 5 minutes, you'll see only one round of downvoting, and won't have reached a point where nodes would actually be excluded.
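To make the arithmetic behind that concern concrete, here is a minimal sketch. It assumes the pool is re-evaluated once per debounce interval; `poolRefreshes` and `membershipDebounce` are hypothetical names for illustration, not identifiers from caboose.

```go
package main

import (
	"fmt"
	"time"
)

// membershipDebounce mirrors the 5-minute debounce mentioned in the comment.
// The name is hypothetical; it is not taken from the caboose code.
const membershipDebounce = 5 * time.Minute

// poolRefreshes returns how many complete debounce intervals fit in a run,
// i.e. an upper bound on the rounds of downvoting that could take effect,
// assuming one pool re-evaluation per interval.
func poolRefreshes(runLength, debounce time.Duration) int {
	if debounce <= 0 {
		return 0
	}
	return int(runLength / debounce)
}

func main() {
	for _, run := range []time.Duration{5 * time.Minute, 30 * time.Minute} {
		fmt.Printf("run=%v refreshes=%d\n", run, poolRefreshes(run, membershipDebounce))
	}
}
```

Under that assumption, a run needs to span several debounce intervals before any node can accumulate enough downvotes to be excluded.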
@willscott Fixed the AB output.
@aarshkshah1992 deployed c9035dd from this PR to staging, you can observe things after this timestamp (UTC):
2023/03/01 17:20:00 Starting bifrost-gateway 2023-03-01-adb95e1
I don't know where the change to the interval is, but the orchestrator is getting hit multiple times per second per IP:
147.75.71.197 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4700 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
136.144.59.113 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4745 "-" "bifrost-gateway/2023-03-01-adb95e1"
145.40.87.133 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4975 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
136.144.48.241 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4850 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
188.240.57.10 - - [01/Mar/2023:22:37:23 +0000] "POST /register?ssl=done HTTP/1.1" 200 753 "-" "Saturn/648_5207745"
147.28.129.15 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
136.144.59.113 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4745 "-" "bifrost-gateway/2023-03-01-adb95e1"
86.109.14.251 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 5013 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.75.71.197 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4700 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.75.84.243 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4784 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
136.144.59.113 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4745 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
142.202.255.19 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/local HTTP/1.1" 200 246 "-" "Saturn/648_5207745"
147.28.129.15 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
136.144.59.113 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4745 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
136.144.48.241 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4850 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.75.71.197 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4700 "-" "bifrost-gateway/2023-03-01-adb95e1"
136.144.59.113 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4745 "-" "bifrost-gateway/2023-03-01-adb95e1"
145.40.65.177 - - [01/Mar/2023:22:37:23 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4745 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:24 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
5.44.249.154 - - [01/Mar/2023:22:37:24 +0000] "GET /nodes/local HTTP/1.1" 200 437 "-" "Saturn/648_5207745"
147.28.129.15 - - [01/Mar/2023:22:37:24 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
188.244.117.127 - - [01/Mar/2023:22:37:24 +0000] "POST /register?ssl=done HTTP/1.1" 200 737 "-" "Saturn/648_5207745"
145.40.65.177 - - [01/Mar/2023:22:37:24 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4745 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:24 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.75.84.243 - - [01/Mar/2023:22:37:24 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4784 "-" "bifrost-gateway/2023-03-01-adb95e1"
136.144.59.113 - - [01/Mar/2023:22:37:24 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4745 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:24 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:24 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:24 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
136.144.59.113 - - [01/Mar/2023:22:37:24 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4745 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:24 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
145.40.65.177 - - [01/Mar/2023:22:37:24 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4745 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.75.71.197 - - [01/Mar/2023:22:37:24 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4700 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:24 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
147.28.129.15 - - [01/Mar/2023:22:37:24 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4751 "-" "bifrost-gateway/2023-03-01-adb95e1"
136.144.48.241 - - [01/Mar/2023:22:37:24 +0000] "GET /nodes/nearby?count=1000 HTTP/1.1" 200 4850 "-" "bifrost-gateway/2023-03-01-adb95e1"
@DiegoRBaquero Yeah, that was a bug in this PR. Have pushed a fix.
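For reference, one generic way to guard against this kind of tight polling is to gate pool refreshes behind a minimum interval. The sketch below is an illustration only and is not the fix pushed to this PR; `refreshGate`, `Allow`, and the 5-minute interval are assumptions.

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// refreshGate (hypothetical) permits at most one orchestrator refresh
// per minInterval, no matter how many callers ask.
type refreshGate struct {
	mu          sync.Mutex
	minInterval time.Duration
	last        time.Time
}

// Allow reports whether a refresh may run at time now, and records it if so.
func (g *refreshGate) Allow(now time.Time) bool {
	g.mu.Lock()
	defer g.mu.Unlock()
	if !g.last.IsZero() && now.Sub(g.last) < g.minInterval {
		return false
	}
	g.last = now
	return true
}

// demo replays three refresh attempts: the first, one a second later,
// and one a full interval after the first.
func demo() []bool {
	gate := &refreshGate{minInterval: 5 * time.Minute}
	start := time.Date(2023, 3, 1, 22, 37, 23, 0, time.UTC)
	return []bool{
		gate.Allow(start),
		gate.Allow(start.Add(time.Second)),
		gate.Allow(start.Add(5 * time.Minute)),
	}
}

func main() {
	fmt.Println(demo())
}
```

Passing the clock in as an argument keeps the gate easy to test without sleeping.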
https://github.com/ipfs/bifrost-gateway/commit/4cbc3a75de06f487190982dcb67c2f5ac124b81f with caboose from this PR (https://github.com/filecoin-saturn/caboose/pull/59/commits/a518e617b034cb3aa497d152e171693ea0ceb7fc) deployed to the staging box:
2023/03/02 15:49:36 Starting bifrost-gateway 2023-03-02-4cbc3a7
We need to downvote nodes more gradually so that they get time to recover from temporary failures, Lassie timeouts, being overloaded, etc., without getting in the way of fetch requests. However, once we remove a node from the pool after the gradual downvoting, we should make the node earn its reputation back before we start sending more requests to it. To that end, this PR:
TODO
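The two goals in the description (gradual downvoting, then earned readmission) can be sketched roughly as follows. This is a minimal illustration under assumed rules, not the logic implemented in this PR: every constant, the `node` type, and the `record` method are hypothetical.

```go
package main

import "fmt"

// All constants are illustrative assumptions, not values from caboose.
const (
	maxWeight      = 20 // weight of a fully trusted node
	failurePenalty = 5  // weight lost per failed request
	successReward  = 1  // weight regained per success while in the pool
	probesToRejoin = 3  // consecutive successful probes needed to rejoin
)

// node tracks a hypothetical per-node reputation.
type node struct {
	weight   int
	inPool   bool
	probesOK int
}

func newNode() *node { return &node{weight: maxWeight, inPool: true} }

// record updates the node after one request, or after one probe
// if the node is currently out of the pool.
func (n *node) record(success bool) {
	if !n.inPool {
		if !success {
			n.probesOK = 0 // a failed probe resets the streak
			return
		}
		n.probesOK++
		if n.probesOK >= probesToRejoin {
			// Rejoin with a low weight so trust is rebuilt gradually.
			n.inPool, n.probesOK, n.weight = true, 0, failurePenalty
		}
		return
	}
	if success {
		n.weight += successReward
		if n.weight > maxWeight {
			n.weight = maxWeight
		}
		return
	}
	n.weight -= failurePenalty // gradual: one failure does not evict
	if n.weight <= 0 {
		n.weight, n.inPool = 0, false
	}
}

func main() {
	n := newNode()
	for _, ok := range []bool{false, false, false, false, true, true, true} {
		n.record(ok)
		fmt.Printf("success=%v weight=%d inPool=%v\n", ok, n.weight, n.inPool)
	}
}
```

With these assumed numbers, a node survives three consecutive failures, is removed on the fourth, and returns only after three successful probes in a row, starting from a reduced weight.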
CURRENT Results from apache backend testing