celestiaorg / celestia-app

Celestia consensus node
https://celestiaorg.github.io/celestia-app/
Apache License 2.0
345 stars 290 forks source link

Sanity check testground #3147

Open evan-forbes opened 8 months ago

evan-forbes commented 8 months ago

We should add sanity checks in testground to ensure that testground is not creating a bottleneck in throughput.

These sanity checks could involve deep diving into resource usage, such as:

To ensure that something more complex is not wrong with testground, we should run similar if not identical experiments on a different backend such as knuu. Depending on the results of an experiment, the results of the experiments that we've ran so far might need to be revisited. If the results of the experiments are similar, then we are more confident in the results provided by both backends.

Acceptance Criteria

Conduct the following experiments in both knuu and testground:

Assess the sanity of the testground results using the following metrics:

Bonus / Next

We can additionally or as the next step address https://github.com/celestiaorg/celestia-core/issues/1262.

evan-forbes commented 7 months ago

After adding more precise tracing for bandwidth, we can see that the entire allocated bandwidth for the 2 node experiment in testground is not fully being used. Roughly ~75% (~75MB/s out of the 100MB/s allocated) can be observed.

bandwidth-2-val

We should compare the tracing data seen in knuu with these results. We expect them to be similar if not identical. We should also compare the results of the same experiment with less allocated bandwidth. More information on the parameters / procedure can be found the two node follup up write up.

evan-forbes commented 5 months ago

to update this, we are starting on santiy test analysis using the two node data. Beyond the two node, we are blocked on getting knuu capable of running 100 nodes https://github.com/celestiaorg/celestia-app/issues/3488.