atom-archive / xray

An experimental next-generation Electron-based text editor

Help wanted: Benchmarks in xray_core #21

Open · nathansobo opened this issue 6 years ago

nathansobo commented 6 years ago

It would be great to get a PR setting up benchmarks on xray_core in the canonical Rust manner. You can use your imagination on the kinds of operations we want to test, but here are some ideas:

Just getting a basic framework in place with some non-trivial benchmark would be helpful.
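For anyone picking this up: at the time, "the canonical Rust manner" meant the built-in, nightly-only `#[bench]`/`test::Bencher` harness driven by `cargo bench`. Below is a minimal sketch of that shape; the workload is a placeholder for real xray_core operations, and the nightly requirement is exactly the wrinkle the later comments address.

```rust
// lib.rs (sketch): the built-in, nightly-only benchmark harness.
#![feature(test)]

#[cfg(test)]
mod benches {
    extern crate test;

    use self::test::Bencher;

    // Placeholder workload; a real benchmark would exercise xray_core's
    // buffer and editor operations instead of building a String.
    #[bench]
    fn append_lines(b: &mut Bencher) {
        b.iter(|| {
            let mut text = String::new();
            for _ in 0..1_000 {
                text.push_str("the quick brown fox\n");
            }
            text
        });
    }
}
```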

max-sixty commented 6 years ago

If anyone wants to get started on this, here's a config for running benchmarks on nightly without preventing compiles on stable: https://github.com/maxim-lian/xray/commit/bc950a891cf8bd7a3e88a6b81b7ff1a45d6c7244

Run it with:

cd xray_core/
cargo +nightly bench --features "dev"

...or reply here if there are other ways to do it.
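For anyone unfamiliar with that trick, the usual shape is roughly the following sketch (details may differ from the linked commit): declare a `dev` feature in Cargo.toml and only enable the unstable `test` feature when it is active, so a plain `cargo build` on stable never sees it.

```rust
// lib.rs (sketch): gate the unstable harness behind a `dev` feature, assuming
// `dev = []` is declared under [features] in Cargo.toml. The benchmarks only
// compile under `cargo +nightly bench --features "dev"`.
#![cfg_attr(feature = "dev", feature(test))]

#[cfg(all(test, feature = "dev"))]
mod benches {
    extern crate test;

    use self::test::Bencher;

    #[bench]
    fn placeholder(b: &mut Bencher) {
        // Stand-in workload; real benchmarks would drive buffer operations.
        b.iter(|| (0..1_000u32).sum::<u32>());
    }
}
```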

cmyr commented 6 years ago

Another option for micro benchmarks that is stable/nightly friendly is criterion: https://github.com/japaric/criterion.rs.
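For context, a Criterion benchmark works on stable because it ships its own harness instead of the unstable one. A minimal sketch (file name and workload are made up, not taken from the repo):

```rust
// benches/basic.rs (sketch): requires `criterion` as a dev-dependency and a
// Cargo.toml entry like:
//   [[bench]]
//   name = "basic"
//   harness = false
use criterion::{black_box, criterion_group, criterion_main, Criterion};

fn append_lines(c: &mut Criterion) {
    c.bench_function("append 1k lines", |b| {
        b.iter(|| {
            // Placeholder workload; a real benchmark would drive xray_core.
            let mut text = String::new();
            for _ in 0..1_000 {
                text.push_str(black_box("the quick brown fox\n"));
            }
            text
        })
    });
}

criterion_group!(benches, append_lines);
criterion_main!(benches);
```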

ysimonson commented 6 years ago

You can put the benchmarks in a benches directory in the crate root. They'll still run when you call cargo bench, but then you don't have to put them behind a feature flag.
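To illustrate the point: a file under benches/ is compiled as its own crate, and only when `cargo bench` runs, so the nightly-only feature gate never touches the library build. A sketch (file name and workload are made up):

```rust
// benches/editing.rs (sketch): compiled as a separate crate, and only when
// `cargo +nightly bench` runs, so no feature flag is needed in the library.
#![feature(test)]
extern crate test;
extern crate xray_core; // the bench target links the library and can use its API

use test::Bencher;

#[bench]
fn edit_heavy_buffer(b: &mut Bencher) {
    b.iter(|| {
        // Placeholder: a real benchmark would build an xray_core buffer here
        // and apply a batch of edits to it.
        test::black_box((0..10_000u64).sum::<u64>())
    });
}
```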

pranaygp commented 6 years ago

Can this be closed now that #62 is merged?

nathansobo commented 6 years ago

That just put the framework in place. I think we could still use more benchmarks of core functionality, though.

anderoonies commented 6 years ago

i'm taking a look at this but had a question about how atomic the benchmarks should be. right now benchmarks are done for individual functions in one scenario. should the same functions be tested in multiple scenarios, e.g. selecting to the end of a line without any edits vs. selecting to the end of a line that has multiple edits? and how should benchmarks for the same functions, but with different setups, be organized?

thanks!

nathansobo commented 6 years ago

@anderoonies I honestly haven't developed strong opinions on this. My initial thought would be that we'd want to focus on situations involving more complexity. If we optimize those, presumably we'd do well in simpler cases, and conversely, if we're fast on a line without edits and slow on a line with lots of edits it still seems like we'd be too slow. That said, I think each scenario might be different. I have a lot of experience optimizing, but not much experience designing a long-lived benchmark suite. Would be happy to hear perspectives on the best design considerations.
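One possible way to organize "same function, different setup" is a shared, parameterized setup helper with one benchmark per scenario, so the scenario (here, the number of prior edits) is encoded in the benchmark name. A sketch with the built-in harness follows; the helper is a stand-in, and a real version would build an xray_core buffer with that many edits before selecting to the end of a line.

```rust
// Sketch: one #[bench] per scenario, sharing a parameterized setup helper.
#![feature(test)]
extern crate test;

use test::Bencher;

// Stand-in for "select to the end of a line that already carries
// `edit_count` prior edits"; the real benchmark would use xray_core.
fn select_to_end_with_edits(edit_count: usize) -> usize {
    let mut line = String::from("let x = compute_something();");
    for i in 0..edit_count {
        line.insert(i % line.len(), 'x');
    }
    line.len()
}

#[bench]
fn select_to_end_no_edits(b: &mut Bencher) {
    b.iter(|| select_to_end_with_edits(0));
}

#[bench]
fn select_to_end_many_edits(b: &mut Bencher) {
    b.iter(|| select_to_end_with_edits(10_000));
}
```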

anderoonies commented 6 years ago

i'm curious to hear others' experience and input as well, being new to writing benchmarks myself. the existing benchmarks @rleungx added establish a pattern of testing individual functions of the editor API under single, pretty "intense" scenarios. i'm happy to extend that to benchmark the rest of the core API. @nathansobo, as someone very familiar with the underlying implementations, are there any behaviors of the editor you feel benchmarking should focus on? in the rgasplit paper, Briot et al. use "randomly generated traces" for performance evaluations, but i'm not sure what the consensus is on randomness in benchmarking.

nathansobo commented 6 years ago

I'm not sure random edits are as important as sequential edits that simulate what a human would do. But testing against documents containing lots of edits will likely be important.
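One way to build such fixtures is to replay keystroke-like sequential insertions when setting up the document under test. A self-contained sketch follows; the `Buffer` type here is a stand-in so the example compiles on its own, and xray_core's real buffer API may differ.

```rust
use std::ops::Range;

// Stand-in buffer so the sketch is self-contained; a real benchmark would
// use xray_core's buffer, whose API may differ from this.
struct Buffer {
    text: String,
}

impl Buffer {
    fn new() -> Self {
        Buffer { text: String::new() }
    }

    fn edit(&mut self, range: Range<usize>, new_text: &str) {
        self.text.replace_range(range, new_text);
    }
}

// Simulates human typing: each "keystroke" inserts one character immediately
// after the previous one, producing a document with a long sequential edit
// history rather than randomly scattered edits.
fn simulate_typing(buffer: &mut Buffer, text: &str) {
    let mut offset = buffer.text.len();
    for ch in text.chars() {
        let s = ch.to_string();
        buffer.edit(offset..offset, &s);
        offset += s.len();
    }
}

fn main() {
    let mut buffer = Buffer::new();
    simulate_typing(&mut buffer, "fn main() {\n    println!(\"hello\");\n}\n");
    assert_eq!(buffer.text.lines().count(), 3);
}
```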