-
## List of Non-Deterministic Operations in PyTorch
The following operations in PyTorch exhibit non-deterministic behavior according to the ```torch.use_deterministic_algorithms``` [documentation](h…
-
The discussion in #40 regarding implementation touches on performance of existing ICU4C code vs. the pre-existing Rust module [unicode-normalization](https://github.com/unicode-rs/unicode-normalizatio…
-
It will likely that we will want to integrate llama.cpp (or one of its available rust bindings) to our stack. It will be important to have comparison benchmarks. The following is required
- [ ] Ben…
-
There is presently no benchmark suite for Numba’s CUDA target, and there is a gap between Numba’s performance and the maximum achievable. To support performance optimization efforts, a benchmark suite…
-
Samples/examples are where many users will start on how to make use of this library. It is extremely frustrating for users when these contain errors of any sort and the current CI will only catch buil…
-
One useful feature of the original sightglass code was the ability to run the benchmarks as native machine code in order to form a baseline for comparison. If we migrate this functionality from `webui…
-
**Is your feature request related to a problem? Please describe.**
I'm interested in hybrid FSDP where the model is replicated across nodes and sharded within node.
My understanding is that this c…
-
spacy benchmarks model inference speed in [words per minute](https://spacy.io/api/cli/#benchmark-speed).
This could be useful info for model comparison.
Stanza is painfully slow, udpipe is very fa…
-
- [ ] #5
- [ ] #6
- [ ] #7
- [ ] #8
- [ ] #9
- [ ] #10
-
I was going through the implementation of the Lengauer-Tarjan algorithm in BGL's domibator_tree.hpp and have found that there is a vector of std::deque to store buckets: vertices with the given semido…