-
- [ ] [Comparing LLM Performance: Introducing the Open Source Leaderboard for LLM APIs](https://www.anyscale.com/blog/comparing-llm-performance-introducing-the-open-source-leaderboard-for-llm)
# Comp…
-
In my [book](https://www.bwplotka.dev/book), I mentioned amazing work [Vitess](https://benchmark.vitess.io/) has done. They have nightly run of unit and macro tests on some cloud provider. It would be…
-
I've been working to create a test suite for determining how well PDBFixer works. The idea is to have an objective way of telling whether or not a code change makes it better. Basically, we need a l…
-
Since Continue works with any model, there's a ton of prompt engineering needed to optimize for each of them.
The most important area for work is the /edit slash command. GPT-4 is able to handle a ve…
-
201910161900 ~ @ 5F Lounge, Wantedly, Inc. Tokyo HQ
## WHY
Go 完全に理解したい
## WHAT
とりあえず読みたいものを書いていきましょう!
-
Dear Vadim,
DIANN is really helpful for our proteomic research.
We recently met one problem. For example, in one batch of over 300 continuous sequenced data, 12 of them could not be processed in o…
-
-
## What is arewefastyet
Arewefastyet is the automated and continuous benchmarking platform for Vitess (https://github.com/vitessio/vitess). It automatically performs different types of benchmarks o…
-
Maybe this is a naive question, but these typechecker plugins work with GHCi as well, right?
Using Ghcid to invoke GHCi with these plugins enabled will provide an IDE-like experience. All that's ne…
-
We should track method performance using a benchmark suite like @alimanfoo mentioned in https://github.com/pystatgen/sgkit/pull/36#issuecomment-658893949.
It would be ideal if this ran as a part …