continuous-benchmarking Search Results

1000+ results for continuous-benchmarking

irthomasthomas/undecidability #766

Comparing LLM Performance: Introducing the Open Source Leade…

- [ ] [Comparing LLM Performance: Introducing the Open Source Leaderboard for LLM APIs](https://www.anyscale.com/blog/comparing-llm-performance-introducing-the-open-source-leaderboard-for-llm) # Comp…

irthomasthomas updated 8 months ago
1
thanos-io/thanos #5764

Add automated nightly micro and macro benchmarks

In my [book](https://www.bwplotka.dev/book), I mentioned the amazing work [Vitess](https://benchmark.vitess.io/) has done. They have a nightly run of unit and macro tests on some cloud provider. It would be…

bwplotka updated 1 year ago
3
openmm/pdbfixer #59

Test suite for optimizing PDBFixer

I've been working to create a test suite for determining how well PDBFixer works. The idea is to have an objective way of telling whether or not a code change makes it better. Basically, we need a l…

peastman updated 9 years ago
4
continuedev/contribution-ideas #7

Prompt Engineering

Since Continue works with any model, there's a ton of prompt engineering needed to optimize for each of them. The most important area for work is the /edit slash command. GPT-4 is able to handle a ve…

sestinj updated 9 months ago
2
wantedly/gophers-code-reading-party #29

20191016 Gophers Code Reading Party

201910161900 ~ @ 5F Lounge, Wantedly, Inc. Tokyo HQ ## WHY I want to fully understand Go ## WHAT For now, let's write down whatever we want to read!

izumin5210 updated 5 years ago
5
vdemichev/DiaNN #509

Some raw data could not be processed

Dear Vadim, DIANN is really helpful for our proteomics research. We recently ran into a problem. For example, in one batch of over 300 continuously sequenced data files, 12 could not be processed in o…

chenliangyu18 updated 2 years ago
5
igraph/rigraph #1338

How do we check changes don't affect performance

maelle updated 7 months ago
6
vitessio/arewefastyet #525

LFX third iteration of UI improvement: Shadcn, Typescript an…

## What is arewefastyet Arewefastyet is the automated and continuous benchmarking platform for Vitess (https://github.com/vitessio/vitess). It automatically performs different types of benchmarks o…

frouioui updated 1 month ago
14
MercuryTechnologies/ghc-specter #6

GHCi Support

Maybe this is a naive question, but these typechecker plugins work with GHCi as well, right? Using Ghcid to invoke GHCi with these plugins enabled will provide an IDE-like experience. All that's ne…

friedbrice updated 1 year ago
4
sgkit-dev/sgkit #68

Add benchmark suite

We should track method performance using a benchmark suite like @alimanfoo mentioned in https://github.com/pystatgen/sgkit/pull/36#issuecomment-658893949. It would be ideal if this ran as a part …

eric-czech updated 3 years ago
6
