-
Provide automated load testing to benchmark performance of intercepting.
We can use Artillery, Azure Load Testing, etc.
-
Great job on DAX studio, invaluable tool.
tldr: Can I programmatically input a DAX query and obtain server timings?
My setup and idea: I have a report with many pages. I would like to export the…
-
flash attention benchmark fails with [changes](https://github.com/intel/intel-xpu-backend-for-triton/pull/1905) to use upstream pytorch.
It should be a torch issue.
```
Traceback (most recent c…
-
Hi there, when trying to bench from linux host the cross compilation I'm getting the following. Any suggestions around cross-compiling and testing?
```
$ zig build bench -fwine -Dtarget=x86_64-win…
-
Title.
Benchmarks:
Evaluation Methodologies
- [x] G-Eval
- [ ] QAG
Coding Ability
- [ ] Aider Benchmark
- [ ] CodeMMLU
Confabulation Rate
- [ ] TruthfulQA
- [ ] f
Context Length
…
-
The benchmark testing script just shows the QPS or latency comparison without accuracy testing. Does turbo have the method for accuracy testing or some suggestion to do this? Thanks.
-
Zephyr CI testing reports a build error b_u585i_iot02a/stm32u585xx/ns tests/benchmarks/sched_queues
when building the tests/benchmarks/sched_queues on the stm32u585 disco kit
(during test campaign…
-
The Non-Functional Requirements (NFRs) could be more clearly scoped as currently, it is difficult to determine when they have been fully met. Some examples are listed below.
![Screenshot 2024-11-15 a…
-
(llm) test@test-Z590-VISION-D:~/ipexllm_whowhat/ipex-llm/python/llm/dev/benchmark/harness$ python run_llb.py --model ipex-llm --pretrained /home/test/models/LLM/haiyan-scp/chatglm3-6b/pytorch --precis…
-
**Is your feature request related to a problem? Please describe.**
Benchmarking certain FHIR Features in instrumented tests and/or benchmark modules is quite difficult. The team could use an app to c…