-
MMA-Diffusion-NSFW-adv-prompts-benchmark: I want to use this dataset for research, in particular to see whether it can be used to generate ADV attachments, but my access request was rejected on Hugging Face.
-
It's unclear how to test datasets other than the random dataset. It would also be nice to have a flag for generating more uniform/biased data for speculative decoding models, where we would l…
-
Hi!
Thanks for your excellent work! Did you benchmark on the Mip-360 dataset? If so, could you please provide your benchmarking results? Thanks in advance!
-
### Pandas version checks
- [X] I have checked that this issue has not already been reported.
- [X] I have confirmed this issue exists on the [latest version](https://pandas.pydata.org/docs/whatsnew…
-
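## Title: Assessing AI's Potential to Assist Research (AAAR-1.0)
## Link: https://arxiv.org/abs/2410.22394
## Summary:
Numerous studies have evaluated the capabilities of AI systems, large language models (LLMs) in particular, and explored their use in everyday tasks such as writing emails, answering questions, and generating creative content. However, as for researchers themselves, brainstorming research ideas…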
-
Hi.
I am unable to reproduce the benchmark results in the paper for the test split of `distil-whisper/tedlium` using the model `distil-whisper/distil-large-v2` with `run_eval.py`. However, I am able…
-
- Update the data/queries folder each time we upsert a collection of documents.
- Add code that handles documents missed during upsertion (some documents occasionally escape the upsert).
- using …
-
The provided datasets come in four variants, each serving a specific purpose, and contain a `text_description` as described below. E.g., gov:
1. **syntheticDocQA_government_reports_test** – **No text_des…
-
Hello,
I used BRAKER3 with default parameters to annotate 3 anemone genomes, and my BUSCO scores were lower than in my genome, so I ran it again using the --busco_lineages option and it solved…
-
Hey folks!
I saw that someone asked the same question yesterday on the mailing list, but nobody has answered, so I thought I'd bring it here since I'm running into the same issue.
When I try to run …