embeddings-benchmark / mteb

MTEB: Massive Text Embedding Benchmark
https://arxiv.org/abs/2210.07316
Apache License 2.0
1.8k stars 240 forks source link

Adding Public Health QA dataset #741

Closed xhluca closed 4 months ago

xhluca commented 4 months ago

I plan to contribute a public health-related dataset sourced from COVID-19 related question and answer pairs from major public health authorities. Before opening the PR, I had a few questions:

Thanks!

imenelydiaker commented 4 months ago

Hello,

We'd love to have medical datasets as we're handling multiple domains!

For your questions:

xhluca commented 4 months ago
xhluca commented 4 months ago

I've added the dataset in #750