SeanLee97 / AnglE

Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
https://arxiv.org/abs/2309.12871
MIT License
454 stars, 32 forks

Training data for UAE-Large-V1 #15

memray closed 9 months ago

memray commented 10 months ago

Hi,

Awesome work! Can you share the details about what data was used for adapting WhereIsAI/UAE-Large-V1 from BGE-large? Can you share the data as well?

Thanks!

SeanLee97 commented 9 months ago

Hi @memray, many thanks for following our work!

We're sorry for the inconvenience of not having published our training details yet. The training data used for fine-tuning UAE is shown below.

[Image: training data used for fine-tuning UAE]

We are now working on next-generation sentence embeddings. Once we release the new sentence embedding model, we will open-source the training details for UAE.