cncf-tags / cloud-native-ai

https://cncf-tags.github.io/cloud-native-ai/
4 stars 3 forks source link

Create Semantic Ranking for Title, Summary, Keywords, Title and Conference Name Searching #30

Closed nbcstevenchen closed 3 weeks ago

nbcstevenchen commented 3 weeks ago

Implemented 2 methods for semantic ranking.

BM25 algorithm

Sentence BERT Bi-Encoder

I have done some tests on the 600 samples that we extracted. Both methods works well.

rootfs commented 3 weeks ago

@nbcstevenchen this is very cool! Do you need GPU to run the similarity search? Can it be done on github runner?

nbcstevenchen commented 3 weeks ago

@nbcstevenchen this is very cool! Do you need GPU to run the similarity search? Can it be done on github runner?

I don't think GPU is necessary. The Bi-Encoder is specifically designed to facilitate faster inference speeds in applications by enabling the pre-generation of embeddings. BM25 also works well. We can use this one first for the demo.