stanford-futuredata / ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
MIT License
2.67k stars 355 forks source link

Irrelevant results returned by the Colbert V2 Model #308

Open ravi-kumar-1010 opened 4 months ago

ravi-kumar-1010 commented 4 months ago

I am using Colbert v2 model to index a bunch of products and their specifications and trying to build a search engine to retrieve products based upon given query. I've found that for approximately 20Million products I've added to the index. The results obtained are not accurate , often times model recognize to identify the 'Brand' of the product and produces products of a completely different brand sometimes the model completely ignores the numeric values mentioned in the query.

Please suggest what model setting while indexing or searching can directly affect the model accuracy in product retrieval. Or if there is a reranking technique that can help me in producing more relevant results.

If there is any other service that I can put on top of this model to improve the overall search?

Thanks : )

thedeepengine commented 4 months ago

Hey Ravi,

What you're up to looks interesting, but just couple of points: