sourcegraph / sourcegraph-public-snapshot

Code AI platform with Code Search & Cody
https://sourcegraph.com
Other
10.1k stars 1.27k forks source link

Context: evaluate large repos and multi-repo case #61744

Open jtibshirani opened 5 months ago

jtibshirani commented 5 months ago

When repos are very large or the context includes multiple distinct repos, it can be harder to retrieve highly relevant results. We should evaluate the multi-repo and large repo case using the "golden queries" approach.

Once we finish an initial "research spike", we'll add a list of tasks.

I suspect that good ranking is going to be critical here, given the larger and noisier result set. Ideas:

/cc @sourcegraph/search-platform

keegancsmith commented 5 months ago

The sourcegraph repo may be a good "simple" target now for multi-repo. Recently I believe we removed most of the docs directory and it is now in a docs repo. I think similiar things may also happen to our CHANGELOG.