sourcebot-dev / sourcebot

Blazingly fast code search 🏎️ Deployed as a single Docker image 📦 Search million+ lines of code in your GitHub, GitLab, Gitea, and Gerrit repositories 🪄 MIT licensed ✅
https://sourcebot.dev
MIT License
1.17k stars 39 forks source link

Missing search results #111

Closed KonradStanski closed 7 hours ago

KonradStanski commented 8 hours ago

The search: repo:^github\.com/torvalds/uemacs$ define https://www.sourcebot.dev/search/search?query=repo%3A%5Egithub%5C.com%2Ftorvalds%2Fuemacs%24%20define shows a bunch of results in uemacs for define, but repo:^github\.com/torvalds.* define https://www.sourcebot.dev/search/search?query=repo%3A%5Egithub%5C.com%2Ftorvalds.*%20define does not include "define" results from uemacs. The repo must be matching though, because: repo:^github\.com/torvalds.* ncol https://www.sourcebot.dev/search/search?query=repo%3A%5Egithub%5C.com%2Ftorvalds.*%20ncol does contain uemacs results.

I think this is a bug? Would be good to dig deeper.

brendan-kellam commented 7 hours ago

Looks like that search query returns so many results that the maxMatchDisplayCount limit (which defaults to 10000) is hit. If you click "(load more)", we multiply the display count by 2. For example, when the display count is 10000000, we get all of the results (including those that are in uemacs): https://www.sourcebot.dev/search/search?query=repo%3A%5Egithub%5C.com%2Ftorvalds.*%20define&maxMatchDisplayCount=10000000