sourcebot-dev / sourcebot

Blazingly fast code search 🏎️ Deployed as a single Docker image 📦 Search million+ lines of code in your GitHub, GitLab, Gitea, and Gerrit repositories 🪄 MIT licensed ✅
https://sourcebot.dev
MIT License
1.17k stars 40 forks source link

Missing search results #111

Closed KonradStanski closed 18 hours ago

KonradStanski commented 20 hours ago

The search: repo:^github\.com/torvalds/uemacs$ define https://www.sourcebot.dev/search/search?query=repo%3A%5Egithub%5C.com%2Ftorvalds%2Fuemacs%24%20define shows a bunch of results in uemacs for define, but repo:^github\.com/torvalds.* define https://www.sourcebot.dev/search/search?query=repo%3A%5Egithub%5C.com%2Ftorvalds.*%20define does not include "define" results from uemacs. The repo must be matching though, because: repo:^github\.com/torvalds.* ncol https://www.sourcebot.dev/search/search?query=repo%3A%5Egithub%5C.com%2Ftorvalds.*%20ncol does contain uemacs results.

I think this is a bug? Would be good to dig deeper.

brendan-kellam commented 18 hours ago

Looks like that search query returns so many results that the maxMatchDisplayCount limit (which defaults to 10000) is hit. If you click "(load more)", we multiply the display count by 2. For example, when the display count is 10000000, we get all of the results (including those that are in uemacs): https://www.sourcebot.dev/search/search?query=repo%3A%5Egithub%5C.com%2Ftorvalds.*%20define&maxMatchDisplayCount=10000000