danswer-ai / danswer

Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.
https://docs.danswer.dev/
Other
9.77k stars 1.09k forks source link

Add support for indexing GitHub code files #1650

Open artmatsak opened 2 weeks ago

artmatsak commented 2 weeks ago

This PR adds indexing of GitHub code files, taking inspiration from https://github.com/danswer-ai/danswer/pull/1586. One notable difference from the GitLab PR is that we only index GitHub code files on initial load or complete re-indexing because there's currently no obvious way to define the "last updated on GitHub" date for a repository file. (We cannot use the last commit date because there can be an arbitrary delay between a commit being made and it actually being pushed to GitHub.)

This functionality is off by default and requires the GITHUB_CONNECTOR_INCLUDE_CODE_FILES env var to be set to true.

vercel[bot] commented 2 weeks ago

@artmatsak is attempting to deploy a commit to the Danswer Team on Vercel.

A member of the Team first needs to authorize it.