Purpose of the Tool is to aid in generating Dataset with 2 use cases:
Code Snippets with Leaked / Exposed Secret credentials inside.
Code Snippets clean with no Leaked or Exposed Secret Credentials inside.
This dataset is currently hard to come by and will be made available for researchers.
This tools, uses the GitHub API to Hunt for Repositories that have a hit on suspicious queries, clones them, runs truffleghog3 on top of it, and produces the snippets. Snippets are 10 lines long with Secret, or Clean samples from the same repositories.
A metadata file is appended to the result file with data on the findings from trufflehog.
Purpose of the Tool is to aid in generating Dataset with 2 use cases:
This dataset is currently hard to come by and will be made available for researchers.
This tools, uses the GitHub API to Hunt for Repositories that have a hit on suspicious queries, clones them, runs truffleghog3 on top of it, and produces the snippets. Snippets are 10 lines long with Secret, or Clean samples from the same repositories.
A metadata file is appended to the result file with data on the findings from trufflehog.