ponder-lab / GitHub-Issue-Classifier

Python script to mine for GitHub issues + comments and classify them.
MIT License
6 stars 0 forks source link

Filter our search request to closely match our results #6

Closed y3pio closed 3 years ago

y3pio commented 3 years ago

Currently, GitHub's search query results doesn't properly consider dot operators in the query string. Our current query of tf.function returns results that may include tf.function or tf or function or any other issues that GitHub's search algorithm feel that it matches our query.

Acceptance Criteria:

khatchad commented 3 years ago

I think that this post-processing step should already be in @tatianacv's script, right?

y3pio commented 3 years ago

Yup, there's a check for tf.function in the body or title for each of the results:

"tf.function" in json_data["body"]) or ("tf.function" in json_data["title"]

Will add this in and also look into this a bit further to see if there's anything else on the search results that we can use to further help narrow our results to more closely match our query.