Closed Gopu2001 closed 2 years ago
Original issue text: Research methods of checking whether an extracted recommendation is actually a valid recommendation (e.g., if I extract "CLion", how do I check that it's a valid response to "best C++ IDEs reddit", i.e., that it is a C++ IDE) (pointers: perhaps google "
Basic solution: run a Google search with quotation marks around the product name and check whether there is more than 1 mention on the Internet
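A minimal sketch of that check, assuming the Google Custom Search JSON API is used for the lookup. The function names, the `min_mentions` threshold, and the placeholder key/engine ID are illustrative assumptions, not code from this repo. The sketch only builds the request URL and interprets an already-fetched JSON response, so the network call itself is left to the caller.

```python
from urllib.parse import urlencode

# Public endpoint of the Custom Search JSON API.
CSE_ENDPOINT = "https://www.googleapis.com/customsearch/v1"


def build_cse_url(product: str, api_key: str, cx: str) -> str:
    """Build a request URL that searches for the exact product name."""
    # Wrapping the name in quotation marks asks for an exact-phrase match.
    query = f'"{product.strip()}"'
    params = {"key": api_key, "cx": cx, "q": query}
    return f"{CSE_ENDPOINT}?{urlencode(params)}"


def mention_count(response: dict) -> int:
    """Read the estimated hit count, which the API reports as a string."""
    info = response.get("searchInformation", {})
    return int(info.get("totalResults", "0"))


def looks_valid(response: dict, min_mentions: int = 2) -> bool:
    """Hypothetical rule: 'more than 1 mention' means at least 2 hits."""
    return mention_count(response) >= min_mentions
```

For example, `build_cse_url("CLion", "KEY", "CX")` yields a URL whose query is `"CLion"` in quotes, and `looks_valid` can be applied to the decoded JSON body of the response.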
This is a great idea! Let's talk with the team on Sunday about it and perhaps implement it soon
Keep in mind that with each of these ideas, there are pros and cons. My belief is that, to decrease the number of false positives in our output model, we need to cross-reference using at least 2 of these ideas.
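One way to express the "at least 2 ideas must agree" rule is a small voting helper. This is only a sketch: `passes_cross_reference` and the `required` parameter are hypothetical names, and each check is assumed to be a function that takes the candidate string and returns `True` or `False`.

```python
from typing import Callable, Iterable


def passes_cross_reference(
    candidate: str,
    checks: Iterable[Callable[[str], bool]],
    required: int = 2,
) -> bool:
    """Accept the candidate only if at least `required` checks agree."""
    passed = sum(1 for check in checks if check(candidate))
    return passed >= required
```

Requiring agreement between independent checks trades some recall for fewer false positives, which matches the goal stated above.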
For our minimum viable product, we can simply use the CSE (Google Custom Search Engine) idea. To go a level beyond that, we should also check whether the same product/recommendation has been recommended or mentioned elsewhere in the same comment thread/discussion. This MVP should suffice for the short term but, without a larger and more complex structure for our database and caching, long processing wait times could cause the product to decline early
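The thread-level check could look roughly like the sketch below. It assumes the thread is available as a plain list of comment strings; the function names and the `min_comments` threshold are illustrative, not taken from the existing code. Lookarounds are used instead of `\b` so that names ending in symbols, such as "C++", still match.

```python
import re
from typing import Iterable


def count_thread_mentions(product: str, comments: Iterable[str]) -> int:
    """Count how many comments mention the product as a standalone term."""
    # re.escape handles names with regex metacharacters such as "C++".
    pattern = re.compile(
        rf"(?<!\w){re.escape(product)}(?!\w)", re.IGNORECASE
    )
    return sum(1 for comment in comments if pattern.search(comment))


def recommended_elsewhere(
    product: str, comments: Iterable[str], min_comments: int = 2
) -> bool:
    """Hypothetical rule: the name must appear in at least two comments."""
    return count_thread_mentions(product, comments) >= min_comments
```

Counting comments rather than raw occurrences keeps one enthusiastic commenter from validating a product on their own.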
Thus far, only point 3 of the checklist above has been implemented. This is reflected in the changes under AnmolStuff. Before this can be marked as fully completed, I feel that this part needs testing (i.e., someone else needs to confirm that I have not missed a test case for bad input). As such, I will label this issue with a "further testing needed" label.
Will leave this issue closed because the base code has been implemented.
There are some updates/upgrades that could be made to the code files (i.e., additional features), but I might only add them as needed or as time permits
There are several ways we can go about this: