Open johngian opened 5 years ago
Currently we concatenate all the issue titles and bodies into a single corpus and tokenize our input based on that. I think it might improve our performance if we add NLP preprocessing to our data processing pipeline.
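To make the idea concrete, here is a minimal sketch of what such a preprocessing step could look like. It uses only the Python standard library and is not the project's actual pipeline: the function names `naive_tokens` and `preprocess`, the stop-word list, and the sample issue are all hypothetical. A real implementation would probably use an NLP library for stop words and stemming or lemmatization.

```python
import re

# Hypothetical, deliberately tiny stop-word list, for illustration only.
STOP_WORDS = {"a", "an", "the", "is", "are", "to", "of", "and", "in", "on", "it", "when", "i"}

# Matches runs of lowercase letters and digits; punctuation is dropped.
TOKEN_RE = re.compile(r"[a-z0-9]+")


def naive_tokens(title, body):
    """Rough stand-in for the current approach: join title and body, split on whitespace."""
    return f"{title} {body}".split()


def preprocess(title, body):
    """Lowercase the text, strip punctuation, and drop stop words."""
    text = f"{title} {body}".lower()
    return [tok for tok in TOKEN_RE.findall(text) if tok not in STOP_WORDS]


# Hypothetical sample issue.
issue = {"title": "Crash when saving", "body": "The app crashes when I save a file."}

print(naive_tokens(issue["title"], issue["body"]))
print(preprocess(issue["title"], issue["body"]))
```

With the naive split, `The` and `the`, or `file.` and `file`, end up as different tokens, and common words like `when` take up vocabulary space. The preprocessed version avoids both, which is the kind of gain I would expect here, though whether it actually improves our results would need to be measured.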