Is your feature request related to a problem? Please describe.
The ultimate aim of the project is to detect illicit websites. At present the algorithm relies only on graph knowledge to target suspicious links. More advanced techniques are needed to classify links accurately and to reduce the computational cost of crawling.
Describe the solution you'd like
Text-based classification using NLP, which would turn the crawler from a traditional naive best-first crawler into a context-focused crawler. Prioritizing links by the text around them should also make it practical to crawl at greater depths.
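
To make the idea concrete, here is a minimal sketch of a context-focused frontier, not a proposed implementation. Since the classification technique is still undecided, `relevance` is only a keyword-overlap placeholder for whatever classifier is eventually chosen. The names `SUSPICIOUS_TERMS`, `relevance`, and `Frontier` are hypothetical and do not exist in the project.

```python
import heapq
import re
from itertools import count

# Hypothetical seed vocabulary; a trained text classifier would replace this.
SUSPICIOUS_TERMS = {"counterfeit", "stolen", "carding"}


def relevance(text, terms=SUSPICIOUS_TERMS):
    """Fraction of tokens in `text` that appear in `terms` (0.0 for empty text)."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    if not tokens:
        return 0.0
    return sum(token in terms for token in tokens) / len(tokens)


class Frontier:
    """Priority queue of URLs, highest relevance score first."""

    def __init__(self):
        self._heap = []
        self._seen = set()
        self._order = count()  # tie-breaker: equal scores pop in insertion order

    def push(self, url, context_text):
        """Queue `url`, scored by the text found around the link."""
        if url in self._seen:
            return  # never queue the same URL twice
        self._seen.add(url)
        score = relevance(context_text)
        # heapq is a min-heap, so store the negated score.
        heapq.heappush(self._heap, (-score, next(self._order), url))

    def pop(self):
        """Return the (url, score) pair with the highest score."""
        neg_score, _, url = heapq.heappop(self._heap)
        return url, -neg_score

    def __len__(self):
        return len(self._heap)
```

With this ordering the crawler visits the most text-relevant links first, so a depth or page budget is spent on promising branches rather than on every link the graph exposes. A score threshold in `push` could drop low-relevance links entirely, but that cut-off would need tuning against real data.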
Describe alternatives you've considered
None so far. The specific classification technique is yet to be decided.