Closed NovaCygni closed 8 years ago
SEs will be updated for 4.0.2. Got a awesome method ive thought of to run each SE's checks as individual Asynchronous processes, though may still yet use Twister or another method, whichever works the way intended and provides the speed gain im after.
i would use scrapy in async mode.
Yeh ive been looking at that as a option along with using Tornado and Asyncio, alas on my own theres a ton of work still to be done and this aspect is currently taking a backseat as I wish to improve the FTP Crawler first so the Metasploit Vulns are matched to detected headers... When thats done my next task is to nerf the 18.5k d0rk list down to 7.8k of SQLi dorks, 2.2k CCTV Dorks (* with a potential avenue for CCTV attacks and so forth *) along with a "Shell Sweeper" to look for known shells.... If your willing to help out on any aspect of the program feel free to join me :)
Jeah i will try to improve some stuff and make it more smooth.
some interesting links: https://github.com/danmcinerney/xsscrapy https://github.com/titantse/seCrawler
Btw the long elif chain was intentional I had planned to add checks for known exploits against x target, in the end I got distracted by other projects and never got round to it yet, The search engines were removed down to 1 atm as I was playing around with yahoos flaws in blocking suspicious traffic, it appears upon us.search.yahoo blocking me you can simply change the "us" part for gb, fr, de or whatever and itll work. Cheers for the seCrawler link gonna pull it to Pycharm now and give it a peek.
before i forget: http://blog.florian-hopf.de/2014/07/scrapy-and-elasticsearch.html some scrapy stuff for piplining the results to db / elasticsearch, may use @ kibana for smooth web ui. like this: https://raw.githubusercontent.com/TiNico22/ELKconfig/master/CISCO-ASA/Dashboard-ASA.png
d4op Kibana+Marvell looks awesome... easy enough to implement and pythonise aswell.
NovaCygni jeah looks really nice, may give it a shot. already thought a bit about your TLD.yahoo.com idea. you are generating a sitesarray, may push more target TLDs to the sitearray and do it like for site in sitesarray => http:// + site + .yahoo.com :p so the results will be better and its rotating.
After looking at yahoo and comparing results im not sure if its doing filtering of the results itself (* Removing some of the results and replacing with domains sponsored version of the link *), bing compared to yahoo results in 20% more "Hits" with 20% less d0rks used!, depending on how I can get ES to use other engines it wont matter anyway, sadly my TKinter learning began only a month ago so the Gui interface will be fugly.... really fugly!. Cant thank you enough though think ima going to love ES and Marvel :)
Issues been nailed, first stage of implementing Asyncio is complete already pulling bulk pages of requests in one go, vastly improved speed, will use same method to run checks with other search engines as more 'future' processes.
Only last called SE is doing the checks, AND seems not to produce the result expected, look into better nesting of SEs so each dork is run against each SE and THEN += 1 the increment for next d0rk.