Multithreaded Pastebin scraper. It stores scraped pastes in a MySQL database, then reads them for noteworthy information.
Use sniff-paste.py to run the entire process of collection, logging, and harvesting automatically. Set the scraper's paste limit to 0 to scrape indefinitely. When scraping indefinitely, press Ctrl+C to stop. Any useful information will already be in the database, along with a link back to the original paste it was found in.
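The paste limit and Ctrl+C behavior described above can be pictured with a minimal sketch. This is not the code in sniff-paste.py: the function name `scrape` and the `fetch_paste` callback are hypothetical, and the real tool is multithreaded and writes to MySQL rather than a list.

```python
import itertools


def scrape(fetch_paste, limit=0):
    """Collect pastes until `limit` is reached; limit=0 means no limit.

    `fetch_paste` is a hypothetical callback that returns one paste per call.
    """
    collected = []
    # itertools.count() never ends, so limit=0 loops until interrupted.
    counter = itertools.count() if limit == 0 else range(limit)
    try:
        for index in counter:
            collected.append(fetch_paste(index))
    except KeyboardInterrupt:
        # Ctrl+C ends collection; everything gathered so far is kept.
        pass
    return collected
```

With a limit of 0 the loop only ends on Ctrl+C, so results should be saved as they arrive rather than at the end of the run.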
sudo apt install libxslt-dev python3-lxml python3-nmap xsltproc mysql-server
pip3 install -r requirements.txt
Create a database named sniff_paste in the MySQL server.
python3 sniff-paste.py
This scrapes Pastebin for the latest pastes, up to the configured paste limit, then analyzes them for IP addresses, emails, and phone numbers. It filters out duplicates and runs scans on some of the harvested data.
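As a rough illustration of the analysis step, the sketch below pulls emails, IPv4 addresses, and phone numbers out of a paste and drops duplicates. The regular expressions and the names `harvest` and `valid_ipv4` are assumptions made for this example, and the patterns the project actually uses may differ. The phone pattern only covers a simple 3-3-4 digit layout.

```python
import re

# Illustrative patterns only; they are not taken from sniff-paste.py.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
IPV4_RE = re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b")
PHONE_RE = re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b")


def valid_ipv4(candidate):
    """Reject dotted quads with an octet outside 0-255."""
    return all(0 <= int(part) <= 255 for part in candidate.split("."))


def harvest(text):
    """Return de-duplicated, sorted findings from one paste body."""
    return {
        "emails": sorted(set(EMAIL_RE.findall(text))),
        "ips": sorted({ip for ip in IPV4_RE.findall(text) if valid_ipv4(ip)}),
        "phones": sorted(set(PHONE_RE.findall(text))),
    }
```

Collecting matches into a set before sorting is one simple way to filter duplicates within a single paste. Duplicates across pastes would need a check against the database instead.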
sniff_paste
-- root db
pastes
-- stores each paste with full text, date, link, title, and language
emails
-- stores emails with a link to the source paste
links
-- stores URLs with a link to the source paste
ip
-- stores IPs with connectivity and a link to the source paste
phones
-- stores phone numbers with a link to the source paste
secrets
-- stores secret types with a link to the source paste
ports
-- stores port scan info (port, status, service, version, ip)
cryptos
-- stores cryptocurrency findings with a link to the source paste. Crypto findings are not certain to be valid; consider them low-probability findings.
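To show how a finding can be traced back to its paste, here is a small sketch of two of the tables above and a lookup query. It uses Python's built-in sqlite3 module as a stand-in for MySQL so that it runs without a server. The column names (`paste_id`, `full_text`, and so on) and the helper `source_links` are assumptions for illustration, not the project's actual schema.

```python
import sqlite3

# In-memory SQLite stands in for the sniff_paste MySQL database.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE pastes (
        id INTEGER PRIMARY KEY,
        title TEXT,
        language TEXT,
        date TEXT,
        link TEXT,
        full_text TEXT
    );
    CREATE TABLE emails (
        id INTEGER PRIMARY KEY,
        email TEXT,
        paste_id INTEGER REFERENCES pastes(id)  -- assumed link to the paste
    );
    """
)


def source_links(connection, email):
    """Return the links of every paste in which `email` was found."""
    rows = connection.execute(
        "SELECT p.link FROM emails e "
        "JOIN pastes p ON p.id = e.paste_id "
        "WHERE e.email = ?",
        (email,),
    )
    return [row[0] for row in rows]
```

The other finding tables (links, ip, phones, secrets, cryptos) could follow the same pattern, each row carrying a reference to the paste it came from.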