sockysec / Telerecon

A reconnaissance framework for researching and investigating Telegram.
828 stars 115 forks source link

Enhanced URL Scraping and File Management Features #23

Open HipShot-Critical opened 1 month ago

HipShot-Critical commented 1 month ago
  1. Added Features:

-Implemented logic to remove deduplicate URLs from scraped Telegram channels. -Introduced functionality to save URLs to a Master CSV files (total_urls.csv) in root directory to be used in other scripts. -Included capability to update a Master CSV file (total_urls.csv) with all unique URLs from multiple channels. -Added print statements to display the number of URLs scrapped, how many new URLs were added and total URLs in both channel-specific and master CSV files after each update. urlscraper py update print

  1. Error Handling:

-Improved error handling to skip invalid Telegram URLs and handle exceptions gracefully during scraping.

  1. Enhancements:

-Optimized file handling by appending new URLs to existing files and updating without duplicates.

  1. Refactoring and Cleanup:

-Removed unnecessary dependencies (like Pandas) to streamline the script.