erickutcher / httpdownloader

HTTP(S) download manager that uses input/output completion ports (IOCP).
https://erickutcher.github.io/#HTTP_Downloader
550 stars 62 forks source link

[request] html link extraction with regex #140

Open azev opened 4 years ago

azev commented 4 years ago

I'd like to request a feature to extract links from html and json files (or any other text file). I have a lot of json and html files with urls inside and it would be great if HTTP downloader had an option to extract links from files using provided regex.

erickutcher commented 4 years ago

You should be able to drag the content of an HTML page into the main window, Add URL(s) window, or URL drop window and it'll parse all of the anchor tag links out of it. Ctrl + A the webpage and then drag the selection into the program.

As for JSON files, you're probably better off using a third party program to extract them. There's sites online that'll extract links from generic text files.