after loading the reference list

lindsey98 / Phishpedia

Official Implementation of "Phishpedia: A Hybrid Deep Learning Based Approach to Visually Identify Phishing Webpages" USENIX'21


after loading the reference list #31

Open Varun-Mayilvaganan opened 3 weeks ago

Varun-Mayilvaganan commented 3 weeks ago

[Attached images: Screenshot (38), Screenshot (39)]

lindsey98 commented 3 weeks ago

Hi, you need to move everything under expand_targetlist/expand_targetlist to expand_targetlist/, so that there are no nested directories.
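The move described above can also be scripted. This is a minimal sketch, not part of Phishpedia: `flatten_nested_dir` is a hypothetical helper, and the path passed to it is an assumption that you should adjust to wherever your expand_targetlist folder lives.

```python
import shutil
from pathlib import Path


def flatten_nested_dir(outer: Path) -> list[str]:
    """Move everything in outer/<outer.name>/ up into outer/.

    Returns the names of the moved entries, or an empty list when
    there is no nested folder of the same name.
    """
    inner = outer / outer.name
    if not inner.is_dir():
        return []  # nothing is nested, so there is nothing to do

    moved = []
    for item in sorted(inner.iterdir()):
        target = outer / item.name
        if target.exists():
            # Refuse to overwrite; the user should resolve clashes by hand.
            raise FileExistsError(f"{target} already exists")
        shutil.move(str(item), str(target))
        moved.append(item.name)

    inner.rmdir()  # the inner folder is empty now
    return moved


# Example call (the path is an assumption about your checkout):
# flatten_nested_dir(Path("expand_targetlist"))
```

The helper stops with an error instead of overwriting when a brand folder already exists at the top level, so a half-finished manual move is not silently clobbered.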

Varun-Mayilvaganan commented 3 weeks ago

It now runs without errors, but the detection is not working: I tested 6 fake websites and their URLs, and all of them are reported as legitimate.

[Attached image: WhatsApp Image 2024-08-23 at 12 30 45 AM (1)]

lindsey98 commented 2 weeks ago

It seems that the logo matching confidence (0.64, 0.65 ...) is well below the threshold. I suspect the top-1 logo reported is not the identity logo. You can try changing https://github.com/lindsey98/Phishpedia/blob/main/phishpedia.py#L102 to set, for example, top_k=3, so that the top-3 logos are checked instead of only the first one.
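To illustrate why a larger top_k can matter, here is a generic sketch of the idea. It is not the code in phishpedia.py: `match_top_k`, the toy feature vectors, and the 0.8 threshold are all hypothetical, and it assumes that top_k limits how many detected logo candidates are compared against the reference list.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def match_top_k(candidates, references, threshold, top_k=1):
    """Try the first top_k logo candidates against every reference logo.

    candidates: feature vectors, ordered by detector confidence.
    references: dict mapping brand name -> list of feature vectors.
    Returns (brand, similarity) for the first candidate whose best match
    reaches the threshold, else (None, best similarity seen).
    """
    best_seen = 0.0
    for feat in candidates[:top_k]:
        brand, sim = max(
            ((b, cosine(feat, r)) for b, refs in references.items() for r in refs),
            key=lambda pair: pair[1],
            default=(None, 0.0),
        )
        if sim >= threshold:
            return brand, sim
        best_seen = max(best_seen, sim)
    return None, best_seen


# Toy data: the first candidate is ambiguous, the second is a clear match.
references = {"Walmart": [[1.0, 0.0]], "Instagram": [[0.0, 1.0]]}
candidates = [[1.0, 1.0], [0.0, 2.0]]
print(match_top_k(candidates, references, threshold=0.8, top_k=1))
print(match_top_k(candidates, references, threshold=0.8, top_k=3))
```

With top_k=1 only the ambiguous first candidate is tried and nothing passes the threshold; with top_k=3 the second candidate is also tried and matches. If no candidate resembles any reference logo, raising top_k does not help.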

Varun-Mayilvaganan commented 2 weeks ago

Hi lindsey98, I tried it, but the result did not change.

lindsey98 commented 2 weeks ago

I visited some of the pages (https://intsa.vercel.app/, https://www.wal-mart.com/) and found that our reference list does not contain logos similar to the ones shown on those webpages. The Instagram phishing page does not use the official Instagram logo. On the Walmart phishing page, the logo is shown on a blue background, but our reference list only keeps the white-background version. You can do the following to expand the target list:

1. Delete LOGO_FILES.npy and LOGO_FEATS.npy.
2. Manually add similar logo images to expand_targetlist/Instagram or expand_targetlist/walmart.
3. Rerun.
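The first two steps above can be sketched as a small script. This is an illustration under assumptions, not part of Phishpedia: `reset_logo_cache` and `add_logo_variants` are hypothetical helpers, and the directory holding the two .npy files is passed in as a parameter because its location depends on your setup.

```python
import shutil
from pathlib import Path

# Cached files named in the steps above.
CACHE_FILES = ("LOGO_FILES.npy", "LOGO_FEATS.npy")


def reset_logo_cache(cache_dir: Path) -> list[str]:
    """Delete the cached logo files if present; return the names removed."""
    removed = []
    for name in CACHE_FILES:
        path = cache_dir / name
        if path.exists():
            path.unlink()
            removed.append(name)
    return removed


def add_logo_variants(targetlist_dir: Path, brand: str, images: list[Path]) -> list[Path]:
    """Copy extra logo images into an existing brand folder."""
    brand_dir = targetlist_dir / brand
    if not brand_dir.is_dir():
        raise FileNotFoundError(f"no brand folder at {brand_dir}")
    copied = []
    for src in images:
        dst = brand_dir / src.name
        if dst.exists():
            # Keep existing reference logos untouched.
            raise FileExistsError(f"{dst} already exists")
        shutil.copy2(src, dst)
        copied.append(dst)
    return copied


# Example calls (all paths and file names are assumptions):
# reset_logo_cache(Path("."))
# add_logo_variants(Path("expand_targetlist"), "walmart", [Path("walmart_blue_bg.png")])
```

After both calls, rerunning the tool is still the third step. The brand folder name must match the existing one exactly (for example `walmart` versus `Instagram`), which is why the helper refuses to create new folders.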