hedii / php-crawler

A php crawler that finds emails on the internets
MIT License
134 stars 65 forks source link

Own logic #11

Closed tpoxa closed 8 years ago

tpoxa commented 8 years ago

How can add own parse logic into crawler? For example if I want to parse names also.

btw great tool!

hedii commented 8 years ago

The crawler uses a package i wrote called "Extractors" to extract resources from web pages (email and urls currently): https://github.com/hedii/extractors

You may find a way to extract names, write the logic and make a pull request to hedii/extractors. When it's done i can help to implement the 'names' option on the crawler.