Closed allado closed 9 years ago
it is not possible for the moment. it would be added next. pull requests are welcome
Hey Allado take a look at this other project I was working on that stays on the domain to see if you get any ideas. It is much less elegant then Hedii's project but it might give you some ideas for incorporating a "stay on domain" check-box or something like that: domainEmailCrawler
hi @allado and @jackmcdowell. check the branch https://github.com/hedii/php-crawler/tree/feature-domain-specific I think it works well, i will be happy to hear if it does :)
Hi Jack ... you are the man !
Thanks !!!
2015-09-13 16:50 GMT-03:00 hedii notifications@github.com:
hi @allado https://github.com/allado and @jackmcdowell https://github.com/jackmcdowell. check the branch https://github.com/hedii/php-crawler/tree/feature-domain-specific I think it works well, i will be happy to hear if it does :)
— Reply to this email directly or view it on GitHub https://github.com/hedii/php-crawler/issues/2#issuecomment-139912619.
Thanks @allado but it was @hedii who you should be thanking!
i close, tell me if you have any issues with the domain specific branch
Hi
How modify regex $pattern = ?
In Crawler.php to find urls without domain start
/comercios/item1.html /comercios/item1.html /casas/item1.html /casas/item2.html /casas/item3.html ..etc
You can modify it to find urls /xxxxx/xxxx ?
In the source code of the webpage:
Regards !
2015-10-10 12:13 GMT-03:00 hedii notifications@github.com:
i close, tell me if you have any issues with the domain specific branch
— Reply to this email directly or view it on GitHub https://github.com/hedii/php-crawler/issues/2#issuecomment-147096753.
Hi @allado I am working on a new version that will be able to do that. It is not possible with the current version. watch this repo to be updated when it will be released
How stay over the same 1 domain ?