Open ShareDo9 opened 5 years ago
爬取不同域名这样写一直不走on_scan_page 和 on_list_page这两个方法,只走on_content_page
'domains' => array( 'zhongshang114', 'detail.zhongshang114.com' ), 'scan_urls' => array( 'http://detail.zhongshang114.com/list.php?catid=91400' ), 'list_url_regexes' => array( "http://detail.zhongshang114.com/list.php\?catid=91400\&page=\d+" // 公司列表页 ), 'content_url_regexes' => array(
// "http://detail.zhongshang114.com/list.php\?catid=91400\&page=\d+", "http://.*?.zhongshang114.com/" ),
这个content_url_regexes该怎么写呢
爬取不同域名这样写一直不走on_scan_page 和 on_list_page这两个方法,只走on_content_page
// "http://detail.zhongshang114.com/list.php\?catid=91400\&page=\d+", "http://.*?.zhongshang114.com/" ),
这个content_url_regexes该怎么写呢