Open YunBAI-PSL opened 2 years ago
Hi, thank you for your interests.
Did you change the time range setting in the settings/*.cfg files? Also, you may also need to set a larger sleep time because frequent visits to nytimes from the same IP may trigger their reCAPTCHA verification.
Even if you set the date in cfg, data cannot be crawled after 2017.
Maybe name class has changed, so you can not get all link paper. You can check line 31
Just change line 31 to elements = soup.table.find_all('a')
.
Just test, it runs without problem.
Dear Author,
Thanks for your nice job. I run your codes and find there isn't news after 2017.03. But I need some recent news, how do you handle this kind of problem?
Many thanks.