AndyTheFactory / newspaper4k

📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.
MIT License
485 stars 50 forks

Cannot download the text or top image for Chinese news #274

Open AndyTheFactory opened 1 year ago

AndyTheFactory commented 1 year ago

Issue by bq-chen, Mon Feb 11 18:55:59 2019. Originally opened as https://github.com/codelucas/newspaper/issues/676


I tried a Chinese news link, but I get nothing for top_image or even the text of the article. Could you please advise what might have gone wrong and how to fix it? Does your API actually work for Chinese news?

>>> from newspaper import Article
>>> url = "https://www.toutiaoabc.com/index.php?app=news&act=view&nid=697665"
>>> article = Article(url)
>>> article.top_image
''
>>> article.text
''
>>> article
<newspaper.article.Article object at 0x106b90978>
>>> article.publish_date
''
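
Note that the session above never calls `download()` or `parse()`. In newspaper, constructing an `Article` does not fetch anything, so empty `text`, `top_image`, and `publish_date` are expected at that point. Below is a minimal sketch of the usual sequence, assuming `language="zh"` is the right setting for this site. The helper name `fetch_article` and its `article_cls` parameter are hypothetical and exist only for illustration; whether extraction then succeeds on this particular page is not confirmed here.

```python
def fetch_article(url, language="zh", article_cls=None):
    """Download and parse one article, returning the fields from the report.

    `article_cls` is a hypothetical injection point (not part of newspaper);
    it defaults to newspaper's own Article class.
    """
    if article_cls is None:
        # Imported lazily so the helper can also be exercised with a stub class.
        from newspaper import Article as article_cls

    article = article_cls(url, language=language)
    article.download()  # fetches the HTML; the constructor alone fetches nothing
    article.parse()     # populates text, top_image, publish_date

    return {
        "text": article.text,
        "top_image": article.top_image,
        "publish_date": article.publish_date,
    }
```

Calling `fetch_article("https://www.toutiaoabc.com/index.php?app=news&act=view&nid=697665")` would then show whether the fields are still empty after a real download and parse, which separates a usage problem from an extraction problem.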
AndyTheFactory commented 12 months ago

https://www.toutiaoabc.com/index.php?app=news&act=view&nid=1191487