Cannot extract main text

fhamborg / news-please

news-please - an integrated web crawler and information extractor for news that just works

Apache License 2.0

Cannot extract main text #253

Closed simplew2011 closed 7 months ago

simplew2011 commented 8 months ago

Describe the bug

To Reproduce

from newsplease import NewsPlease

# Download and parse the article
article = NewsPlease.from_url('http://www.news.cn/world/2023-10/29/c_1212295036.htm')
print(article)

# Inspect the extracted fields as a plain dictionary
aa = article.get_serializable_dict()
print(aa)

The main text of the article cannot be extracted.
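
To make a report like this easier to act on, it may help to state exactly which fields came back empty. The following is only a rough sketch: `empty_fields` is a hypothetical helper, not part of news-please, and it assumes the dictionary returned by `get_serializable_dict()` uses the keys `title` and `maintext`. The sample dictionary is made up for illustration and is not the actual output for the URL above.

```python
def empty_fields(article_dict, fields=("title", "maintext")):
    """Return the names of the requested fields that are missing or blank.

    Hypothetical helper; the default field names are an assumption about
    the keys in the dictionary produced by get_serializable_dict().
    """
    missing = []
    for name in fields:
        value = article_dict.get(name)
        # Treat None, empty strings, and whitespace-only strings as empty
        if value is None or not str(value).strip():
            missing.append(name)
    return missing


# Made-up sample standing in for article.get_serializable_dict()
sample = {"title": "Example headline", "maintext": None}
print(empty_fields(sample))
```

With the reproduction script above, the same check could be run as `empty_fields(aa)` and the result pasted into the report.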

fhamborg commented 7 months ago

Closing, since the bug report template was not used.