codelucas / newspaper

newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:
https://goo.gl/VX41yK
MIT License
13.89k stars 2.1k forks source link

suggestions for calculate_best_node #965

Open shewenkan opened 1 year ago

shewenkan commented 1 year ago

the score of the grandpa node should not be discounted if it has son text_node with better score,for example: https://indianexpress.com/article/opinion/columns/c-raja-mohan-writes-cooperation-amid-conflict-is-indias-burden-for-g20-8472106/ the correct top_node is the div id="pcl-full-content" not div class="ev-meter-content"