For this plugin, the logic to extract agency/source/authors for the news, extractAuthors() does not consistently capture this information from the HTML content.
For example, here the source data was missed from sourceName field but is present in the extracted text body:
"sourceName": [""], "pubdate": "2021-07-18", "text": "By PTI\nNEW DELHI:
For this plugin, the logic to extract agency/source/authors for the news,
extractAuthors()
does not consistently capture this information from the HTML content. For example, here the source data was missed from sourceName field but is present in the extracted text body:"sourceName": [""], "pubdate": "2021-07-18", "text": "By PTI\nNEW DELHI: