Open ethan-hunt-007 opened 9 years ago
@ethan-hunt-007 Forbes is largely incompatible with text extractors like goose, newspaper, etc., because their current site uses JavaScript to render most of the webpage. That means you'd have to render the page in a headless browser of some sort, let the JS run, and then extract the text / data. That's a lot more work and probably wouldn't be terribly performant.
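If you want to try the headless-browser route anyway, here is a rough sketch of the render-then-extract idea. It assumes Playwright (with Chromium installed) as the headless browser, which is my pick and not something goose or newspaper provide. The names `render_html` and `visible_text` are made up for illustration, and the stdlib-based text collector is only a crude stand-in for a real article extractor.

```python
from html.parser import HTMLParser


def render_html(url: str, timeout_ms: int = 30000) -> str:
    """Load `url` in headless Chromium and return the DOM after scripts have run."""
    # Assumption: Playwright is installed. It is imported lazily so the
    # text-extraction helper below works without it.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        try:
            page = browser.new_page()
            # "networkidle" waits until network activity settles, which gives
            # the page's JavaScript a chance to fill in the content.
            page.goto(url, wait_until="networkidle", timeout=timeout_ms)
            return page.content()
        finally:
            browser.close()


class _TextCollector(HTMLParser):
    """Collects text nodes, skipping anything inside script/style/noscript."""

    SKIP = {"script", "style", "noscript"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if self._skip_depth == 0 and text:
            self.chunks.append(text)


def visible_text(html: str) -> str:
    """Return the non-script text of `html`, one text node per line."""
    parser = _TextCollector()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)
```

You would call it as `visible_text(render_html(article_url))`, where `article_url` is whatever link you are scraping. Expect each page to take seconds rather than milliseconds, since a full browser has to start and wait for the scripts to finish.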
When extracting from Forbes.com, I am not getting the data I need, and in many cases I get unnecessary data instead. Here is the code:
I am getting the same text for many links of this type. What could be the issue, and how can I correct it?