crawlbase / proxycrawl-python

ProxyCrawl Python library for scraping and crawling
https://proxycrawl.com
Apache License 2.0
60 stars, 19 forks

How to only crawl latest post? #10

Closed: null15 closed this issue 3 years ago

null15 commented 3 years ago

Hey,

How do I get only the content of the latest post?

crawlbase commented 3 years ago

@null15 this depends on the website you are crawling. The library returns the full content of the page, and you then have to parse that content with an HTML parsing module or library. For Python, you can use html.parser from the standard library.
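As a rough illustration of that parsing step, here is a minimal sketch using html.parser. It assumes the crawled page marks each post with `<article class="post">` and lists the newest post first; both are assumptions about a hypothetical site, so the tag, class name, and ordering need to be adapted to the real markup. `PostExtractor`, `latest_post`, and `SAMPLE_HTML` are illustrative names, and the sample string stands in for the page body returned by the crawl.

```python
from html.parser import HTMLParser


class PostExtractor(HTMLParser):
    """Collect the text of every <article class="post"> element.

    The tag and class name are assumptions about the target site.
    """

    def __init__(self):
        super().__init__()
        self.posts = []    # text of each post, in page order
        self._depth = 0    # <article> nesting depth; 0 means outside a post
        self._chunks = []  # text fragments of the post being read

    def handle_starttag(self, tag, attrs):
        if tag != "article":
            return
        if self._depth:
            # Nested <article> inside a post: track it so the matching
            # end tag does not close the post early.
            self._depth += 1
        elif "post" in (dict(attrs).get("class") or "").split():
            self._depth = 1
            self._chunks = []

    def handle_endtag(self, tag):
        if tag == "article" and self._depth:
            self._depth -= 1
            if self._depth == 0:
                self.posts.append(" ".join(self._chunks))

    def handle_data(self, data):
        if self._depth and data.strip():
            self._chunks.append(data.strip())


def latest_post(html, newest_first=True):
    """Return the text of the latest post, or None if no post is found."""
    parser = PostExtractor()
    parser.feed(html)
    parser.close()
    if not parser.posts:
        return None
    return parser.posts[0] if newest_first else parser.posts[-1]


# Stand-in for the HTML body returned by the crawl (hypothetical markup).
SAMPLE_HTML = """
<html><body>
  <article class="post"><h2>Newest post</h2><p>Hello <b>world</b></p></article>
  <article class="post"><h2>Older post</h2><p>Old news</p></article>
</body></html>
"""

if __name__ == "__main__":
    print(latest_post(SAMPLE_HTML))
```

If the site lists posts oldest first, pass `newest_first=False` to take the last match instead. For messy real-world markup, a more forgiving third-party parser may be easier to work with than html.parser.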

As this is more of an issue with the website than with the library itself, I'm closing the issue. Please contact our support team directly for further assistance.