Scraping articles from a news source filtered by a search term

We invite you to try the latest release of Rcrawler, v0.1.3 (just uploaded to CRAN). This release adds:

support for HTTPS websites
scraping a website by search term (keyword matching)
fewer errors during the crawling process

Rcrawler(Website = "http://www.example.com/", KeywordsFilter = c("keyword1", "keyword2"))

Crawl the website and collect only webpages containing keyword1 or keyword2 or both.

Rcrawler(Website = "http://www.example.com/", KeywordsFilter = c("keyword1", "keyword2"),
         KeywordsAccuracy = 50)

Crawl the website and collect only webpages that have an accuracy score higher than 50% for matching keyword1 and keyword2. You can use one or more search terms; the accuracy is calculated from how many of the keywords appear on the page and how often they occur.
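
To give a rough feel for what "how many keywords are on the page plus their occurrence" can mean, here is a minimal base R sketch. It is not Rcrawler's actual scoring code: the helper name keyword_score and the "coverage" measure are assumptions made up for illustration, and the package may weight things differently.

```r
# Hypothetical helper, NOT part of Rcrawler: reports how often each keyword
# occurs in a page's text and what share of the keywords appears at all.
keyword_score <- function(text, keywords) {
  text <- tolower(text)
  # Count case-insensitive, fixed-string occurrences of each keyword.
  counts <- vapply(keywords, function(k) {
    hits <- gregexpr(tolower(k), text, fixed = TRUE)[[1]]
    if (hits[1] == -1L) 0L else length(hits)
  }, integer(1))
  # Assumed measure: percentage of keywords found at least once.
  coverage <- 100 * mean(counts > 0)
  list(counts = counts, coverage = coverage)
}

page <- "Keyword1 appears here, and keyword1 appears again."
res <- keyword_score(page, c("keyword1", "keyword2"))
print(res$counts)
print(res$coverage)
```

A page like the one above mentions only one of the two keywords, so a filter built on this kind of measure would treat it as a partial match. How Rcrawler combines keyword presence and occurrence counts into its percentage is best checked in the package documentation.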

Looking forward to your feedback.
