Loads the website in the background page, without a web service, and extracts the content with a selector defined on the options page (by default, the article tag).
This keeps the images in the article, along with all other content.
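
The extraction step described above could look roughly like the sketch below. It is not the code from this PR: `QueryableDocument`, `DEFAULT_SELECTOR` and `extractContent` are hypothetical names, and the sketch assumes the loaded page is already available as a parsed document exposing `querySelector`.

```typescript
// Hypothetical structural type: a browser Document, or any object
// exposing querySelector, can be passed in.
interface QueryableDocument {
  querySelector(selector: string): { innerHTML: string } | null;
}

// Default selector taken from the description: the article tag.
const DEFAULT_SELECTOR = "article";

// Returns the inner HTML of the first element matching the selector,
// or null when nothing matches. Nested markup such as <img> is returned
// unchanged, which is why images stay in the extracted content.
function extractContent(
  doc: QueryableDocument,
  selector: string = DEFAULT_SELECTOR
): string | null {
  const element = doc.querySelector(selector);
  return element ? element.innerHTML : null;
}
```

Returning `null` when the selector matches nothing leaves it to the caller to decide what to do with a page that has no matching element.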
A simple configuration sets which websites use the Page Extractor and with which selector. If a configuration exists for a website, the site is loaded in the background page.
For each site that should use the Page Extractor, enter site name = CSS selector (for example, article or .main-content).
Other sites use the other selected API (Mercury or Boilerplate).
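
As an illustration of the site name = CSS selector format, here is a minimal sketch of how such a configuration could be parsed and looked up. The function names, the one-rule-per-line layout, the host matching rule (exact host or subdomain) and the example hosts in the usage note are all assumptions, not the PR's actual behavior.

```typescript
// Parses lines of the form "site name = css selector" into a lookup table.
// Assumption: one rule per line; lines without "=" are ignored.
function parseSiteConfig(text: string): Record<string, string> {
  const config: Record<string, string> = {};
  for (const line of text.split(/\r?\n/)) {
    // Split on the first "=" only, so selectors such as a[href="x"] survive.
    const idx = line.indexOf("=");
    if (idx === -1) continue;
    const site = line.slice(0, idx).trim().toLowerCase();
    const selector = line.slice(idx + 1).trim();
    if (site && selector) config[site] = selector;
  }
  return config;
}

// Returns the selector configured for the URL's host, or null when the
// site has no rule and the other selected API should be used instead.
// Assumption: a rule for "example.com" also covers "www.example.com".
function selectorForUrl(
  url: string,
  config: Record<string, string>
): string | null {
  const host = new URL(url).hostname.toLowerCase();
  for (const site of Object.keys(config)) {
    const isSubdomain = host.slice(-site.length - 1) === "." + site;
    if (host === site || isSubdomain) return config[site];
  }
  return null;
}
```

With a configuration of `example.com = article`, a call such as `selectorForUrl("https://www.example.com/post", config)` would return `article`, while a URL on an unlisted host returns `null`, meaning the request falls through to Mercury or Boilerplate.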
This is an improvement on the ArticleExtractor Pull Request #27.