Closed Victor239 closed 3 years ago
Try this:
{
"url": "https://palewebserial.wordpress.com/table-of-contents/",
"title": "Pale",
"author": "Wildbow",
"chapter_selector": "article .entry-content > p a",
"content_selector": "article .entry-content",
"filter_selector": ".sharedaddy, style, a[href*='palewebserial.wordpress.com']"
}
That works great, thank you!
Trying to extract Pale, but it fails after trying to extract social media links for Twitter and Facebook, followed by a "None" page:
I followed the example here to try and exclude "None" with
"chapter_selector": "#main .entry-content > p > a:not([href*=None])",
but it skipped 95% of existing chapters that way.