Tjatse / node-readability

Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English.
341 stars 36 forks source link

Filter children of `topCandidate` if choose the parent of it as Article Object. #16

Closed Tjatse closed 8 years ago

Tjatse commented 8 years ago

Currently extracting topCandidate.parent().html() as art.content directly (html output). Somehow topCandiate.parent().children() should be filtered too.