scrapy / scrapely

A pure-python HTML screen-scraping library
1.86k stars 273 forks source link

allow to define field extractors over text contents only, without need to re parsing. #18

Closed kalessin closed 12 years ago

kalessin commented 12 years ago

Allow to define field extractors over text contents only, without need to re parsing. Added tests and fixed current ones, as a special flag was added on HtmlDataFragment objects

kalessin commented 12 years ago

I made the changes you suggested. please check.