scrapy / scrapely

A pure-python HTML screen-scraping library
1.86k stars 272 forks source link

Interest in other wrapper induction techniques? #115

Open fgregg opened 5 years ago

fgregg commented 5 years ago

Hi all,

I'm sorry if this is not the right place for this discussion. If there is a more appropriate forum, I'd be happy to move over there.

I've been digging into the wrapper induction literature, and have really appreciated the work that y'all have done with this library and pydepta and mdr.

I'd like to build a library using the ideas from the Trinity paper or @AdiOmari's SYNTHIA approach.

It does not seem like your wrapper induction libraries are currently a very active area of interest, but I wanted to know if these would be of interest to y'all (or other methods)?