dalab / web2text

Source code for the paper "Web2Text: Deep Structured Boilerplate Removal", full paper @ ECIR'18
MIT License

Testing on other datasets (e.g. Dragnet) #5

Closed by mtlive 4 years ago

mtlive commented 5 years ago

Is it possible to test your approach on newer datasets such as Dragnet? CleanEval is really old and doesn't reflect modern website designs.

tvogels commented 4 years ago

Hi @mtlive, sorry for the late response, and thanks for the suggestion. I agree that it would be nice to have evaluations on Dragnet. If you happen to have run such an evaluation, I would be curious about the results.