tosdr / tosback2

Reimplementing TOSBack with Ruby and using git to see TOS changes!
http://tosback.org
GNU General Public License v2.0
125 stars 32 forks source link

add getpocket.com xml and crawl #22

Closed igalic closed 6 years ago

JimmStout commented 10 years ago

I didn't test it, but I think you can just add this xpath to get rid of the header links and sign in/out stuff we don't want to track:

<url name="https://getpocket.com/privacy" xpath="//div[@class='legacy-content']">

If you resubmit that, I'll merge it in!

igalic commented 10 years ago

ack, will try! Thanks for the feedback

pierreozoux commented 8 years ago

@igalic did you try the suggestion? (Just going though all our PRs and issues to reopen or close oldies)

igalic commented 8 years ago

@pierreozoux updated!