am9zZWY / TueR

TüR 🚪 – Tübingen Retrieval. The search engine that opens doors to Tübingen.
3 stars 2 forks source link

Crawler #32

Closed okihnjo closed 1 month ago

okihnjo commented 1 month ago
am9zZWY commented 1 month ago
  • [ ] Ignore hidden stuff (hidden tags)

I looked for solutions but it's kind of difficult to achieve that because I would have to check the CSS-classes of a tag or the style itself which is unfeasible in this amount of time. What I did instead was to look only at specific tags, such as <title/>, <p/>, etc...