xdvom03 / klaus

Bayesian text classification of websites in a nested class system
Creative Commons Zero v1.0 Universal

Create URL verifier #84

Closed by xdvom03 3 years ago

xdvom03 commented 3 years ago

On redownload, a few sites turned out to have gone unreachable. That is a problem if I want to bundle a useful classification with the program, so write a piece of code that checks that all servers are still reachable and lets me resolve any errors or changes in text content.
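A minimal sketch of what such a verifier could look like, not the project's actual code. It assumes the stored classification can be reduced to a mapping from URL to a SHA-256 digest of the text seen at the last download; the names `fetch_text`, `digest`, and `verify_urls` are hypothetical, and Python is used only for illustration.

```python
import hashlib
import urllib.request
from typing import Callable, Dict


def fetch_text(url: str, timeout: float = 10.0) -> str:
    """Download a page body as text; raises on network or HTTP errors."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


def digest(text: str) -> str:
    """Fingerprint of page text, used to detect content changes."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def verify_urls(
    stored: Dict[str, str],
    fetch: Callable[[str], str] = fetch_text,
) -> Dict[str, str]:
    """Map each URL to 'ok', 'changed', or 'unreachable'.

    `stored` maps URL -> digest recorded at the last download (assumed format).
    `fetch` is injectable so the check can be exercised without a network.
    """
    report = {}
    for url, old_digest in stored.items():
        try:
            text = fetch(url)
        except (OSError, ValueError):
            # URLError, HTTPError and timeouts are OSError subclasses;
            # ValueError covers malformed URLs.
            report[url] = "unreachable"
            continue
        report[url] = "ok" if digest(text) == old_digest else "changed"
    return report
```

Anything reported as `unreachable` or `changed` would then be handed to the user to resolve by hand. Comparing digests of the raw body flags every change, including trivial ones such as rotating ads, so a real version would probably compare the extracted text instead.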

xdvom03 commented 3 years ago

Primary use case superseded by #85.

xdvom03 commented 3 years ago

Yeah. There is no good reason to send somebody a set of classes and have them download the pages themselves, unless you want to avoid sending the corpus over, and so far the corpus is not large enough for that to matter (its size is ~10% of the executable size).