fkie-cad / COMIDDS

A comprehensive survey of datasets for research in host-based and/or network-based intrusion detection, with a focus on enterprise networks
MIT License
29 stars 4 forks source link

Automate health checks for URLs #56

Open Maspital opened 3 months ago

Maspital commented 3 months ago

Implement a tool which parses through all markdown files and checks URLs for their return codes. This could also be integrated into the build process.

Maspital commented 2 months ago

One thing I need to think about is how to deal with redirects (3xx), as these will be pretty common. I'll probably first test if there are any non-DOI links which do this - if there aren't things are pretty easy, if there are, I'll probably have to define manual exceptions for the link-checking process.

Maspital commented 1 month ago

Additionally, this should also check internal links like /intrusion-detection-datasets/content/datasets/iscx_ids_2012 as these will always generate errors in editors like PyCharm. Checking this should be pretty straightforward (just verify the files existence relative to the root of the repository).

Some linke are relative, like ../datasets/ait_alert_dataset, which is similarly trivial - just keep in mind that the initial ../ is only there because GitHub Pages throws a tantrum otherwise, for the purpose of verification it should be ignored.