Open psaiz opened 10 months ago
I'm taking this snippet from an email from Tibor from the 2.10.2023. The email described work done to understand the CODP usage analysis, and it identifies different URL that were combined into categories
Distinct IPs overall, firewall logs (/eos): 496
Distinct IPs overall, firewall logs (/about): 3946
Distinct IPs overall, firewall logs (/collection): 18773
Distinct IPs overall, firewall logs (/visualise/events): 14054
Distinct IPs overall, firewall logs (/visualise/histograms): 2967
Distinct IPs overall, firewall logs (/record): 19033
Distinct IPs overall, firewall logs (/search): 8670
Distinct IPs overall, firewall logs (/): 72752
Distinct IPs overall, firewall logs (*): 97679
We collect statistics based on the nginx access monitoring. The information is available here
The accesses can be split into categories, which are done based on invenio [here] (https://github.com/inveniosoftware/helm-invenio/blob/master/invenio/templates/configurations/logstash.yaml#L112)
We should verify that the categories make sense for opendata.