bird-house / birdhouse-deploy

Scripts and configurations to deploy the various birds and servers required for a full-fledged production platform
https://birdhouse-deploy.readthedocs.io/en/latest/
Apache License 2.0

:bulb: [Feature] Log download stats from THREDDS server #444

Open huard opened 6 months ago

huard commented 6 months ago

Description

It would be useful for reporting purposes to monitor data downloads from THREDDS:

References

This information can be parsed from NGINX logs, but those logs need to be exposed to Prometheus to be aggregated and archived within the current architecture.
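As a rough illustration of the parsing step, the sketch below extracts download records from lines written in NGINX's predefined `combined` log format. It is only a minimal sketch under assumptions: the `THREDDS_PREFIX` value, the `parse_download` helper and the sample log line are hypothetical, and the actual log format and URL layout of a given deployment may differ.

```python
import re
from typing import Optional

# Pattern for NGINX's predefined "combined" log format:
# $remote_addr - $remote_user [$time_local] "$request" $status
# $body_bytes_sent "$http_referer" "$http_user_agent"
COMBINED = re.compile(
    r'(?P<addr>\S+) - (?P<user>\S+) \[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) (?P<bytes>\d+) '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
)

# ASSUMPTION: URL prefix under which THREDDS file downloads are served.
# Adjust to the actual proxy layout of the deployment.
THREDDS_PREFIX = "/twitcher/ows/proxy/thredds/fileServer/"


def parse_download(line: str, prefix: str = THREDDS_PREFIX) -> Optional[dict]:
    """Return a download record for a successful GET under `prefix`, else None."""
    match = COMBINED.match(line)
    if match is None:
        return None  # line is not in the combined format
    parts = match["request"].split()
    if len(parts) != 3:
        return None  # malformed request line
    method, url, _protocol = parts
    path = url.split("?", 1)[0]  # drop any query string
    status = int(match["status"])
    if method != "GET" or not path.startswith(prefix) or not 200 <= status < 300:
        return None
    return {
        "dataset": path[len(prefix):],
        "status": status,
        "bytes": int(match["bytes"]),
        "time": match["time"],
    }


# Hypothetical log line, for illustration only.
SAMPLE = (
    '192.0.2.10 - - [12/Mar/2024:10:15:32 +0000] '
    '"GET /twitcher/ows/proxy/thredds/fileServer/birdhouse/example.nc HTTP/1.1" '
    '200 1048576 "-" "curl/8.4.0"'
)
record = parse_download(SAMPLE)
```

Passing a different `prefix` would let the same function cover other download endpoints, provided their requests go through the same NGINX logs.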

Possible solutions:

Additional info

See also:

Concerned Organizations

fmigneault commented 6 months ago

Consider downloads from WPS outputs and STAC data proxy endpoints as well for the same reasons.

huard commented 5 months ago

ESGF uses Beats and Logstash to collect logs and compute their stats. See https://drive.google.com/drive/folders/1LbvoYeQ_6L_bzTsO-EEhwqjIx1jZ-G1k

fmigneault commented 5 months ago

If the "node collector" can be located on the same instance, Logstash seems like an interesting candidate. If there is no distinction between Beats and Logstash as "log producers", I would favor the second architecture to limit the number of configurations and technologies involved.

huard commented 1 week ago

Parser for NGINX logs with a Prometheus counter: https://gist.github.com/huard/25ca5be3479f72546f748da54f7097e7
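The gist is not reproduced here. For readers who want a feel for the counter side, the following is an independent, standard-library-only sketch that sums bytes per dataset and renders the result in the Prometheus text exposition format. The metric name `thredds_download_bytes_total`, the `dataset` label and the helper names are assumptions made for illustration, not taken from the gist or from birdhouse-deploy.

```python
from collections import Counter
from typing import Iterable, Tuple

# ASSUMPTION: hypothetical metric name, chosen for this example only.
METRIC = "thredds_download_bytes_total"


def escape_label(value: str) -> str:
    """Escape backslash, double quote and newline in a label value."""
    return value.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")


def aggregate(records: Iterable[Tuple[str, int, int]]) -> Counter:
    """Sum bytes sent per dataset over (dataset, status, bytes) records.

    Only 2xx responses are counted, so failed requests do not inflate totals.
    """
    totals: Counter = Counter()
    for dataset, status, size in records:
        if 200 <= status < 300:
            totals[dataset] += size
    return totals


def render(totals: Counter) -> str:
    """Render the totals as a counter in the Prometheus text exposition format."""
    lines = [
        f"# HELP {METRIC} Bytes served per dataset.",
        f"# TYPE {METRIC} counter",
    ]
    for dataset in sorted(totals):
        lines.append(f'{METRIC}{{dataset="{escape_label(dataset)}"}} {totals[dataset]}')
    return "\n".join(lines) + "\n"


# Hypothetical records, for illustration only.
totals = aggregate([
    ("birdhouse/a.nc", 200, 100),
    ("birdhouse/a.nc", 206, 50),
    ("birdhouse/b.nc", 404, 10),
])
output = render(totals)
```

One option would be to write this output to a file picked up by the node exporter's textfile collector; a per-dataset label can lead to high cardinality, so grouping by a coarser key may be preferable in practice.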