Open StevenBarre opened 5 months ago
Capture infra node cpu/memory/network usage before/after Capture collector pod cpu/memory/network usage before/after
Directions to install Loki while keeping Elastic and Kibana around until all the logs in them age out https://access.redhat.com/articles/6991632
Installed in CLAB, leveraging a temp S3 endpoint on the NetApp. Storage team would prefer we use ECS when this is operationalized.
Currently running without redundancy as that was causing errors.
Will need to discuss more with Matt on tuning parameters and query formats.
Case insensitive search
|~ `(?i)mystring`
How to query the API via HTTP https://access.redhat.com/solutions/7046397
Changed the size from 1x.demo
to 1x.extra-small
and that fixed replication and the PDB errors. Needed to expand onto the worker nodes as infra didn't have enough capacity while ES is still in place.
Audit Log query to find Deletes by non-system users
{ log_type="audit" } | json requestURI, verb, code="responseStatus.code", user="user.username" | line_format "{{.requestURI}} {{.verb}} {{.code}} {{.user}}" | verb="delete" user!~"system:.+"
Disk usage before the switch, for 2d of log retention, was 35G x3
Testing logging alerts, but haven't gotten it working yet. Case opened with RH.
Some testing of queries against longer time periods shows a lot of S3 data read, could be an issue in PROD.
Describe the issue Current logging stack is being deprecated in future versions of OpenShift. We need to test out Loki and the migration process.
What is the Value/Impact? Keeping current with technology
What is the plan? How will this get completed?
Identify any dependencies Storage team
Definition of done Loki running in CLAB and a plan to move forward