Open dsadgat opened 1 year ago
@prathapsridharan, can this be closed?
@dsadgat - There is definitely more to do to flesh this out, but is the v1 of Datadog sufficient to close this particular ticket? (cc: @metakuni)
If the scope of this issue was v1, then I think we can close this, but I defer to @dsadgat.
Motivation
Current production system monitoring is very limited. We have only AWS CloudWatch metrics for infrastructure, plus basic endpoint monitoring via custom logging of API endpoint requests in CloudWatch. The team has no central dashboards for full-system observability. See tech proposal.
Definition of Done
Enable APM for this repo, and document the steps so that others understand what work, if any, is needed to enable APM going forward.
Tasks
The work would involve taking the changes from the associated PR, https://github.com/chanzuckerberg/single-cell-data-portal/pull/3610/files, and getting them onto main. Main has progressed considerably since this spike, so it won't be a straight merge. The changes then need retesting in rdev, then in dev, etc. Some documentation also needs updating.
Review related ticket for context.