GSA / data.gov

Main repository for the data.gov service
https://data.gov
Other
649 stars 101 forks source link

Profile memory consumption of harvest runner under load #4875

Open btylerburton opened 2 months ago

btylerburton commented 2 months ago

User Story

In order to identify memory leaks in the harvester, datagovteam wants to conduct a formal analysis using industry standard memory profiling tools.

Acceptance Criteria

[ACs should be clearly demoable/verifiable whenever possible. Try specifying them using BDD.]

Background

[Any helpful contextual notes or links to artifacts/evidence, if needed]

Security Considerations (required)

[Any security concerns that might be implicated in the change. "None" is OK, just be explicit here!]

Sketch

btylerburton commented 2 months ago

Discussed whether breaking the harvest up into discrete processes: extract, transform, validate, sync/load will make this a non-issue. We can leave this ticket as a might-do just in case that doesn't work.