where to capture system performance and needs documentation?

rahulbot commented 10 months ago

We are starting to learn more about (1) the compute/memory/storage needs and (2) system throughput and performance. For instance, @philbudne shared that:

"32GB and 16 CPUs is way more than the rss-fetcher needs, but insufficient for the story indexer, at least memory-wise (think 20 fetchers each of wants 1G, and THEN add ES to the mix)"
1 importer worker playing catch up processed about 60 Stories/second, bumping to 2 doubled rate linearly (up to 120 stories/second)

This kind of information seems useful to capture in a collaborative space (rather than my personal notes app), but I'm not sure where. Perhaps a wiki page on this GH repo? Better ideas?

philbudne commented 10 months ago

I don't have any brilliant thoughts on how to capture the data.

I've certainly seen that parsers running on ramos can do more than the same number on ifill, so it depends on the hardware. And the software revision (unless one monitors for performance regressions)!

rahulbot commented 7 months ago

Don't have a great solution, and numbers will likely change with hardware we have in house. Closing as archival information.

mediacloud / story-indexer

where to capture system performance and needs documentation? #206