Open anarcat opened 7 years ago
I think we could process oldoldstable (but possibly we need more RAM on manziarly for that). I can test this on my machine to see how much the RAM footprint increases.
I think processing manpages for all Debian releases is overwhelming, both in terms of significantly growing the resource requirements, and in terms of overloading the UI with a large list of releases.
In our current setup, we use about 1G of resident set size. With oldoldstable added, we use about 1.5G of resident set size.
Looking at manziarly, we’re already swapping during regular operation: https://munin.debian.org/debian.org/manziarly.debian.org/index.html#system
So, I see a couple of options:
i'd love to see 2 and 3 happen... :) it seems fairly straightforward to profile memory usage in go but i haven't played with that, personally.
i also wonder if we couldn't rely on archived suites not changing. for example, we could have links to the squeeze
suite now and generate those manpages once without having to reparse the whole suite at every run. we just need to keep links for the relevant manpages... same probably applies to wheezy: it's unlikely that we have manpages changes in LTS...
wouldn't that approach save some resources? or it's too much of a design change?
You’re right regarding the content of the manpages, and rendering is indeed skipped already. Checking whether manpages need to be rendered only takes on the order of a few seconds for the entire corpus, so this is not worth optimizing.
Note, however, that the navigation panel on the manpages of all versions needs to be updated whenever any version changes. E.g., if a package gets removed from testing, it shouldn’t appear in the oldoldstable version’s navigation panel. Hence, we need to process all pages.
I just had a thought that it would actually be really great if we had the history of older manpages from unsupported suites. For example, I was just looking for the manpage of dpkg-buildpackage for wheezy and it wasn't linked here:
https://manpages.debian.org/jessie/dpkg-dev/dpkg-buildpackage.1.en.html
yet it actually exists there:
https://manpages.debian.org/wheezy/dpkg-dev/dpkg-buildpackage.1.en.html
So I'd ask for two things, if I can:
This would neatly fix the concerns with the "stability" of codenamed suites (expressed in #54): we just keep those forever, basically. From what I understand, the disk space usage isn't that critical that we should keep from doing this.