lfd / PaStA

The Patch Stack Analysis
GNU General Public License v2.0
33 stars 20 forks source link

Linux weekly digest #47

Closed rsarky closed 2 months ago

rsarky commented 4 years ago

A weekly (could be for any appropriate timescale) digest that presents a number of statistics about the Linux kernel development process in a concise and informative way. Eg. Patches accepted/rejected/ignored by subsystem. top patch submitters, top reviewers, number of lines of code added/deleted etc.

rralf commented 4 years ago

You should look at the bitergia / patchwork stuff. There might already be some preexisting work.

rsarky commented 4 years ago

Oh. I kind of expected this though, will check it out

bulwahn commented 4 years ago

We are already considering this for a while. The appropriate time scale is probably every full release cycle, because the differences per week are mainly dependent where we are in the release cycle. Further, patches accepted/rejected/ignored can only be determined after a few weeks/months when they show up in the git repositories.

Now to the specific stats:

bulwahn commented 4 years ago

I consider off-list patches interesting (because they are changing over time and interesting for other reasons). Also, top reviewers is interesting, but the really interesting aspect is to present author-reviewer-file relationships in a comprehensible manner (that is pretty stable over time, though.).

rsarky commented 4 years ago

We are already considering this for a while. The appropriate time scale is probably every full release cycle, because the differences per week are mainly dependent where we are in the release cycle.

Makes sense. Anyway the timescale shouldn't reallly have an impact on the implementation.

Further, patches accepted/rejected/ignored can only be determined after a few weeks/months when they show up in the git repositories.

Oh yes. That's something that I missed. Do you have any solution for this as you mentioned ignored patches in the list of stats.

Also we can report threads that have been most actively discussed by the community. Applying some NLP methods for sentiment analysis may allow us to discover the most polarising threads. The stability of various subsystem as a function of bug reports is also going to be an interesting metric.

rsarky commented 4 years ago

Any idea how these statistics are currently generated by lwn?

rralf commented 4 years ago

Have a look at git://git.lwn.net/gitdm.git

@ditzlike is currently also trying to find out how it's exactly generating graphs.