Researching other open communities: Governance + Structure

This is an issue to keep track of some "meta" research as a part of the Mozilla Open Leaders program. There are lots of open projects out there, and they've all got their own take on how to effectively organize open communities. As I read and learn more, I'll keep this issue updated with information in case others are interested or have thoughts of their own!

Interesting links

Python has a PEP for changing its own governance model in the wake of Guido taking a step back.
Python also has a PEP for surveying other project models which is a great list for inspiration (similar and more complete than this one).
The Rust community has a great teams page on their website. I love the fact that teams have explicit responsibilities and that there are lots of non-technical teams in there!
- The NPM community also adopted this process for its own "proposal changes and feature requests" activity. (thx @nayafia!)
- Ethereum also has a similar process though I believe they require all new RFC proposals to be discussed in one of the "all dev" meetings
Moreover, the Rust RFC proposal list is a really interesting read if you wanna see a list of "big decisions" in the Rust community over the last few years. In fact, there's an RFC that proposes the current RFC system. :-)
I also heard about the Mozilla Archetypes of Open Source white paper, which describes many kinds of open source projects. Seems like a useful resource as we think about the Jupyter community.
The Mozilla project governance is split into "modules". Each module has a core team and a module owner that makes decisions. I'll keep this list updated as I hear of more! If you've got any suggestions for links, comments, etc please post below!
Here's a fascinating series of blog posts about hybrid organizations from one of the leaders in the Mozilla community. Lots of interesting conversation about how a public/private organization can more effectively balance being mission-driven but also scaling to have an impact on markets in order to further that mission.
The Apache foundation describes their voting process and the different nuances that come with voting in different scenarios and with different groups of people.
An excellent blog post on the Node.js contributor model
Some general model frameworks for open source governance
A great set of slides on the "Liberal Contribution" model that has been adopted by modern-day projects like node and Rust from @nayafia
The Kubeflow governance model is pretty interesting to look through as well

I just had another read-through of the Mozilla archetypes of open source PDF and I wanted to share some thoughts about where I think Jupyter fits in. Here are a few relevant archetypes they discuss:

Rocket Ship To Mars

Characteristics: “Rocket Ship To Mars” projects are characterized by a small full-time core team that is wholly focused on a well-articulated and highly specific goal. They are often led by charismatic founders and enjoy a funding runway sufficient for bootstrapping. Their open source strategy is often rooted in a commitment to transparency and providing insurance: they want to instill confidence among developers and users in order to promote adoption, and being open source is one ingredient in doing that.

Rocket Ships To Mars typically do not invest a lot of effort in encouraging broad contributorship, not so much because they wouldn’t welcome contributions as because it is hard to find contributors who share the project’s intensity and specificity of vision, and who are sufficiently aligned with the founders to take the time to make their contributions fit in. These projects are also usually trying to reach a convincing alpha release as quickly as possible, so the tradeoff between roadmap speed and contributor development looks different for them than for other types of projects.

Controlled Ecosystem

Characteristics: Real community involvement, and diversity of community motivations, but with a shared understanding that the founder (or some credible entity) will act as “benevolent dictator”. Requires experienced open source community management on part of project leaders and occasional willingness to compromise. Works best when technical architecture directly supports out-of-core innovation and customization, such as via a plugin system. Controlled Ecosystem efforts find much of their value in that ecosystem. The core pro vides base value, but the varied contributions across a healthy plugin ecosystem allow the project to address a much larger and diverse set of needs than any one project could tackle alone. Over time, these projects might see more and more of the core functionality structured as plugins as the product becomes more customizable and pluggable. This increasingly lets the plugins determine the end-user experience, and the core project can eventually become infrastructure.

Wide Open

Characteristics: Wide Open projects actively welcome contributions from any source, and tend to get contributions at all levels: from individual developers providing one-off bug fixes to sustained organizational contributors developing major features and being involved in long-range project planning. A Wide Open project values investing in contributors relatively highly: the core development team is generally willing to pay a high opportunity cost (in terms of code not written or bugs not fixed) in order to encourage or onboard a promising new contributor. The project’s collaboration tools and processes reflect this prioritization. For example, the experienced developers in the project will still take the time to hold design discussions in forums-ofrecord — even when many of the individuals concerned are co-located and could make the decisions in person — in order to publicly reinforce the possibility of participation by any other self-nominated interested party. Wide Open is a good archetype for projects that aim to serve the same technical purpose over a long period of time, and for which stability and reliability are more important than rapid development of the code base or fast adoption of field-specific innovations. Wide Open is also especially effective when some significant percentage of the user base is technically oriented (and thus susceptible to being converted to contributors), and when regular contact with a broad developer base is likely to bring real-world usage information that would be difficult for Mozilla to acquire in any other way.

Mass market

Characteristics: Mass Market projects are often similar to “Wide Open” projects, but they necessarily have a protective layer around their contribution intake process. This is because there are simply too many users and too many opinions for everyone to be heard by the development team, and because the average technical sophistication of the users is low (especially in terms of their familiarity with the conventions of open source participation). This is especially true in a commercial context where product experience and time-to-market are highly sensitive. Like B2B, Mass Market tends to drive down the price of directly competitive products, the way OpenOffice.org (and later LibreOffice) did with Microsoft Office. But Mass Market is less likely than B2B to affect complementary products, because Mass Market is usually aimed at individuals not at intermediary organizations. Mass Market open source projects have certain opportunities that come with economies of scale, and they should actively seek to take advantage of those opportunities. For example, they can often get good translation (i.e., localization) coverage, because there are many users who are qualified and motivated to perform translation even though they might not have the skills to participate as programmers. Mass Market projects can do effective A/B testing and user surveys, and have some ability to influence de facto or formal Internet and Web standards. Finally, they can be thought leaders in a broader political sense (e.g., through industry partnerships, user clubs, etc).

My thoughts:

Totally spitballing here: I feel like Jupyter began as a "Rocketship to Mars". The core team went through many cycles of rapid development. This gave us amazing products like the Jupyter Notebook, at the cost of some good will from the developer community (in the past I'd often hear "it's so hard to keep up with developments the Jupyter community because the technology moves so fast")

In the last few years, as the products in Jupyter have matured and the community of core contributors has diversified, Jupyter has become something closer to these categories, in decreasing order:

a controlled ecosystem, in the sense that the Jupyter extension/plugin system is a part of the core Jupyter ideology (standards and modularity)
a "mass market" in the sense that many core Jupyter products provide an open-source alternative to what would otherwise be captured by proprietary products from companies
a "wide open" project in the sense that we've seen more emphasis in recent years on building community rather than just building features. E.g., JupyterCon, JupyterDays, improvements to documentation, improvements to team processes to make Jupyter more welcoming, etc.

I feel like we're also starting to bump up against "B2B" and "Multi-Vendor Infrastructure" models, but not necessarily in a principled way, just because of the fact that there are more large-scale organizations relying on Jupyter tech.

One other thing I'll mention is that I get the feeling this depends a lot on the particular sub-community that one is a part of. Most of my experiences are from the JupyterHub and Binder side of things, and I'd be really curious to hear what other folks think about where Jupyter falls on this scale.

I want to think about this a bit more but these are just some early thoughts. Other questions swirling in my brain:

If we had 100% points to give out to all of these categories, how would they be divided?
This is an attempt at description, not prescription. What do people think Jupyter should be in its goals?
What kind of challenges or opportunities do each of these models have for sustainability in the Jupyter community?
Given thoughts in 2 and 3, what are some things we, as a community, should be doing differently?

I found that article from Mozilla to be really thoughtful. I am going to sidestep your specific question from now and provide some context about what makes Jupyter a bit unique in this space:

Today, most of our users are not and will not likely become developers of Jupyter. This is meant to be a descriptive statement, not a prescriptive one (I think we should do everything we can to develop and mentor users into developers/designers).
Closely related to this - our user base has grown at a rate that has outpaced resources (core team time, $) by orders of magnitude.
Unlike most projects with the level of funding we have had, there is no commercial entity behind us. This means we have tackled large ambitious blocks of work, but in a manner that has more "breathing room" and space for community involvement.

I think there are other archetypes in the document that capture aspects of Jupyter:

Multi-Vendor Infrastructure

On Wed, Oct 24, 2018 at 11:03 AM Chris Holdgraf notifications@github.com wrote:

I just had another read-through of the Mozilla archetypes of open source https://blog.mozilla.org/wp-content/uploads/2018/05/MZOTS_OS_Archetypes_report_ext_scr.pdf PDF and I wanted to share some thoughts about where I think Jupyter fits in. Here are a few relevant archetypes they discuss: Rocket Ship To Mars

Characteristics: “Rocket Ship To Mars” projects are characterized by a small full-time core team that is wholly focused on a well-articulated and highly specific goal. They are often led by charismatic founders and enjoy a funding runway sufficient for bootstrapping. Their open source strategy is often rooted in a commitment to transparency and providing insurance: they want to instill confidence among developers and users in order to promote adoption, and being open source is one ingredient in doing that.

Rocket Ships To Mars typically do not invest a lot of effort in encouraging broad contributorship, not so much because they wouldn’t welcome contributions as because it is hard to find contributors who share the project’s intensity and specificity of vision, and who are sufficiently aligned with the founders to take the time to make their contributions fit in. These projects are also usually trying to reach a convincing alpha release as quickly as possible, so the tradeoff between roadmap speed and contributor development looks different for them than for other types of projects.

Controlled Ecosystem

Characteristics: Real community involvement, and diversity of community motivations, but with a shared understanding that the founder (or some credible entity) will act as “benevolent dictator”. Requires experienced open source community management on part of project leaders and occasional willingness to compromise. Works best when technical architecture directly supports out-of-core innovation and customization, such as via a plugin system. Controlled Ecosystem efforts find much of their value in that ecosystem. The core pro vides base value, but the varied contributions across a healthy plugin ecosystem allow the project to address a much larger and diverse set of needs than any one project could tackle alone. Over time, these projects might see more and more of the core functionality structured as plugins as the product becomes more customizable and pluggable. This increasingly lets the plugins determine the end-user experience, and the core project can eventually become infrastructure.

Wide Open

Characteristics: Wide Open projects actively welcome contributions from any source, and tend to get contributions at all levels: from individual developers providing one-off bug fixes to sustained organizational contributors developing major features and being involved in long-range project planning. A Wide Open project values investing in contributors relatively highly: the core development team is generally willing to pay a high opportunity cost (in terms of code not written or bugs not fixed) in order to encourage or onboard a promising new contributor. The project’s collaboration tools and processes reflect this prioritization. For example, the experienced developers in the project will still take the time to hold design discussions in forums-ofrecord — even when many of the individuals concerned are co-located and could make the decisions in person — in order to publicly reinforce the possibility of participation by any other self-nominated interested party. Wide Open is a good archetype for projects that aim to serve the same technical purpose over a long period of time, and for which stability and reliability are more important than rapid development of the code base or fast adoption of field-specific innovations. Wide Open is also especially effective when some significant percentage of the user base is technically oriented (and thus susceptible to being converted to contributors), and when regular contact with a broad developer base is likely to bring real-world usage information that would be difficult for Mozilla to acquire in any other way.

Mass market

Characteristics: Mass Market projects are often similar to “Wide Open” projects, but they necessarily have a protective layer around their contribution intake process. This is because there are simply too many users and too many opinions for everyone to be heard by the development team, and because the average technical sophistication of the users is low (especially in terms of their familiarity with the conventions of open source participation). This is especially true in a commercial context where product experience and time-to-market are highly sensitive. Like B2B, Mass Market tends to drive down the price of directly competitive products, the way OpenOffice.org (and later LibreOffice) did with Microsoft Office. But Mass Market is less likely than B2B to affect complementary products, because Mass Market is usually aimed at individuals not at intermediary organizations. Mass Market open source projects have certain opportunities that come with economies of scale, and they should actively seek to take advantage of those opportunities. For example, they can often get good translation (i.e., localization) coverage, because there are many users who are qualified and motivated to perform translation even though they might not have the skills to participate as programmers. Mass Market projects can do effective A/B testing and user surveys, and have some ability to influence de facto or formal Internet and Web standards. Finally, they can be thought leaders in a broader political sense (e.g., through industry partnerships, user clubs, etc).

My thoughts:

Totally spitballing here: I feel like Jupyter began as a "Rocketship to Mars". The core team went through many cycles of rapid development. This gave us amazing products like the Jupyter Notebook, at the cost of some good will from the developer community (in the past I'd often hear "it's so hard to keep up with developments the Jupyter community because the technology moves so fast")

In the last few years, as the products in Jupyter have matured and the community of core contributors has diversified, Jupyter has become something closer to these categories, in decreasing order:

a controlled ecosystem, in the sense that the Jupyter extension/plugin system is a part of the core Jupyter ideology (standards and modularity)

a "mass market" in the sense that many core Jupyter products provide an open-source alternative to what would otherwise be captured by proprietary products from companies

a "wide open" project in the sense that we've seen more emphasis in recent years on building community rather than just building features. E.g., JupyterCon, JupyterDays, improvements to documentation, improvements to team processes to make Jupyter more welcoming, etc.

One other thing I'll mention is that I get the feeling this depends a lot on the particular sub-community that one is a part of. Most of my experiences are from the JupyterHub and Binder side of things, and I'd be really curious to hear what other folks think about where Jupyter falls on this scale.

I want to think about this a bit more but these are just some early thoughts. Other questions swirling in my brain:

If we had 100% points to give out to all of these categories, how would they be divided?

This is an attempt at description, not prescription. What do people think Jupyter should be in its goals?

What kind of challenges or opportunities do each of these models have for sustainability in the Jupyter community?

Given thoughts in 2 and 3, what are some things we, as a community, should be doing differently?

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/jupyter/governance/issues/60#issuecomment-432766439, or mute the thread https://github.com/notifications/unsubscribe-auth/AABr0BCNA_7OeO8enWVzsFXDqDu9nUrcks5uoKtYgaJpZM4X0bEn .

-- Brian E. Granger Associate Professor of Physics and Data Science Cal Poly State University, San Luis Obispo @ellisonbg on Twitter and GitHub bgranger@calpoly.edu and ellisonbg@gmail.com

@ellisonbg I'm curious to your thoughts on the set of blog posts that I just posted from the Mozilla person. They kind of get at this point, in the context of discussing "hybrid organizations" like Mozilla. It's about 10 years old and obviously much has changed since they were written, but I found it insightful nonetheless https://commonspace.wordpress.com/category/hybrid/

Love this series of blog posts :-)

The hybrid idea is a really nice way of capturing the dualities that we find in Jupyter.

In so many of the conversations I have had about sustainability, governance, organization, etc., people have come up with ideas and proposals that end up boxing us into one of the categories in a way that doesn't at all reflect the history and values of the project.

As mentioned in the blog post, the "intersection between participation and product" is one aspect that I think is really important. We have always been product and user focused (let's build a tool that helps people). But, the broader community of users and contributors has always been super important for us - in a way that most companies can't or don't prioritize. Also, our prioritization of educational and research applications remains a core value of the project. And in that space, it isn't just about product development. I have remained in academia because I deeply value the activity of teaching, training, and mentoring people. Not only the economic aspects, but the broader aspects that have societal and personal impact.

In many ways, we are like a social benefit non-profit - it happens to be that the way we help people and society is by building what are essentially enterprise tools for working with code and data. And we want to do both of those things in a manner that embraces the participatory nature of the web. That is super weird and unusual and I love Jupyter for it.

Another is the reality of structure and governance. Seeing how we are hybrid like this helps me to understand why it is so difficult to build a solid governance model that captures the reality of our project.

To me, some of the question boils down to "what are the means and what are the ends"? Being a hybrid organization means that we have multiple means and ends that feedback into each other.

On Wed, Oct 24, 2018 at 12:27 PM Chris Holdgraf notifications@github.com wrote:

@ellisonbg https://github.com/ellisonbg I'm curious to your thoughts on the set of blog posts that I just posted from the Mozilla person. They kind of get at this point, in the context of discussing "hybrid organizations" like Mozilla. It's about 10 years old and obviously much has changed since they were written, but I found it insightful nonetheless https://commonspace.wordpress.com/category/hybrid/

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/jupyter/governance/issues/60#issuecomment-432796381, or mute the thread https://github.com/notifications/unsubscribe-auth/AABr0Kfd-MbeuX_dNlsZfNK8Btra5-RIks5uoL8ugaJpZM4X0bEn .

-- Brian E. Granger Associate Professor of Physics and Data Science Cal Poly State University, San Luis Obispo @ellisonbg on Twitter and GitHub bgranger@calpoly.edu and ellisonbg@gmail.com

Hey folks - just pinging all on this thread that a member from the ropensci community is sharing some research they've done on open governance structures. It'll be in this ropensci community call here:

https://ropensci.org/blog/2018/12/05/commcall-dec2018/

Thanks I will try to make this!

On Thu, Dec 6, 2018 at 8:49 AM Chris Holdgraf notifications@github.com wrote:

Hey folks - just pinging all on this thread that a member from the ropensci community is sharing some research they've done on open governance structures. It'll be in this ropensci community call here:

https://ropensci.org/blog/2018/12/05/commcall-dec2018/

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/jupyter/governance/issues/60#issuecomment-444943380, or mute the thread https://github.com/notifications/unsubscribe-auth/AABr0J5WKKYLDgXNhEdTz3YrJIgm15Q0ks5u2Uq3gaJpZM4X0bEn .

-- Brian E. Granger Associate Professor of Physics and Data Science Cal Poly State University, San Luis Obispo @ellisonbg on Twitter and GitHub bgranger@calpoly.edu and ellisonbg@gmail.com

I see that Dan is a postdoc at UC Berkeley, so we might be able to talk to him outside the call as well.

Thanks for sharing, I look forward to listening in on this.

I just noticed that Rust has their own working group on governance refactoring, might be worth looking at that process for inspiration:

https://internals.rust-lang.org/t/governance-working-group-announcement/9637?u=batmanaod

jupyter / governance