Closed cmungall closed 9 months ago
Current docs live (but not linked): https://linkml.io/linkml/howtos/collaborative-development
Pull requests welcome on:
@cmungall FYI we're going to rename from O3 Principles to O3 Guidelines. I can send a PR later or feel free to update
Permissively License Your Code and Data
(adapted from O3 guidelines)
Using recognizable, permissive licenses (e.g., CC0, CC BY) encourages contribution and ensures content longevity. Non-permissive licenses or custom terms can hinder reuse and engagement. Permissive licensing doesn't typically lead to a lack of credit for the original resource.
Given our recent discussion, should we point out that CC0 and CC BY are good for non-code resources (or maybe also for projects that include code and non-code resources) but that code should be licensed with a software license like Apache 2.0 or MIT?
The LinkML ecosystem attempts to encourage best practice for collaborative schema development - for example, the schema cookiecutter sets you up with standard CI workflows to encourage contributions via PRs, together with default CONTRIBUTING.md, CODE_OF_CONDUCT.md, etc.
However, we would benefit from a more direct narrative guide. the best place may be in the howto section (but FAQ entries also welcome)
There would be no need to write this de-novo. It would heavily reference and link out to other guides, in particular
Although we would heavily reference rather than duplicate, we could include some concrete examples of docs in existing schema or data modeling projects:
However, we should recognize that different kinds of projects call for different kinds of processes. A small schema designed primarily to support a single data portal does not need processes for creating working groups. A project that needs direct input from a large number of non-technical SMEs may need to insulate from technical GitHub interfaces
O3 principles
Here's a summary of each point in the provided text as bullet points (from ChatGPT):
Version Control Your Code and Data
Permissively License Your Code and Data
Make Your Data Approachable
Use Technical Workflows (Automation)
Use Social Workflows
Establish Project Governance
Attract and Engage Contributors
Obook Open Science Engineer guide
The document "Maximising impact as an open science engineer - OBO Semantic Engineering Training" outlines principles and practices for effective collaboration and impact in the field of open science engineering. Here's a summary:
Principle of Collaboration: Emphasizes the importance of social collaborative workflows in open science. It advises on effective online communication, upvoting helpful answers on platforms like Stack Overflow and GitHub, answering questions even outside one's specific project, conducting basic research before posting queries, and continuously improving open science documentation.
Principle of Upstream Fixing: Encourages fixing issues at the earliest possible stage in the dependency chain, maximizing the impact of changes and benefiting a wider community. It includes a case study highlighting the importance of quality control and community contributions in ontology development.
Principle of No-ownership: Advocates for a mindset of shared ownership and collaborative development in open science projects, particularly in the context of publicly funded work like ontologies. It suggests embracing community-driven development without specific owners or decision-makers and emphasizes the importance of proactive involvement and decision-making in a decentralized environment.
The document also includes a TL;DR summary with key takeaways:
These principles and practices are aimed at fostering a more collaborative, efficient, and impactful open science community.
TisLab guide
The document outlines several common pitfalls encountered in transdisciplinary and geographically distributed research teams. Here's a bullet list summarizing each pitfall:
Conflicts of Interest: Failing to recognize and manage conflicts of interest within the team, which can lead to biased decision-making and undermine the team's integrity and objectives.
Time Zone Challenges: Difficulties in planning and coordinating activities across different time zones, leading to scheduling conflicts and reduced team efficiency.
Miscommunication in Electronic Communication: Misunderstandings and misinterpretations that arise from reliance on electronic communication, lacking the nuances of face-to-face interaction.
Excessive Project Management Change: Frequent changes in project management strategies or personnel, leading to disruption, confusion, and a lack of continuity in the team’s approach.
Poor Version Control: Inadequate version control practices for managing documents and data, resulting in disorganization, confusion, and potential loss of important information.
Avoiding Difficult Conversations: Reluctance to address challenging issues or conflicts within the team, which can lead to unresolved problems and deteriorate team dynamics.
These pitfalls highlight the complexities and challenges of managing large, diverse, and distributed research teams, underscoring the need for effective management and communication strategies.
Best practices:
The document outlines several best practices for managing transdisciplinary and geographically distributed research teams effectively. Here's a summary of these practices:
Clear Collaboration Rules: Establishing explicit guidelines and rules for collaboration to ensure everyone understands expectations and responsibilities.
Effective Communication Strategies: Developing robust communication strategies that accommodate different time zones and leverage various communication tools to facilitate clear, frequent, and inclusive dialogue.
Onboarding and Offboarding Processes: Implementing structured processes for introducing new members to the team (onboarding) and managing the departure of members (offboarding) to maintain continuity and knowledge transfer.
Collaborative Writing Processes: Encouraging a cooperative approach to writing and document creation, which includes shared authorship and equitable contribution recognition.
Investing in Project Management: Recognizing the value of professional project management and allocating resources accordingly to enhance project coordination and efficiency.
Strong Leadership: Fostering strong, empathetic leadership that can guide the team, resolve conflicts, and motivate members towards common goals.
Addressing Bullying: Proactively addressing and preventing bullying within the team to maintain a respectful and productive working environment.
Nurturing Careers: Supporting the career development of team members, recognizing their contributions, and providing opportunities for growth and advancement.
These best practices are designed to foster a cohesive, efficient, and respectful working environment within diverse and distributed research teams, thereby enhancing productivity and team satisfaction.
Bioschemas governance
The document titled "Bioschemas Governance" outlines the governance structure and guidelines for the Bioschemas community, a project aimed at improving data interoperability in life sciences. Here's a summary:
Overview: Bioschemas adheres to five core principles of OpenStand: Respectful cooperation, adherence to standards development parameters, collective empowerment, availability, and voluntary adoption. Community members must follow the Code of Conduct based on FORCE11 guidelines.
Steering Council: Responsible for strategic and organizational planning, oversight of community activities, and promoting Bioschemas activities. The council meets every two months and communicates regularly via email and online messaging platforms.
Community and Working Groups: Day-to-day activities are conducted by the community, focusing on profile and type development and adoption. Working groups, each led by two individuals, develop markup practices for specific concepts. The Steering Council approves releases of profiles and types.
Role Holder Appointment and Removal Processes: Describes the election of Steering Council members and Working Group Leads, with a 2-year term of service. Inactive role holders may be removed following a defined process.
Specification Development and Versioning: Explains how specifications (profiles or types) are developed, including collaborative community engagement, version numbering, and authorship acknowledgment.
Profile and Type Development: Details steps for developing profiles and types, including identifying base types, property cardinality, and use cases. Processes for proposing new profiles or types and renaming or deprecating them are also covered.
Changing Governance Documents: Future changes to governance documents are to be submitted via a GitHub pull request, with public comment and Steering Council approval.
Sources: Lists references used in the document, including links to governing principles and codes of conduct from other organizations like FORCE11, Jupyter, and W3C.
This document provides a comprehensive guide to the governance structure, roles, processes, and best practices within the Bioschemas community, emphasizing open collaboration, transparency, and adherence to established standards.
schema.org how we work doc
The document titled "How We Work - Schema.org" provides an overview of the processes and practices employed by Schema.org in developing and updating its schemas. Here's a summary:
Overview and Process:
Versioning and Change Control:
Schema Structure:
Extensibility Mechanisms:
Early Access Fixes and Pending Releases:
Workflow FAQ:
Related Links and Further Reading:
This document serves as a comprehensive guide to the operational framework, versioning system, and collaborative nature of Schema.org's efforts in structuring and standardizing web data.