GenomicsStandardsConsortium / mixs

Minimum Information about any (X) Sequence” (MIxS) specification
https://w3id.org/mixs
Creative Commons Zero v1.0 Universal
38 stars 21 forks source link

Scoping checklists and packages #510

Closed ramonawalls closed 1 year ago

ramonawalls commented 2 years ago

Discussed at the TWG meeting on Oct. 4.

Our understanding is that checklists are for different sequence types and packages are for different source materials or environments. With the advent of new sequencing types stable isotope fractioning of samples before sequencing, those distinctions may blur.

Can the CIG help us with defining some public guidance on when to add a new checklist vs. a package and how to handle gray areas?

turbomam commented 2 years ago

This was in response to @mslarae13's suggestion for a SIP stable isotope profiling checklist or package, right?

ramonawalls commented 1 year ago

Discussed at the CIG call on 24 Jan 2023.

Definitions and decisions:

ramonawalls commented 1 year ago

@lschriml @only1chunts @pbuttigieg @turbomam @mslarae13 Please review the comment above to make sure you agree. Once we have agreement, I will write definitions for checklist, extension, and package that can be added to the schema.

lschriml commented 1 year ago

A few edits below:

Definitions and decisions:

“Checklists” Defined sets of metadata terms that describe minimal information about a sequenced genome, marker gene or metagenome. “Extensions” Defined sets of environmental package “extensions” metadata terms that describe the specific environment of the collected sample, that can be combined with other Extensions and Checklists

Mix and Match combinations of checklists and extensions are available, to support broad utilization. In our LinkML schema, these are currently called the CombinationClasses. We will re-name them to “Packages”. And encode this information in LinkML Update website. On the website, but we should have the links to checklists, extensions, and packages go to the automatically generated documentation from LinkML. We need to either generate the website documentation automatically from the LinkML docs or have an SOP in place to make sure they stay in sync. [Lynn: I vote to maintain the some static webpages, that describe the scope and description of checklists and extensions]

Need to update the figure that is under extensions on the website so that it reflects our current terminology and status of packages, etc. Need to search github repos to see if there is any out of date documentation there. If so, move it to LinkML docs.

only1chunts commented 1 year ago

@lschriml , I think your definitions are a bit narrower than we want aren't they? Checklists aren't just for "sequenced genome, marker gene or metagenome" we also have microbiomes and want to include transcriptomes as well as single cell genomics, and possibly even synthetic sequences and who knows what else in the future. So can we use the more generic terminology from the meeting notes, maybe something like this: Checklists are; Defined sets of metadata terms that describe sequences from a particular experiment type or methodology. Extensions are; Defined sets of metadata terms that extend a given checklist to describe the specific environment and/or collection details of the collected sample(s). Packages are; The combination of a checklist with one or more extensions.

lschriml commented 1 year ago

yes, that sounds good Chris

On Wed, Jan 25, 2023 at 7:15 AM Chris Hunter @.***> wrote:

@lschriml https://github.com/lschriml , I think your definitions are a bit narrower than we want aren't they? Checklists aren't just for "sequenced genome, marker gene or metagenome" we also have microbiomes and want to include transcriptomes as well as single cell genomics, and possibly even synthetic sequences and who knows what else in the future. So can we use the more generic terminology from the meeting notes, maybe something like this: Checklists are; Defined sets of metadata terms that describe sequences from a particular experiment type or methodology. Extensions are; Defined sets of metadata terms that extend a given checklist to describe the specific environment and/or collection details of the collected sample(s). Packages are; The combination of a checklist with one or more extensions.

— Reply to this email directly, view it on GitHub https://github.com/GenomicsStandardsConsortium/mixs/issues/510#issuecomment-1403517517, or unsubscribe https://github.com/notifications/unsubscribe-auth/ABBB4DOQ5DO2YC765JZ6HFDWUEKNBANCNFSM6AAAAAAQ4WAA4A . You are receiving this because you were mentioned.Message ID: @.***>

-- Lynn M. Schriml, Ph.D. Associate Professor

Institute for Genome Sciences University of Maryland School of Medicine Department of Epidemiology and Public Health 670 W. Baltimore St., HSFIII, Room 3061 Baltimore, MD 21201 P: 410-706-6776 | F: 410-706-6756 @.***

only1chunts commented 1 year ago

@ramonawalls , you've closed this ticket, does that mean the definition have been added to the SoT somewhere? if so, please can you mention here, exactly where they have been put, and confirm that the definitions provided by me above are the ones used? I can them update the gensc.org website with the definitions to match.