HumanCellAtlas / metadata-schema

This repo is for the metadata schemas associated with the HCA
Apache License 2.0
65 stars 32 forks source link

Rename "donor" to more generic (e.g. "organism") #83

Closed malloryfreeberg closed 6 years ago

malloryfreeberg commented 6 years ago

Based on user feedback, some projects use non-human subjects (e.g. mouse, dog), in which case the concept of a "donor" does not apply. Some users were very hesitant to put non-human subjects in the "donor" tab.

Suggest renaming "donor" to something more generic. For example, "organism". Other suggestions?

lauraclarke commented 6 years ago

Is there a way to have differential requirements across different species?

malloryfreeberg commented 6 years ago

Re-opening this issue as there is now more discussion going on about the use of donor versus organism. Below are quotes from pilot users taken during user experience evaluation sessions and notes from the Boston metadata meeting specifically centered on metadata terms which led to the decision to replace donor with organism as the entity name.

  1. Quote from pilot user who has both human and mouse data: "Donor is for the human samples, right? ... [A mouse is] an organism, but not a donor. At least not a voluntary donor."

  2. Scribe notes from Boston metadata meeting: "The HCA considered Donor to be the whole organism. Most data contributors put their "specimen" under "donor" and not "specimen from organism". Why? This is because the term "donor" was confusing."

  3. Scribe notes from Boston metadata meeting: Data generators thought "Immune cells, brain tissue" were "donor" type samples. They should be "specimen from organism" types.

Another suggested term instead of donor was "individual", but "organism" was chosen instead because even the term "individual" has some suggestion of person-hood.

malloryfreeberg commented 6 years ago

Also copying a relevant, recent Slack conversation here between myself and @pnejad:

"Could you explain the reasoning behind not using the term donor?"

"So, given that we are expecting to receive non-human samples, we got some feedback from users that the term "donor" implies a human giving consent, which might not always be the case. So, we switched to the more neutral and general "organism" which removes a sense of the individual giving consent (which non-humans can not give) and also makes clear that this schema is for the "whole individual"."

"In regards to donor and consent, wouldn't the non-consent issue also apply to samples from embryos, fetuses and children? Other species can't consent to their samples being donated to research, but likewise some human samples also don’t have consent from donor."

"Yup! There are many instances where the idea of "consent" doesn’t apply, which is why we decided to drop donor in favor of organism."

malloryfreeberg commented 6 years ago

Another point to consider brought up by @claymfischer:

"donor makes more sense than organism for the people we have been talking with. If donor is really unacceptable, we would like to consider a source id field. We understand that donor doesn't feel right to some due to not giving consent, but human donors don't always have the agency to consent either (embryonic stem cells, children under 18, cell lines such as HeLa). We do feel that donors have contributed to science and we should respect that by using the term donor."

malloryfreeberg commented 6 years ago

A point brought up by @gabsie:

"I appreciate Donor might be perfect for the occasions when the data is ideal scenario - coming from Human donors. Do we think this is the primary (95%) type of data coming to the Atlas?

Considering we have spoken at least at this point to teams who are going to have data from mixed origin, or from other organisms, we need to have the option for them to recognise something else as a name, like "organism". Donor wouldn’t be suitable for their experiments, and neither would fields under Donor that have something saying "mouse strain".

As pointed out in the Metadata evaluation report - there needs to be a way to cater for different types of users to recognise their type of organism, Human or Mouse, e.g.

In my opinion, as a word being shown to users, it could say Donor/Organism or Donor, if somebody has selected Human; and Organism, maybe after a selection they have made for other species?"

JimKent commented 6 years ago

A problem with organism is that it is synonymous with species in many usages. When you say "what organism" you are not distinguishing between individuals, but between species most often.

On Wed, Feb 14, 2018 at 5:21 AM, Mallory Freeberg notifications@github.com wrote:

A point brought up by Gabby:

"I appreciate Donor might be perfect for the occasions when the data is ideal scenario - coming from Human donors. Do we think this is the primary (95%) type of data coming to the Atlas?

Considering we have spoken at least at this point to teams who are going to have data from mixed origin, or from other organisms, we need to have the option for them to recognise something else as a name, like "organism". Donor wouldn’t be suitable for their experiments, and neither would fields under Donor that have something saying "mouse strain".

As pointed out in the Metadata evaluation report - there needs to be a way to cater for different types of users to recognise their type of organism, Human or Mouse, e.g.

In my opinion, as a word being shown to users, it could say Donor/Organism or Donor, if somebody has selected Human; and Organism, maybe after a selection they have made for other species?"

— You are receiving this because you were assigned. Reply to this email directly, view it on GitHub https://github.com/HumanCellAtlas/metadata-schema/issues/83#issuecomment-365604897, or mute the thread https://github.com/notifications/unsubscribe-auth/ABo8TBsX8PhXE1tLxlronej6kEtfI3rAks5tUt28gaJpZM4QwZRx .

lauraclarke commented 6 years ago

While I take your point Jim that organism is frequently used as a synonym to species I think given the context in which is being presented here I think the risk of that sort of misinterpretation is quite low.

Thinking about alternatives

We could add whole_ in front of organism, making it clearer we mean organism in the singular rather than the plural

We could change to something like individual rather than organism or donor, this is a good singular non specific noun but I do still worry it isn't really a term that gets used for non human organisms

Anything else?

tburdett commented 6 years ago

I like @lauraclarke's earlier suggestion about the possibility of having different requirements across different species. So 'donor' for human data, 'individual' or something else for other species. Like @JimKent I don't really like 'organism' as almost every other database I know (including both EBI and NCBI biosamples databases) uses this synonymously with species/taxonomy... I'd expect this to confuse people more than donor!

gabsie commented 6 years ago

On the UI specifically, we were thinking to ask the question about species, and then based on that, have the spreadsheet say Donor, if "Homo Sapiens only" has been selected.

lauraclarke commented 6 years ago

I agree that having the UI tailor the language depending on the user is the best solution

We still need a general name for the type through which is agnostic to the specific context

We could decide that as in the majority of cases donor is correct so we use donor though I will note it wasn't just scientists with non human samples who seemed confused, the name donor was also attributed to organ donations as well as humans by the people working in a clinical setting too

malloryfreeberg commented 6 years ago

Proposal: Rename organism.json to donor_organism.json with the understanding that "donor_organism" can be translated into alternative phrases like "donor" or "mouse" or w/e in the UI, in spreadsheets, and other user-friendly interfaces. It seems there will never be a 100% consensus, so my hope is that this compromise is at least tolerable by a majority of invested parties.