Princeton-LSI-ResearchComputing / tracebase

Mouse Metabolite Tracing Data Repository for the Rabinowitz Lab
MIT License
4 stars 1 forks source link

Do not allow duplicate animal records that differ by only the name #688

Open lparsons opened 1 year ago

lparsons commented 1 year ago

FEATURE REQUEST

Inspiration

Animal records (mice) are defined by the metadata associated with them. If two animals are identical in every way except the name, they are very likely errant duplicates.

Description

When loading animal records, if an incoming animal matches an existing animal exactly but has a different name, the loader should raise an exception and fail to load the animal.

Alternatives

Dependencies

Comment

If, at some point in the future, we discover a need to handle replicate animals, a "replicate field" (perhaps "mouse number") should be added to the animal model to allow researchers to explicitly define replicates.


ISSUE OWNER SECTION

Assumptions

Requirements

Limitations

Affected Components

A tentative list of anticipated repository items that will be changed, labeled with "add", "delete", or "change". One item per line. (Mostly, this will be a list of files.)

DESIGN

Interface Change description

Describe changes to usage. E.g. GUI/command-line changes

Code Change Description

Describe code changes planned for the feature. (Pseudocode encouraged)

Tests

mneinast commented 2 months ago

Actually I think there could be instances where different animals have the same metadata. Here is each column in the Animals sheet of the Study Doc:

I'm imagining that users will frequently submit studies where 1-2 groups of animals were given the same infusate. For example, in a study where 13C-valine was infused comparing two different Genotypes...within a single Genotype, the only unique metadata would be Weight (Infusion Rate is often in proportion to Weight). We will likely encounter different mice with the same Weight.

Right now, does the Animal Name field accomplishe the same thing that a Replicate field would?

lparsons commented 2 months ago

Right now, does the Animal Name field accomplishe the same thing that a Replicate field would?

Yes, this issue was described last year which was before the work on an animal and sample database got moving. The main concern was that if people weren't careful in their naming we could end up with two animal records for the same animal. However, as we have moved forward, I'm less concerned about that possibility. I would suggest that this issue can be closed as a "won't fix" and we can revisit the issue in the animal and sample tracking project. Do you agree @mneinast and @hepcat72?

hepcat72 commented 2 months ago

That sounds reasonable to me @lparsons. However, let me offer an alternative. We have no way of tracking potential duplicate animals or assessing the extent of it in existing data. Instead of a loading-prohibitive error, we could just make it a warning that only the user sees in the validation interface. The curator wouldn't have to bother with it during a load.

In any case, it would be a low priority issue, not necessary for any of the existing milestones.

mneinast commented 2 months ago

Agree with lance. I think Rob's proposed alternative is nice but not necessary