TheJacksonLaboratory / PDXintegrator

Code for mapping PDX Net data to a common data model
http://pdxintegrator.readthedocs.io/en/latest/
MIT License

Universally unique identifier (UUID) management #10

Open pnrobinson opened 6 years ago

pnrobinson commented 6 years ago

We need to decide upon a strategy for the various (patient, PDX model) identifiers to be used in PDX Net. Some of the issues include

chuanglab commented 6 years ago

Carol and Steve have more direct experience with this, but here are my comments:

sethi7ik commented 6 years ago

Agree with Jeff here (samples need UUIDs in addition to patients). A standard UUID is a 128-bit value, usually written as 32 hexadecimal characters (36 with hyphens). An example of how to generate these UUIDs is here: http://www.javapractices.com/topic/TopicAction.do?Id=56
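
For reference, a minimal sketch of generating such an identifier with Python's standard `uuid` module. The function name `new_identifier` is only illustrative, and choosing a random (version 4) UUID is an assumption, not something PDXNet has agreed on.

```python
import uuid


def new_identifier() -> str:
    """Return a random (version 4) UUID as 32 lowercase hex characters."""
    return uuid.uuid4().hex


sample_id = new_identifier()
print(sample_id)             # 32 hex characters, different on every run
print(uuid.UUID(sample_id))  # the same value in 36-character hyphenated form
```

Random UUIDs carry no information about the patient or sample, which fits the goal of identifiers that are unique without being descriptive.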

It is trickier to figure out how to connect these IDs with IDs outside PDXNet, as the samples could be distributed from any center or from the PDMR. Also, differences in data usage consent across studies can make it impractical for the same patient to have the same UUID in different studies.

However, I think that for PDXNet, we need to generate the UUID and give it to all the PDTCs.
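
One way central assignment could work is sketched below, under several assumptions: a single in-memory registry, a hypothetical `IdRegistry` class, and a lookup key made of the center name, the entity kind (patient or sample), and the center's local ID. None of these names come from an existing PDXNet system, and a real registry would need persistent storage and access control.

```python
import uuid


class IdRegistry:
    """Hypothetical central registry: one UUID per (center, kind, local_id)."""

    def __init__(self) -> None:
        self._ids: dict[tuple[str, str, str], str] = {}

    def get_or_create(self, center: str, kind: str, local_id: str) -> str:
        """Return the existing UUID for this key, or mint a new one."""
        key = (center, kind, local_id)
        if key not in self._ids:
            self._ids[key] = str(uuid.uuid4())
        return self._ids[key]


registry = IdRegistry()
patient = registry.get_or_create("center-A", "patient", "P-001")
sample = registry.get_or_create("center-A", "sample", "S-001")
again = registry.get_or_create("center-A", "patient", "P-001")

print(patient == again)   # True: repeated requests return the same UUID
print(patient == sample)  # False: samples get their own UUIDs
```

Because the study is not part of the key here, the same patient would keep one UUID across studies. If consent rules require separate identifiers per study, the study name could be added to the key.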

pnrobinson commented 6 years ago

@chuanglab @sethi7ik who should be responsible for the topic of UUIDs? I am not currently doing anything about this, but I am concerned that nobody else is. Can we develop an SOP that we can present to the group at one of the next calls?

chuanglab commented 6 years ago

Carol developed a system such that the numbers are long enough to be unique but don't provide much biological info in themselves (e.g. a model is TM00099), and they had also been working on tracking info about related samples from the same patient. I believe Steve has worked on this with Carol.
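
As an illustration of an opaque identifier of this shape, here is a minimal sketch that formats a running counter as a fixed-width ID. It is not a description of Carol's actual system: the function name `format_model_id`, the default prefix, and the five-digit width are assumptions inferred only from the single example TM00099.

```python
def format_model_id(counter: int, prefix: str = "TM", width: int = 5) -> str:
    """Format a non-negative counter as a zero-padded, prefixed identifier."""
    if counter < 0 or counter >= 10 ** width:
        raise ValueError(f"counter must be between 0 and {10 ** width - 1}")
    return f"{prefix}{counter:0{width}d}"


print(format_model_id(99))  # TM00099
```

An ID like this is unique as long as the counter is issued from one place, and it reveals nothing about tumor type or patient.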