malariagen / fits

File tracking system for group DK
0 stars 0 forks source link

Human Burkina Faso samples missing oxford codes #19

Open tnguyensanger opened 6 years ago

tnguyensanger commented 6 years ago

From the solaris_human database, the samples with sequencescape_sample_name like ILB_BFFUL* have oxford codes like BB<4 digits>-<C, M, F>.

EG)

SELECT sanger_sample_code, oxford_code 
FROM solaris_human.oxford_to_sequencescape 
where sanger_sample_code like '%ILB_BFFUL%' limit 3;
sanger_sample_code oxford_code
ILB_BFFUL5541309 BB0001-C
ILB_BFFUL5541325 BB0001-F
ILB_BFFUL5541317 BB0001-M

In the fits database, the oxford_code is NULL, and one of the samples ILB_BFFUL5541309 seems to be missing:

select fits_sample_id, oxford_sample_id, sequenscape_sample_name from mm6_fits.vw_pivot_sample where sequenscape_sample_name in 
('ILB_BFFUL5541309', 'ILB_BFFUL5541325', 'ILB_BFFUL5541317');
fits_sample_id oxford_sample_id sequenscape_sample_name
20389 ILB_BFFUL5541317
20403 ILB_BFFUL5541325
magnusmanske commented 6 years ago

The first query gives me 93 results. I added the Oxford codes to FITS via the sequencescape_sample_name (tag_id 3586), but only found 74 matches; I assume the other samples are not in there.

Note that the values are only NULL in the pivot table; in the sample2tag table, they are just missing.