NIAID-Data-Ecosystem / nde-portal

Discovery platform to find NIAID-related datasets and tools
Apache License 2.0
4 stars 1 forks source link

Double check counts for the visual summary bar graphs #75

Closed flaneuse closed 2 years ago

flaneuse commented 2 years ago

For the datasets in the DDE, I get different answers than what's in the bargraphs. @candicecz can you check the calculations?

From a quick JavaScript calc, I get:

[{"technique":"transcriptomics","count":98},{"technique":"metabolomics","count":63},{"technique":"null","count":49},{"technique":"genomics","count":38},{"technique":"microbiome sequencing","count":36},{"technique":"proteomics","count":32},{"technique":"mRNA (microarray)","count":32},{"technique":"Bacteriological Assay","count":25},{"technique":"microRNA (microarray)","count":20},{"technique":"lipidomics","count":19},{"technique":"wgs","count":16},{"technique":".mzXML\\n.xlsx","count":13},{"technique":"RNA sequencing","count":13},{"technique":"RNA-Seq","count":13},{"technique":"transposon sequencing","count":10},{"technique":"immunological assay","count":9},{"technique":"DNAseq","count":8},{"technique":"Illumina PCR amplifed TnSeq ends","count":8},{"technique":"Illumina WGS","count":7},{"technique":"Mass Spectrometry","count":6},{"technique":"Microarray","count":5},{"technique":"Protein abundance","count":5},{"technique":"Protein phosphorylation","count":4},{"technique":"Illuminia gDNA sequence","count":4},{"technique":"Whole genome sequence","count":4},{"technique":"Proteomics experiment","count":4},{"technique":"Flow Cytometry","count":4},{"technique":"Nucleic Acid Sequencing","count":3},{"technique":"fastq","count":3},{"technique":"Functional genomics","count":3},{"technique":"Protein interaction experiment","count":3},{"technique":"metagenomics","count":3},{"technique":"temporal RNA-Seq","count":3},{"technique":"ChIP-seq","count":3},{"technique":"mRNA-seq","count":2},{"technique":"RNA-seq","count":2},{"technique":"Whole genome sequencing","count":2},{"technique":"serology","count":2},{"technique":"Post Translational Modification Analysis","count":2},{"technique":"Whole Genome Sequencing","count":2},{"technique":"Reverse Transcriptase-Polymerase Chain Reaction","count":2},{"technique":"Protein ubiquitination","count":2},{"technique":"Raw sequencing data for CRISPR KO screens","count":2},{"technique":"RNA Isolation","count":2},{"technique":"Metagenomics","count":2},{"technique":"Whole genome Sequencing","count":2},{"technique":"amplicon-sequencing (TRIP)","count":1},{"technique":"Lipidomics","count":1},{"technique":"miRNA-seq","count":1},{"technique":"FASTQ","count":1},{"technique":"Vital Signs Measurement","count":1},{"technique":"Gene Knockout","count":1},{"technique":"CRISPR","count":1},{"technique":"Pathology","count":1},{"technique":"pathology","count":1},{"technique":"Cell Sorting","count":1},{"technique":"Tn-Seq","count":1},{"technique":"expression profiling by array","count":1},{"technique":" Affinity Purification Mass Spectrometry (AP-MS)","count":1},{"technique":"Illumina metagenomic","count":1},{"technique":"Cytometry","count":1},{"technique":"MeDIP-seq","count":1},{"technique":"RNASeq","count":1},{"technique":"Clinical","count":1},{"technique":"animal data","count":1},{"technique":"Blood Chemistry Measurement","count":1},{"technique":"Microarray experiment","count":1},{"technique":"Microarray Analysis","count":1},{"technique":"Hi-C","count":1},{"technique":"IMMUNOLOGICAL ASSAY","count":1},{"technique":"Affinity Purification Mass Spectrometry (AP-MS)","count":1},{"technique":"GENOMICS","count":1},{"technique":"WGS","count":1},{"technique":"DNA Sequencing","count":1},{"technique":"Quantitative Reverse Transcriptase PCR","count":1},{"technique":"Illumina 16S","count":1},{"technique":"Fluorescence Activated Cell Sorting","count":1},{"technique":"Illumina transcriptomic","count":1},{"technique":"Microbiome analysis","count":1},{"technique":"metatranscriptome 16S","count":1},{"technique":"hemagglutination inhibition","count":1},{"technique":"HLA typing via sequencing","count":1},{"technique":"Whole Transcriptome Sequencing","count":1},{"technique":"flow cytometry","count":1},{"technique":"raw sequencing data for crispr ko screens","count":1},{"technique":"RNA Analysis","count":1},{"technique":"Rapid Antigen Test","count":1},{"technique":"metatranscriptomics","count":1},{"technique":"Blood Cell Count","count":1},{"technique":"microRtranscriptomics","count":1},{"technique":"Reduced-Representation Bisulfite Sequencing","count":1},{"technique":"Immunology","count":1},{"technique":"Proteomics","count":1},{"technique":"HLA sequencing","count":1},{"technique":"Protein Analysis","count":1},{"technique":"Protein Expression Analysis","count":1},{"technique":"Illumina meta-transcriptomic","count":1},{"technique":"Amplicon Sequencing (TRIP)","count":1}]

... which seems to mostly match R code to pull the data: image

flaneuse commented 2 years ago

Note: quick check of pathogens look like they agree...