DataBiosphere / azul

Metadata indexer and query service used for AnVIL, HCA, LungMAP, and CGP
Apache License 2.0
7 stars 2 forks source link

Many unit tests fail if `AZUL_DSS_QUERY_PREFIX` is set #3321

Closed jessebrennan closed 3 years ago

jessebrennan commented 3 years ago

I started a test run with make test but the log was greater than my terminal history. Here is the end of it, along with a few other commands showing the branch/commit and variable set.

----------------------------------------------------------------------
Ran 342 tests in 1001.689s

FAILED (failures=123, errors=40, skipped=12)
make: *** [Makefile:159: test] Error 1
(.venv) jesse@vader ~/gi/azul.2 (default)$ git show HEAD
commit b8e4a508ab3344b4090eac6e70891a65a6d6afc7 (HEAD -> default, github/develop, github/HEAD, dev.dcp2.gitlab/develop)
Merge: b270c3d5 a252a29d
Author: Daniel Sotirhos <dsotirho@ucsc.edu>
Date:   Thu Aug 12 16:40:37 2021 -0700

    Can new prod bundles for matrix test cases (#3192, PR #3261)

(.venv) jesse@vader ~/gi/azul.2 (default)$ env | grep DSS_QUERY_PREFIX
AZUL_DSS_QUERY_PREFIX=42
azul_env_vars=AZUL_CATALOGS,AZUL_DEBUG,AZUL_DSS_DIRECT_ACCESS,AZUL_DOMAIN_NAME,AZUL_DRS_DOMAIN_NAME,AZUL_SUBDOMAIN_TEMPLATE,AZUL_RESOURCE_PREFIX,AZUL_ES_DOMAIN,AZUL_SHARE_ES_DOMAIN,AZUL_INDEX_PREFIX,AZUL_ES_INSTANCE_TYPE,AZUL_ES_VOLUME_SIZE,AZUL_ES_TIMEOUT,AZUL_VERSIONED_BUCKET,AZUL_DSS_WORKERS,AZUL_TDR_WORKERS,AZUL_SUBSCRIBE_TO_DSS,AZUL_DEPLOYMENT_INCARNATION,AZUL_GOOGLE_SERVICE_ACCOUNT,AZUL_GOOGLE_SERVICE_ACCOUNT_PUBLIC,AZUL_CONTRIBUTION_CONCURRENCY,AZUL_AGGREGATION_CONCURRENCY,AZUL_S3_BUCKET,AZUL_URL_REDIRECT_BASE_DOMAIN_NAME,AZUL_URL_REDIRECT_FULL_DOMAIN_NAME,AZUL_ENABLE_MONITORING,AZUL_DSS_QUERY_PREFIX,azul_terraform_component,azul_github_project,azul_github_access_token,PYTHONPATH,MYPYPATH,TF_DATA_DIR,XDG_CONFIG_HOME,AZUL_DEPLOYMENT_STAGE,AZUL_TDR_SOURCES,AZUL_TDR_DCP2EBI_SOURCES,AZUL_TDR_IT2EBI_SOURCES,AZUL_TDR_LUNGMAP_SOURCES,AZUL_TDR_IT3LUNGMAP_SOURCES,AZUL_TDR_SERVICE_URL,AZUL_SAM_SERVICE_URL,AZUL_ES_INSTANCE_COUNT,AZUL_OWNER,AZUL_AWS_ACCOUNT_ID,AWS_DEFAULT_REGION,GOOGLE_PROJECT,AZUL_GOOGLE_OAUTH2_CLIENT_ID,AWS_PROFILE,GOOGLE_APPLICATION_CREDENTIALS,project_root

For the record, I was in the process of running test when Hannes made his comment.

hannes-ucsc commented 3 years ago

@jessebrennan to provide evidence 1) that this occurs on develop and 2) on which commit.

jessebrennan commented 3 years ago

One of the less opaque failures:

======================================================================
FAIL: test_indexing (indexer.test_hca_indexer.TestHCAIndexer)
Index a bundle and assert the index contents verbatim
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/home/jesse/gi/azul.2/test/indexer/test_hca_indexer.py", line 161, in test_indexing
    self.assertElasticsearchResultsEqual(expected_hits, hits)
  File "/home/jesse/gi/azul.2/test/es_test_case.py", line 74, in assertElasticsearchResultsEqual
    self.assertEqual(sort_frozen(freeze(first)), sort_frozen(freeze(second)))
AssertionError: Tuples differ: ((('_[4395 chars]test:')),)), ('total_estimated_cells', 1))), ([95851 chars]c'))) != ((('_[4395 chars]test:42')),)), ('total_estimated_cells', 1))),[95875 chars]c')))

First differing element 0:
(('_i[4394 chars]test:')),)), ('total_estimated_cells', 1))), ('_type', 'doc'))
(('_i[4394 chars]test:42')),)), ('total_estimated_cells', 1))),[13 chars]oc'))

  ((('_id', '0c5ac7c0-817e-40d4-b1b1-34c3d5cfecdb'),
    ('_index', 'azul_v2_dev_test_files_aggregate'),
    ('_score', 1.0),
    ('_source',
     (('bundles',
       ((('uuid', 'aaa96233-bf27-44c7-82df-b4dc15ad4d9d'),
         ('version', '2018-11-02T113344.698028Z')),)),
      ('contents',
       (('analysis_protocols', ()),
        ('cell_lines', ()),
        ('cell_suspensions',
         ((('biomaterial_id', ('GSM2172585 1',)),
           ('document_id', ('412898c5-5b9b-4907-b07c-e9b89666e204',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('selected_cell_type', ('~null',)),
           ('total_estimated_cells', 1),
           ('total_estimated_cells_', 1)),)),
        ('donors',
         ((('biological_sex', ('female',)),
           ('biomaterial_id', ('DID_scRSq06',)),
           ('development_stage', ('~null',)),
           ('diseases', ('normal',)),
           ('document_id', ('7b07b9d0-cc0e-4098-9f64-f4a569f7d746',)),
           ('donor_count', 1),
           ('donor_count_', 1),
           ('genus_species', ('Australopithecus',)),
           ('organism_age', ('38 year',)),
           ('organism_age_range',
            ((('gte', 1198368000.0), ('lte', 1198368000.0)),)),
           ('organism_age_unit', ('year',)),
           ('organism_age_value', ('38',))),)),
        ('files',
         ((('_type', 'file'),
           ('content-type', 'application/gzip; dcp-type=data'),
           ('content_description', ('~null',)),
           ('crc32c', '1d998e49'),
           ('document_id', '0c5ac7c0-817e-40d4-b1b1-34c3d5cfecdb'),
           ('drs_path',
            '7b07f99e-4a8a-4ad0-bd4f-db0d7a00c7bb?version=2018-11-02T113344.698028Z'),
           ('file_format', 'fastq.gz'),
           ('file_type', 'sequence_file'),
           ('indexed', 0),
           ('is_intermediate', 9223372036854774784),
           ('lane_index', 9223372036854774784),
           ('lane_index_', None),
           ('name', 'SRR3562915_1.fastq.gz'),
           ('read_index', 'read1'),
           ('related_files', ()),
           ('sha256',
            '77337cb51b2e584b5ae1b99db6c163b988cbc5b894dda2f5d22424978c3bfc7a'),
           ('size', 195142097),
           ('size_', 195142097),
           ('source', '~null'),
           ('uuid', '7b07f99e-4a8a-4ad0-bd4f-db0d7a00c7bb'),
           ('version', '2018-11-02T113344.698028Z')),)),
        ('imaging_protocols', ()),
        ('library_preparation_protocols',
         ((('library_construction_approach', ('Smart-seq2',)),
           ('nucleic_acid_source', ('single cell',))),)),
        ('organoids', ()),
        ('projects',
         ((('_type', ('project',)),
           ('array_express_accessions', ('~null',)),
           ('document_id', ('e8642221-4c2c-4fd7-b926-a68bce363c88',)),
           ('geo_series_accessions', ('~null',)),
           ('insdc_project_accessions', ('~null',)),
           ('insdc_study_accessions', ('~null',)),
           ('institutions', ('Farmers Trucks', 'University')),
           ('laboratory', ('John Dear',)),
           ('project_short_name', ('Single of human pancreas',)),
           ('project_title', ('Single cell transcriptome patterns.',)),
           ('publication_titles',
            ('Single-Cell Analysis of Human Pancreas Reveals Transcriptional '
             'Signatures of Aging and Somatic Mutation Patterns.',)),
           ('supplementary_links',
            ('https://www.ebi.ac.uk/gxa/sc/experiments/E-GEOD-81547/Results',))),)),
        ('sample_specimens',
         ((('_source', ('specimen_from_organism',)),
           ('_type', ('specimen',)),
           ('biomaterial_id', ('DID_scRSq06_pancreas',)),
           ('disease', ('normal',)),
           ('document_id', ('a21dc760-a500-4236-bcff-da34a0e873d2',)),
           ('has_input_biomaterial', ('~null',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', ('~null',)),
           ('storage_method', ('~null',))),)),
        ('samples',
         ((('biomaterial_id', ('DID_scRSq06_pancreas',)),
           ('document_id', ('a21dc760-a500-4236-bcff-da34a0e873d2',)),
           ('effective_organ', ('pancreas',)),
           ('entity_type', ('specimens',)),
           ('model_organ', ('~null',)),
           ('model_organ_part', ('~null',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',))),)),
        ('sequencing_inputs',
         ((('biomaterial_id', ('GSM2172585 1',)),
           ('document_id', ('412898c5-5b9b-4907-b07c-e9b89666e204',)),
           ('sequencing_input_type', ('cell_suspension',))),)),
        ('sequencing_processes',
         ((('document_id', ('771ddaf6-3a4f-4314-97fe-6294ff8e25a4',)),),)),
        ('sequencing_protocols',
         ((('instrument_manufacturer_model', ('Illumina NextSeq 500',)),
           ('paired_end', (1,))),)),
        ('specimens',
         ((('_source', ('specimen_from_organism',)),
           ('_type', ('specimen',)),
           ('biomaterial_id', ('DID_scRSq06_pancreas',)),
           ('disease', ('normal',)),
           ('document_id', ('a21dc760-a500-4236-bcff-da34a0e873d2',)),
           ('has_input_biomaterial', ('~null',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', ('~null',)),
           ('storage_method', ('~null',))),)))),
      ('entity_id', '0c5ac7c0-817e-40d4-b1b1-34c3d5cfecdb'),
      ('num_contributions', 1),
      ('sources',
-      ((('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:')),)),
+      ((('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:42')),)),
?                                                                       ++

      ('total_estimated_cells', 1))),
    ('_type', 'doc')),
   (('_id',
     '0c5ac7c0-817e-40d4-b1b1-34c3d5cfecdb_aaa96233-bf27-44c7-82df-b4dc15ad4d9d_2018-11-02T113344.698028Z_exists'),
    ('_index', 'azul_v2_dev_test_files'),
    ('_score', 1.0),
    ('_source',
     (('bundle_deleted', False),
      ('bundle_uuid', 'aaa96233-bf27-44c7-82df-b4dc15ad4d9d'),
      ('bundle_version', '2018-11-02T113344.698028Z'),
      ('contents',
       (('analysis_protocols', ()),
        ('cell_lines', ()),
        ('cell_suspensions',
         ((('biomaterial_id', 'GSM2172585 1'),
           ('document_id', '412898c5-5b9b-4907-b07c-e9b89666e204'),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('selected_cell_type', ('~null',)),
           ('total_estimated_cells', 1),
           ('total_estimated_cells_', 1)),)),
        ('donors',
         ((('biological_sex', 'female'),
           ('biomaterial_id', 'DID_scRSq06'),
           ('development_stage', '~null'),
           ('diseases', ('normal',)),
           ('document_id', '7b07b9d0-cc0e-4098-9f64-f4a569f7d746'),
           ('genus_species', ('Australopithecus',)),
           ('organism_age', '38 year'),
           ('organism_age_range', (('gte', 1198368000.0), ('lte', 1198368000.0))),
           ('organism_age_unit', 'year'),
           ('organism_age_value', '38')),)),
        ('files',
         ((('_type', 'file'),
           ('content-type', 'application/gzip; dcp-type=data'),
           ('content_description', ('~null',)),
           ('crc32c', '1d998e49'),
           ('document_id', '0c5ac7c0-817e-40d4-b1b1-34c3d5cfecdb'),
           ('drs_path',
            '7b07f99e-4a8a-4ad0-bd4f-db0d7a00c7bb?version=2018-11-02T113344.698028Z'),
           ('file_format', 'fastq.gz'),
           ('file_type', 'sequence_file'),
           ('indexed', 0),
           ('is_intermediate', 9223372036854774784),
           ('lane_index', 9223372036854774784),
           ('lane_index_', None),
           ('name', 'SRR3562915_1.fastq.gz'),
           ('read_index', 'read1'),
           ('related_files', ()),
           ('sha256',
            '77337cb51b2e584b5ae1b99db6c163b988cbc5b894dda2f5d22424978c3bfc7a'),
           ('size', 195142097),
           ('size_', 195142097),
           ('source', '~null'),
           ('uuid', '7b07f99e-4a8a-4ad0-bd4f-db0d7a00c7bb'),
           ('version', '2018-11-02T113344.698028Z')),)),
        ('imaging_protocols', ()),
        ('library_preparation_protocols',
         ((('document_id', '9c32cf70-3ed7-4720-badc-5ee71e8a38af'),
           ('library_construction_approach', 'Smart-seq2'),
           ('nucleic_acid_source', 'single cell')),)),
        ('organoids', ()),
        ('projects',
         ((('_type', 'project'),
           ('array_express_accessions', ('~null',)),
           ('contact_names', ('Laura,,Huerta', 'Martin, Enge', 'Matthew,,Green')),
           ('contributors',
            ((('contact_name', 'Laura,,Huerta'),
              ('corresponding_contributor', 0),
              ('email', 'lauhuema@ebi.ac.uk'),
              ('institution', 'Farmers Trucks'),
              ('laboratory', 'John Dear'),
              ('project_role', 'external curator')),
             (('contact_name', 'Martin, Enge'),
              ('corresponding_contributor', 9223372036854774784),
              ('email', 'martin.enge@gmail.com'),
              ('institution', 'University'),
              ('laboratory', '~null'),
              ('project_role', '~null')),
             (('contact_name', 'Matthew,,Green'),
              ('corresponding_contributor', 0),
              ('email', 'hewgreen@ebi.ac.uk'),
              ('institution', 'Farmers Trucks'),
              ('laboratory', 'John Dear'),
              ('project_role', 'Human Cell Atlas wrangler')))),
           ('document_id', 'e8642221-4c2c-4fd7-b926-a68bce363c88'),
           ('geo_series_accessions', ('~null',)),
           ('insdc_project_accessions', ('~null',)),
           ('insdc_study_accessions', ('~null',)),
           ('institutions', ('Farmers Trucks', 'University')),
           ('laboratory', ('John Dear',)),
           ('project_description',
            'As organisms age, cells accumulate genetic and epigenetic changes '
            'that eventually lead to impaired organ function or catastrophic '
            'failure such as cancer. Here we describe a single-cell '
            'transcriptome analysis of 2544 human pancreas cells from donors, '
            'spanning six decades of life. We find that islet cells from older '
            'donors have increased levels of disorder as measured both by noise '
            'in the transcriptome and by the number of cells which display '
            'inappropriate hormone expression, revealing a transcriptional '
            'instability associated with aging. By analyzing the spectrum of '
            'somatic mutations in single cells from previously-healthy donors, '
            'we find a specific age-dependent mutational signature characterized '
            'by C to A and C to G transversions, indicators of oxidative stress, '
            'which is absent in single cells from human brain tissue or in a '
            'tumor cell line. Cells carrying a high load of such mutations also '
            'express higher levels of stress and senescence markers, including '
            'FOS, JUN, and the cytoplasmic superoxide dismutase SOD1, markers '
            'previously linked to pancreatic diseases with substantial '
            'age-dependent risk, such as type 2 diabetes mellitus and '
            'adenocarcinoma. Thus, our single-cell approach unveils gene '
            'expression changes and somatic mutations acquired in aging human '
            'tissue, and identifies molecular pathways induced by these genetic '
            'changes that could influence human disease. Also, our results '
            'demonstrate the feasibility of using single-cell RNA-seq data from '
            'primary cells to derive meaningful insights into the genetic '
            'processes that operate on aging human tissue and to determine which '
            'molecular mechanisms are coordinated with these processes. '
            'Examination of single cells from primary human pancreas tissue'),
           ('project_short_name', 'Single of human pancreas'),
           ('project_title', 'Single cell transcriptome patterns.'),
           ('publication_titles',
            ('Single-Cell Analysis of Human Pancreas Reveals Transcriptional '
             'Signatures of Aging and Somatic Mutation Patterns.',)),
           ('publications',
            ((('publication_title',
               'Single-Cell Analysis of Human Pancreas Reveals Transcriptional '
               'Signatures of Aging and Somatic Mutation Patterns.'),
              ('publication_url',
               'https://www.ncbi.nlm.nih.gov/pubmed/28965763')),)),
           ('supplementary_links',
            ('https://www.ebi.ac.uk/gxa/sc/experiments/E-GEOD-81547/Results',))),)),
        ('sample_specimens',
         ((('_source', 'specimen_from_organism'),
           ('_type', 'specimen'),
           ('biomaterial_id', 'DID_scRSq06_pancreas'),
           ('disease', ('normal',)),
           ('document_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
           ('has_input_biomaterial', '~null'),
           ('organ', 'pancreas'),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', '~null'),
           ('storage_method', '~null')),)),
        ('samples',
         ((('biomaterial_id', 'DID_scRSq06_pancreas'),
           ('document_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
           ('effective_organ', 'pancreas'),
           ('entity_type', 'specimens'),
           ('model_organ', '~null'),
           ('model_organ_part', '~null'),
           ('organ', 'pancreas'),
           ('organ_part', ('islet of Langerhans',))),)),
        ('sequencing_inputs',
         ((('biomaterial_id', 'GSM2172585 1'),
           ('document_id', '412898c5-5b9b-4907-b07c-e9b89666e204'),
           ('sequencing_input_type', 'cell_suspension')),)),
        ('sequencing_processes',
         ((('document_id', '771ddaf6-3a4f-4314-97fe-6294ff8e25a4'),),)),
        ('sequencing_protocols',
         ((('document_id', '61e629ed-0135-4492-ac8a-5c4ab3ccca8a'),
           ('instrument_manufacturer_model', 'Illumina NextSeq 500'),
           ('paired_end', 1)),)),
        ('specimens',
         ((('_source', 'specimen_from_organism'),
           ('_type', 'specimen'),
           ('biomaterial_id', 'DID_scRSq06_pancreas'),
           ('disease', ('normal',)),
           ('document_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
           ('has_input_biomaterial', '~null'),
           ('organ', 'pancreas'),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', '~null'),
           ('storage_method', '~null')),)))),
      ('entity_id', '0c5ac7c0-817e-40d4-b1b1-34c3d5cfecdb'),
      ('source',
-      (('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:'))))),
+      (('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:42'))))),
?                                                                      ++

    ('_type', 'doc')),
   (('_id', '412898c5-5b9b-4907-b07c-e9b89666e204'),
    ('_index', 'azul_v2_dev_test_cell_suspensions_aggregate'),
    ('_score', 1.0),
    ('_source',
     (('bundles',
       ((('uuid', 'aaa96233-bf27-44c7-82df-b4dc15ad4d9d'),
         ('version', '2018-11-02T113344.698028Z')),)),
      ('contents',
       (('analysis_protocols', ()),
        ('cell_lines', ()),
        ('cell_suspensions',
         ((('biomaterial_id', 'GSM2172585 1'),
           ('document_id', '412898c5-5b9b-4907-b07c-e9b89666e204'),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('selected_cell_type', ('~null',)),
           ('total_estimated_cells', 1),
           ('total_estimated_cells_', 1)),)),
        ('donors',
         ((('biological_sex', ('female',)),
           ('biomaterial_id', ('DID_scRSq06',)),
           ('development_stage', ('~null',)),
           ('diseases', ('normal',)),
           ('document_id', ('7b07b9d0-cc0e-4098-9f64-f4a569f7d746',)),
           ('donor_count', 1),
           ('donor_count_', 1),
           ('genus_species', ('Australopithecus',)),
           ('organism_age', ('38 year',)),
           ('organism_age_range',
            ((('gte', 1198368000.0), ('lte', 1198368000.0)),)),
           ('organism_age_unit', ('year',)),
           ('organism_age_value', ('38',))),)),
        ('files',
         ((('content_description', ('~null',)),
           ('count', 2),
           ('file_format', 'fastq.gz'),
           ('is_intermediate', 9223372036854774784),
           ('matrix_cell_count', 9223372036854774784),
           ('matrix_cell_count_', None),
           ('size', 385472253),
           ('size_', 385472253),
           ('source', ('~null',))),)),
        ('imaging_protocols', ()),
        ('library_preparation_protocols',
         ((('library_construction_approach', ('Smart-seq2',)),
           ('nucleic_acid_source', ('single cell',))),)),
        ('organoids', ()),
        ('projects',
         ((('_type', ('project',)),
           ('array_express_accessions', ('~null',)),
           ('document_id', ('e8642221-4c2c-4fd7-b926-a68bce363c88',)),
           ('geo_series_accessions', ('~null',)),
           ('insdc_project_accessions', ('~null',)),
           ('insdc_study_accessions', ('~null',)),
           ('institutions', ('Farmers Trucks', 'University')),
           ('laboratory', ('John Dear',)),
           ('project_short_name', ('Single of human pancreas',)),
           ('project_title', ('Single cell transcriptome patterns.',)),
           ('publication_titles',
            ('Single-Cell Analysis of Human Pancreas Reveals Transcriptional '
             'Signatures of Aging and Somatic Mutation Patterns.',)),
           ('supplementary_links',
            ('https://www.ebi.ac.uk/gxa/sc/experiments/E-GEOD-81547/Results',))),)),
        ('sample_specimens',
         ((('_source', ('specimen_from_organism',)),
           ('_type', ('specimen',)),
           ('biomaterial_id', ('DID_scRSq06_pancreas',)),
           ('disease', ('normal',)),
           ('document_id', ('a21dc760-a500-4236-bcff-da34a0e873d2',)),
           ('has_input_biomaterial', ('~null',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', ('~null',)),
           ('storage_method', ('~null',))),)),
        ('samples',
         ((('biomaterial_id', ('DID_scRSq06_pancreas',)),
           ('document_id', ('a21dc760-a500-4236-bcff-da34a0e873d2',)),
           ('effective_organ', ('pancreas',)),
           ('entity_type', ('specimens',)),
           ('model_organ', ('~null',)),
           ('model_organ_part', ('~null',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',))),)),
        ('sequencing_inputs',
         ((('biomaterial_id', ('GSM2172585 1',)),
           ('document_id', ('412898c5-5b9b-4907-b07c-e9b89666e204',)),
           ('sequencing_input_type', ('cell_suspension',))),)),
        ('sequencing_processes',
         ((('document_id', ('771ddaf6-3a4f-4314-97fe-6294ff8e25a4',)),),)),
        ('sequencing_protocols',
         ((('instrument_manufacturer_model', ('Illumina NextSeq 500',)),
           ('paired_end', (1,))),)),
        ('specimens',
         ((('_source', ('specimen_from_organism',)),
           ('_type', ('specimen',)),
           ('biomaterial_id', ('DID_scRSq06_pancreas',)),
           ('disease', ('normal',)),
           ('document_id', ('a21dc760-a500-4236-bcff-da34a0e873d2',)),
           ('has_input_biomaterial', ('~null',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', ('~null',)),
           ('storage_method', ('~null',))),)))),
      ('entity_id', '412898c5-5b9b-4907-b07c-e9b89666e204'),
      ('num_contributions', 1),
      ('sources',
-      ((('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:')),)),
+      ((('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:42')),)),
?                                                                       ++

      ('total_estimated_cells', 1))),
    ('_type', 'doc')),
   (('_id',
     '412898c5-5b9b-4907-b07c-e9b89666e204_aaa96233-bf27-44c7-82df-b4dc15ad4d9d_2018-11-02T113344.698028Z_exists'),
    ('_index', 'azul_v2_dev_test_cell_suspensions'),
    ('_score', 1.0),
    ('_source',
     (('bundle_deleted', False),
      ('bundle_uuid', 'aaa96233-bf27-44c7-82df-b4dc15ad4d9d'),
      ('bundle_version', '2018-11-02T113344.698028Z'),
      ('contents',
       (('analysis_protocols', ()),
        ('cell_lines', ()),
        ('cell_suspensions',
         ((('biomaterial_id', 'GSM2172585 1'),
           ('document_id', '412898c5-5b9b-4907-b07c-e9b89666e204'),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('selected_cell_type', ('~null',)),
           ('total_estimated_cells', 1),
           ('total_estimated_cells_', 1)),)),
        ('donors',
         ((('biological_sex', 'female'),
           ('biomaterial_id', 'DID_scRSq06'),
           ('development_stage', '~null'),
           ('diseases', ('normal',)),
           ('document_id', '7b07b9d0-cc0e-4098-9f64-f4a569f7d746'),
           ('genus_species', ('Australopithecus',)),
           ('organism_age', '38 year'),
           ('organism_age_range', (('gte', 1198368000.0), ('lte', 1198368000.0))),
           ('organism_age_unit', 'year'),
           ('organism_age_value', '38')),)),
        ('files',
         ((('_type', 'file'),
           ('content-type', 'application/gzip; dcp-type=data'),
           ('content_description', ('~null',)),
           ('crc32c', '1d998e49'),
           ('document_id', '0c5ac7c0-817e-40d4-b1b1-34c3d5cfecdb'),
           ('drs_path',
            '7b07f99e-4a8a-4ad0-bd4f-db0d7a00c7bb?version=2018-11-02T113344.698028Z'),
           ('file_format', 'fastq.gz'),
           ('file_type', 'sequence_file'),
           ('indexed', 0),
           ('is_intermediate', 9223372036854774784),
           ('lane_index', 9223372036854774784),
           ('lane_index_', None),
           ('name', 'SRR3562915_1.fastq.gz'),
           ('read_index', 'read1'),
           ('related_files', ()),
           ('sha256',
            '77337cb51b2e584b5ae1b99db6c163b988cbc5b894dda2f5d22424978c3bfc7a'),
           ('size', 195142097),
           ('size_', 195142097),
           ('source', '~null'),
           ('uuid', '7b07f99e-4a8a-4ad0-bd4f-db0d7a00c7bb'),
           ('version', '2018-11-02T113344.698028Z')),
          (('_type', 'file'),
           ('content-type', 'application/gzip; dcp-type=data'),
           ('content_description', ('~null',)),
           ('crc32c', '54bb9c82'),
           ('document_id', '70d1af4a-82c8-478a-8960-e9028b3616ca'),
           ('drs_path',
            '74897eb7-0701-4e4f-9e6b-8b9521b2816b?version=2018-11-02T113344.450442Z'),
           ('file_format', 'fastq.gz'),
           ('file_type', 'sequence_file'),
           ('indexed', 0),
           ('is_intermediate', 9223372036854774784),
           ('lane_index', 9223372036854774784),
           ('lane_index_', None),
           ('name', 'SRR3562915_2.fastq.gz'),
           ('read_index', 'read2'),
           ('related_files', ()),
           ('sha256',
            '465a230aa127376fa641f8b8f8cad3f08fef37c8aafc67be454f0f0e4e63d68d'),
           ('size', 190330156),
           ('size_', 190330156),
           ('source', '~null'),
           ('uuid', '74897eb7-0701-4e4f-9e6b-8b9521b2816b'),
           ('version', '2018-11-02T113344.450442Z')))),
        ('imaging_protocols', ()),
        ('library_preparation_protocols',
         ((('document_id', '9c32cf70-3ed7-4720-badc-5ee71e8a38af'),
           ('library_construction_approach', 'Smart-seq2'),
           ('nucleic_acid_source', 'single cell')),)),
        ('organoids', ()),
        ('projects',
         ((('_type', 'project'),
           ('array_express_accessions', ('~null',)),
           ('contact_names', ('Laura,,Huerta', 'Martin, Enge', 'Matthew,,Green')),
           ('contributors',
            ((('contact_name', 'Laura,,Huerta'),
              ('corresponding_contributor', 0),
              ('email', 'lauhuema@ebi.ac.uk'),
              ('institution', 'Farmers Trucks'),
              ('laboratory', 'John Dear'),
              ('project_role', 'external curator')),
             (('contact_name', 'Martin, Enge'),
              ('corresponding_contributor', 9223372036854774784),
              ('email', 'martin.enge@gmail.com'),
              ('institution', 'University'),
              ('laboratory', '~null'),
              ('project_role', '~null')),
             (('contact_name', 'Matthew,,Green'),
              ('corresponding_contributor', 0),
              ('email', 'hewgreen@ebi.ac.uk'),
              ('institution', 'Farmers Trucks'),
              ('laboratory', 'John Dear'),
              ('project_role', 'Human Cell Atlas wrangler')))),
           ('document_id', 'e8642221-4c2c-4fd7-b926-a68bce363c88'),
           ('geo_series_accessions', ('~null',)),
           ('insdc_project_accessions', ('~null',)),
           ('insdc_study_accessions', ('~null',)),
           ('institutions', ('Farmers Trucks', 'University')),
           ('laboratory', ('John Dear',)),
           ('project_description',
            'As organisms age, cells accumulate genetic and epigenetic changes '
            'that eventually lead to impaired organ function or catastrophic '
            'failure such as cancer. Here we describe a single-cell '
            'transcriptome analysis of 2544 human pancreas cells from donors, '
            'spanning six decades of life. We find that islet cells from older '
            'donors have increased levels of disorder as measured both by noise '
            'in the transcriptome and by the number of cells which display '
            'inappropriate hormone expression, revealing a transcriptional '
            'instability associated with aging. By analyzing the spectrum of '
            'somatic mutations in single cells from previously-healthy donors, '
            'we find a specific age-dependent mutational signature characterized '
            'by C to A and C to G transversions, indicators of oxidative stress, '
            'which is absent in single cells from human brain tissue or in a '
            'tumor cell line. Cells carrying a high load of such mutations also '
            'express higher levels of stress and senescence markers, including '
            'FOS, JUN, and the cytoplasmic superoxide dismutase SOD1, markers '
            'previously linked to pancreatic diseases with substantial '
            'age-dependent risk, such as type 2 diabetes mellitus and '
            'adenocarcinoma. Thus, our single-cell approach unveils gene '
            'expression changes and somatic mutations acquired in aging human '
            'tissue, and identifies molecular pathways induced by these genetic '
            'changes that could influence human disease. Also, our results '
            'demonstrate the feasibility of using single-cell RNA-seq data from '
            'primary cells to derive meaningful insights into the genetic '
            'processes that operate on aging human tissue and to determine which '
            'molecular mechanisms are coordinated with these processes. '
            'Examination of single cells from primary human pancreas tissue'),
           ('project_short_name', 'Single of human pancreas'),
           ('project_title', 'Single cell transcriptome patterns.'),
           ('publication_titles',
            ('Single-Cell Analysis of Human Pancreas Reveals Transcriptional '
             'Signatures of Aging and Somatic Mutation Patterns.',)),
           ('publications',
            ((('publication_title',
               'Single-Cell Analysis of Human Pancreas Reveals Transcriptional '
               'Signatures of Aging and Somatic Mutation Patterns.'),
              ('publication_url',
               'https://www.ncbi.nlm.nih.gov/pubmed/28965763')),)),
           ('supplementary_links',
            ('https://www.ebi.ac.uk/gxa/sc/experiments/E-GEOD-81547/Results',))),)),
        ('sample_specimens',
         ((('_source', 'specimen_from_organism'),
           ('_type', 'specimen'),
           ('biomaterial_id', 'DID_scRSq06_pancreas'),
           ('disease', ('normal',)),
           ('document_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
           ('has_input_biomaterial', '~null'),
           ('organ', 'pancreas'),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', '~null'),
           ('storage_method', '~null')),)),
        ('samples',
         ((('biomaterial_id', 'DID_scRSq06_pancreas'),
           ('document_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
           ('effective_organ', 'pancreas'),
           ('entity_type', 'specimens'),
           ('model_organ', '~null'),
           ('model_organ_part', '~null'),
           ('organ', 'pancreas'),
           ('organ_part', ('islet of Langerhans',))),)),
        ('sequencing_inputs',
         ((('biomaterial_id', 'GSM2172585 1'),
           ('document_id', '412898c5-5b9b-4907-b07c-e9b89666e204'),
           ('sequencing_input_type', 'cell_suspension')),)),
        ('sequencing_processes',
         ((('document_id', '771ddaf6-3a4f-4314-97fe-6294ff8e25a4'),),)),
        ('sequencing_protocols',
         ((('document_id', '61e629ed-0135-4492-ac8a-5c4ab3ccca8a'),
           ('instrument_manufacturer_model', 'Illumina NextSeq 500'),
           ('paired_end', 1)),)),
        ('specimens',
         ((('_source', 'specimen_from_organism'),
           ('_type', 'specimen'),
           ('biomaterial_id', 'DID_scRSq06_pancreas'),
           ('disease', ('normal',)),
           ('document_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
           ('has_input_biomaterial', '~null'),
           ('organ', 'pancreas'),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', '~null'),
           ('storage_method', '~null')),)))),
      ('entity_id', '412898c5-5b9b-4907-b07c-e9b89666e204'),
      ('source',
-      (('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:'))))),
+      (('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:42'))))),
?                                                                      ++

    ('_type', 'doc')),
   (('_id', '70d1af4a-82c8-478a-8960-e9028b3616ca'),
    ('_index', 'azul_v2_dev_test_files_aggregate'),
    ('_score', 1.0),
    ('_source',
     (('bundles',
       ((('uuid', 'aaa96233-bf27-44c7-82df-b4dc15ad4d9d'),
         ('version', '2018-11-02T113344.698028Z')),)),
      ('contents',
       (('analysis_protocols', ()),
        ('cell_lines', ()),
        ('cell_suspensions',
         ((('biomaterial_id', ('GSM2172585 1',)),
           ('document_id', ('412898c5-5b9b-4907-b07c-e9b89666e204',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('selected_cell_type', ('~null',)),
           ('total_estimated_cells', 1),
           ('total_estimated_cells_', 1)),)),
        ('donors',
         ((('biological_sex', ('female',)),
           ('biomaterial_id', ('DID_scRSq06',)),
           ('development_stage', ('~null',)),
           ('diseases', ('normal',)),
           ('document_id', ('7b07b9d0-cc0e-4098-9f64-f4a569f7d746',)),
           ('donor_count', 1),
           ('donor_count_', 1),
           ('genus_species', ('Australopithecus',)),
           ('organism_age', ('38 year',)),
           ('organism_age_range',
            ((('gte', 1198368000.0), ('lte', 1198368000.0)),)),
           ('organism_age_unit', ('year',)),
           ('organism_age_value', ('38',))),)),
        ('files',
         ((('_type', 'file'),
           ('content-type', 'application/gzip; dcp-type=data'),
           ('content_description', ('~null',)),
           ('crc32c', '54bb9c82'),
           ('document_id', '70d1af4a-82c8-478a-8960-e9028b3616ca'),
           ('drs_path',
            '74897eb7-0701-4e4f-9e6b-8b9521b2816b?version=2018-11-02T113344.450442Z'),
           ('file_format', 'fastq.gz'),
           ('file_type', 'sequence_file'),
           ('indexed', 0),
           ('is_intermediate', 9223372036854774784),
           ('lane_index', 9223372036854774784),
           ('lane_index_', None),
           ('name', 'SRR3562915_2.fastq.gz'),
           ('read_index', 'read2'),
           ('related_files', ()),
           ('sha256',
            '465a230aa127376fa641f8b8f8cad3f08fef37c8aafc67be454f0f0e4e63d68d'),
           ('size', 190330156),
           ('size_', 190330156),
           ('source', '~null'),
           ('uuid', '74897eb7-0701-4e4f-9e6b-8b9521b2816b'),
           ('version', '2018-11-02T113344.450442Z')),)),
        ('imaging_protocols', ()),
        ('library_preparation_protocols',
         ((('library_construction_approach', ('Smart-seq2',)),
           ('nucleic_acid_source', ('single cell',))),)),
        ('organoids', ()),
        ('projects',
         ((('_type', ('project',)),
           ('array_express_accessions', ('~null',)),
           ('document_id', ('e8642221-4c2c-4fd7-b926-a68bce363c88',)),
           ('geo_series_accessions', ('~null',)),
           ('insdc_project_accessions', ('~null',)),
           ('insdc_study_accessions', ('~null',)),
           ('institutions', ('Farmers Trucks', 'University')),
           ('laboratory', ('John Dear',)),
           ('project_short_name', ('Single of human pancreas',)),
           ('project_title', ('Single cell transcriptome patterns.',)),
           ('publication_titles',
            ('Single-Cell Analysis of Human Pancreas Reveals Transcriptional '
             'Signatures of Aging and Somatic Mutation Patterns.',)),
           ('supplementary_links',
            ('https://www.ebi.ac.uk/gxa/sc/experiments/E-GEOD-81547/Results',))),)),
        ('sample_specimens',
         ((('_source', ('specimen_from_organism',)),
           ('_type', ('specimen',)),
           ('biomaterial_id', ('DID_scRSq06_pancreas',)),
           ('disease', ('normal',)),
           ('document_id', ('a21dc760-a500-4236-bcff-da34a0e873d2',)),
           ('has_input_biomaterial', ('~null',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', ('~null',)),
           ('storage_method', ('~null',))),)),
        ('samples',
         ((('biomaterial_id', ('DID_scRSq06_pancreas',)),
           ('document_id', ('a21dc760-a500-4236-bcff-da34a0e873d2',)),
           ('effective_organ', ('pancreas',)),
           ('entity_type', ('specimens',)),
           ('model_organ', ('~null',)),
           ('model_organ_part', ('~null',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',))),)),
        ('sequencing_inputs',
         ((('biomaterial_id', ('GSM2172585 1',)),
           ('document_id', ('412898c5-5b9b-4907-b07c-e9b89666e204',)),
           ('sequencing_input_type', ('cell_suspension',))),)),
        ('sequencing_processes',
         ((('document_id', ('771ddaf6-3a4f-4314-97fe-6294ff8e25a4',)),),)),
        ('sequencing_protocols',
         ((('instrument_manufacturer_model', ('Illumina NextSeq 500',)),
           ('paired_end', (1,))),)),
        ('specimens',
         ((('_source', ('specimen_from_organism',)),
           ('_type', ('specimen',)),
           ('biomaterial_id', ('DID_scRSq06_pancreas',)),
           ('disease', ('normal',)),
           ('document_id', ('a21dc760-a500-4236-bcff-da34a0e873d2',)),
           ('has_input_biomaterial', ('~null',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', ('~null',)),
           ('storage_method', ('~null',))),)))),
      ('entity_id', '70d1af4a-82c8-478a-8960-e9028b3616ca'),
      ('num_contributions', 1),
      ('sources',
-      ((('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:')),)),
+      ((('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:42')),)),
?                                                                       ++

      ('total_estimated_cells', 1))),
    ('_type', 'doc')),
   (('_id',
     '70d1af4a-82c8-478a-8960-e9028b3616ca_aaa96233-bf27-44c7-82df-b4dc15ad4d9d_2018-11-02T113344.698028Z_exists'),
    ('_index', 'azul_v2_dev_test_files'),
    ('_score', 1.0),
    ('_source',
     (('bundle_deleted', False),
      ('bundle_uuid', 'aaa96233-bf27-44c7-82df-b4dc15ad4d9d'),
      ('bundle_version', '2018-11-02T113344.698028Z'),
      ('contents',
       (('analysis_protocols', ()),
        ('cell_lines', ()),
        ('cell_suspensions',
         ((('biomaterial_id', 'GSM2172585 1'),
           ('document_id', '412898c5-5b9b-4907-b07c-e9b89666e204'),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('selected_cell_type', ('~null',)),
           ('total_estimated_cells', 1),
           ('total_estimated_cells_', 1)),)),
        ('donors',
         ((('biological_sex', 'female'),
           ('biomaterial_id', 'DID_scRSq06'),
           ('development_stage', '~null'),
           ('diseases', ('normal',)),
           ('document_id', '7b07b9d0-cc0e-4098-9f64-f4a569f7d746'),
           ('genus_species', ('Australopithecus',)),
           ('organism_age', '38 year'),
           ('organism_age_range', (('gte', 1198368000.0), ('lte', 1198368000.0))),
           ('organism_age_unit', 'year'),
           ('organism_age_value', '38')),)),
        ('files',
         ((('_type', 'file'),
           ('content-type', 'application/gzip; dcp-type=data'),
           ('content_description', ('~null',)),
           ('crc32c', '54bb9c82'),
           ('document_id', '70d1af4a-82c8-478a-8960-e9028b3616ca'),
           ('drs_path',
            '74897eb7-0701-4e4f-9e6b-8b9521b2816b?version=2018-11-02T113344.450442Z'),
           ('file_format', 'fastq.gz'),
           ('file_type', 'sequence_file'),
           ('indexed', 0),
           ('is_intermediate', 9223372036854774784),
           ('lane_index', 9223372036854774784),
           ('lane_index_', None),
           ('name', 'SRR3562915_2.fastq.gz'),
           ('read_index', 'read2'),
           ('related_files', ()),
           ('sha256',
            '465a230aa127376fa641f8b8f8cad3f08fef37c8aafc67be454f0f0e4e63d68d'),
           ('size', 190330156),
           ('size_', 190330156),
           ('source', '~null'),
           ('uuid', '74897eb7-0701-4e4f-9e6b-8b9521b2816b'),
           ('version', '2018-11-02T113344.450442Z')),)),
        ('imaging_protocols', ()),
        ('library_preparation_protocols',
         ((('document_id', '9c32cf70-3ed7-4720-badc-5ee71e8a38af'),
           ('library_construction_approach', 'Smart-seq2'),
           ('nucleic_acid_source', 'single cell')),)),
        ('organoids', ()),
        ('projects',
         ((('_type', 'project'),
           ('array_express_accessions', ('~null',)),
           ('contact_names', ('Laura,,Huerta', 'Martin, Enge', 'Matthew,,Green')),
           ('contributors',
            ((('contact_name', 'Laura,,Huerta'),
              ('corresponding_contributor', 0),
              ('email', 'lauhuema@ebi.ac.uk'),
              ('institution', 'Farmers Trucks'),
              ('laboratory', 'John Dear'),
              ('project_role', 'external curator')),
             (('contact_name', 'Martin, Enge'),
              ('corresponding_contributor', 9223372036854774784),
              ('email', 'martin.enge@gmail.com'),
              ('institution', 'University'),
              ('laboratory', '~null'),
              ('project_role', '~null')),
             (('contact_name', 'Matthew,,Green'),
              ('corresponding_contributor', 0),
              ('email', 'hewgreen@ebi.ac.uk'),
              ('institution', 'Farmers Trucks'),
              ('laboratory', 'John Dear'),
              ('project_role', 'Human Cell Atlas wrangler')))),
           ('document_id', 'e8642221-4c2c-4fd7-b926-a68bce363c88'),
           ('geo_series_accessions', ('~null',)),
           ('insdc_project_accessions', ('~null',)),
           ('insdc_study_accessions', ('~null',)),
           ('institutions', ('Farmers Trucks', 'University')),
           ('laboratory', ('John Dear',)),
           ('project_description',
            'As organisms age, cells accumulate genetic and epigenetic changes '
            'that eventually lead to impaired organ function or catastrophic '
            'failure such as cancer. Here we describe a single-cell '
            'transcriptome analysis of 2544 human pancreas cells from donors, '
            'spanning six decades of life. We find that islet cells from older '
            'donors have increased levels of disorder as measured both by noise '
            'in the transcriptome and by the number of cells which display '
            'inappropriate hormone expression, revealing a transcriptional '
            'instability associated with aging. By analyzing the spectrum of '
            'somatic mutations in single cells from previously-healthy donors, '
            'we find a specific age-dependent mutational signature characterized '
            'by C to A and C to G transversions, indicators of oxidative stress, '
            'which is absent in single cells from human brain tissue or in a '
            'tumor cell line. Cells carrying a high load of such mutations also '
            'express higher levels of stress and senescence markers, including '
            'FOS, JUN, and the cytoplasmic superoxide dismutase SOD1, markers '
            'previously linked to pancreatic diseases with substantial '
            'age-dependent risk, such as type 2 diabetes mellitus and '
            'adenocarcinoma. Thus, our single-cell approach unveils gene '
            'expression changes and somatic mutations acquired in aging human '
            'tissue, and identifies molecular pathways induced by these genetic '
            'changes that could influence human disease. Also, our results '
            'demonstrate the feasibility of using single-cell RNA-seq data from '
            'primary cells to derive meaningful insights into the genetic '
            'processes that operate on aging human tissue and to determine which '
            'molecular mechanisms are coordinated with these processes. '
            'Examination of single cells from primary human pancreas tissue'),
           ('project_short_name', 'Single of human pancreas'),
           ('project_title', 'Single cell transcriptome patterns.'),
           ('publication_titles',
            ('Single-Cell Analysis of Human Pancreas Reveals Transcriptional '
             'Signatures of Aging and Somatic Mutation Patterns.',)),
           ('publications',
            ((('publication_title',
               'Single-Cell Analysis of Human Pancreas Reveals Transcriptional '
               'Signatures of Aging and Somatic Mutation Patterns.'),
              ('publication_url',
               'https://www.ncbi.nlm.nih.gov/pubmed/28965763')),)),
           ('supplementary_links',
            ('https://www.ebi.ac.uk/gxa/sc/experiments/E-GEOD-81547/Results',))),)),
        ('sample_specimens',
         ((('_source', 'specimen_from_organism'),
           ('_type', 'specimen'),
           ('biomaterial_id', 'DID_scRSq06_pancreas'),
           ('disease', ('normal',)),
           ('document_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
           ('has_input_biomaterial', '~null'),
           ('organ', 'pancreas'),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', '~null'),
           ('storage_method', '~null')),)),
        ('samples',
         ((('biomaterial_id', 'DID_scRSq06_pancreas'),
           ('document_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
           ('effective_organ', 'pancreas'),
           ('entity_type', 'specimens'),
           ('model_organ', '~null'),
           ('model_organ_part', '~null'),
           ('organ', 'pancreas'),
           ('organ_part', ('islet of Langerhans',))),)),
        ('sequencing_inputs',
         ((('biomaterial_id', 'GSM2172585 1'),
           ('document_id', '412898c5-5b9b-4907-b07c-e9b89666e204'),
           ('sequencing_input_type', 'cell_suspension')),)),
        ('sequencing_processes',
         ((('document_id', '771ddaf6-3a4f-4314-97fe-6294ff8e25a4'),),)),
        ('sequencing_protocols',
         ((('document_id', '61e629ed-0135-4492-ac8a-5c4ab3ccca8a'),
           ('instrument_manufacturer_model', 'Illumina NextSeq 500'),
           ('paired_end', 1)),)),
        ('specimens',
         ((('_source', 'specimen_from_organism'),
           ('_type', 'specimen'),
           ('biomaterial_id', 'DID_scRSq06_pancreas'),
           ('disease', ('normal',)),
           ('document_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
           ('has_input_biomaterial', '~null'),
           ('organ', 'pancreas'),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', '~null'),
           ('storage_method', '~null')),)))),
      ('entity_id', '70d1af4a-82c8-478a-8960-e9028b3616ca'),
      ('source',
-      (('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:'))))),
+      (('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:42'))))),
?                                                                      ++

    ('_type', 'doc')),
   (('_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
    ('_index', 'azul_v2_dev_test_samples_aggregate'),
    ('_score', 1.0),
    ('_source',
     (('bundles',
       ((('uuid', 'aaa96233-bf27-44c7-82df-b4dc15ad4d9d'),
         ('version', '2018-11-02T113344.698028Z')),)),
      ('contents',
       (('analysis_protocols', ()),
        ('cell_lines', ()),
        ('cell_suspensions',
         ((('biomaterial_id', ('GSM2172585 1',)),
           ('document_id', ('412898c5-5b9b-4907-b07c-e9b89666e204',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('selected_cell_type', ('~null',)),
           ('total_estimated_cells', 1),
           ('total_estimated_cells_', 1)),)),
        ('donors',
         ((('biological_sex', ('female',)),
           ('biomaterial_id', ('DID_scRSq06',)),
           ('development_stage', ('~null',)),
           ('diseases', ('normal',)),
           ('document_id', ('7b07b9d0-cc0e-4098-9f64-f4a569f7d746',)),
           ('donor_count', 1),
           ('donor_count_', 1),
           ('genus_species', ('Australopithecus',)),
           ('organism_age', ('38 year',)),
           ('organism_age_range',
            ((('gte', 1198368000.0), ('lte', 1198368000.0)),)),
           ('organism_age_unit', ('year',)),
           ('organism_age_value', ('38',))),)),
        ('files',
         ((('content_description', ('~null',)),
           ('count', 2),
           ('file_format', 'fastq.gz'),
           ('is_intermediate', 9223372036854774784),
           ('matrix_cell_count', 9223372036854774784),
           ('matrix_cell_count_', None),
           ('size', 385472253),
           ('size_', 385472253),
           ('source', ('~null',))),)),
        ('imaging_protocols', ()),
        ('library_preparation_protocols',
         ((('library_construction_approach', ('Smart-seq2',)),
           ('nucleic_acid_source', ('single cell',))),)),
        ('organoids', ()),
        ('projects',
         ((('_type', ('project',)),
           ('array_express_accessions', ('~null',)),
           ('document_id', ('e8642221-4c2c-4fd7-b926-a68bce363c88',)),
           ('geo_series_accessions', ('~null',)),
           ('insdc_project_accessions', ('~null',)),
           ('insdc_study_accessions', ('~null',)),
           ('institutions', ('Farmers Trucks', 'University')),
           ('laboratory', ('John Dear',)),
           ('project_short_name', ('Single of human pancreas',)),
           ('project_title', ('Single cell transcriptome patterns.',)),
           ('publication_titles',
            ('Single-Cell Analysis of Human Pancreas Reveals Transcriptional '
             'Signatures of Aging and Somatic Mutation Patterns.',)),
           ('supplementary_links',
            ('https://www.ebi.ac.uk/gxa/sc/experiments/E-GEOD-81547/Results',))),)),
        ('sample_specimens',
         ((('_source', 'specimen_from_organism'),
           ('_type', 'specimen'),
           ('biomaterial_id', 'DID_scRSq06_pancreas'),
           ('disease', ('normal',)),
           ('document_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
           ('has_input_biomaterial', '~null'),
           ('organ', 'pancreas'),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', '~null'),
           ('storage_method', '~null')),)),
        ('samples',
         ((('biomaterial_id', 'DID_scRSq06_pancreas'),
           ('document_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
           ('effective_organ', 'pancreas'),
           ('entity_type', 'specimens'),
           ('model_organ', '~null'),
           ('model_organ_part', '~null'),
           ('organ', 'pancreas'),
           ('organ_part', ('islet of Langerhans',))),)),
        ('sequencing_inputs',
         ((('biomaterial_id', ('GSM2172585 1',)),
           ('document_id', ('412898c5-5b9b-4907-b07c-e9b89666e204',)),
           ('sequencing_input_type', ('cell_suspension',))),)),
        ('sequencing_processes',
         ((('document_id', ('771ddaf6-3a4f-4314-97fe-6294ff8e25a4',)),),)),
        ('sequencing_protocols',
         ((('instrument_manufacturer_model', ('Illumina NextSeq 500',)),
           ('paired_end', (1,))),)),
        ('specimens',
         ((('_source', ('specimen_from_organism',)),
           ('_type', ('specimen',)),
           ('biomaterial_id', ('DID_scRSq06_pancreas',)),
           ('disease', ('normal',)),
           ('document_id', ('a21dc760-a500-4236-bcff-da34a0e873d2',)),
           ('has_input_biomaterial', ('~null',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', ('~null',)),
           ('storage_method', ('~null',))),)))),
      ('entity_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
      ('num_contributions', 1),
      ('sources',
-      ((('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:')),)),
+      ((('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:42')),)),
?                                                                       ++

      ('total_estimated_cells', 1))),
    ('_type', 'doc')),
   (('_id',
     'a21dc760-a500-4236-bcff-da34a0e873d2_aaa96233-bf27-44c7-82df-b4dc15ad4d9d_2018-11-02T113344.698028Z_exists'),
    ('_index', 'azul_v2_dev_test_samples'),
    ('_score', 1.0),
    ('_source',
     (('bundle_deleted', False),
      ('bundle_uuid', 'aaa96233-bf27-44c7-82df-b4dc15ad4d9d'),
      ('bundle_version', '2018-11-02T113344.698028Z'),
      ('contents',
       (('analysis_protocols', ()),
        ('cell_lines', ()),
        ('cell_suspensions',
         ((('biomaterial_id', 'GSM2172585 1'),
           ('document_id', '412898c5-5b9b-4907-b07c-e9b89666e204'),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('selected_cell_type', ('~null',)),
           ('total_estimated_cells', 1),
           ('total_estimated_cells_', 1)),)),
        ('donors',
         ((('biological_sex', 'female'),
           ('biomaterial_id', 'DID_scRSq06'),
           ('development_stage', '~null'),
           ('diseases', ('normal',)),
           ('document_id', '7b07b9d0-cc0e-4098-9f64-f4a569f7d746'),
           ('genus_species', ('Australopithecus',)),
           ('organism_age', '38 year'),
           ('organism_age_range', (('gte', 1198368000.0), ('lte', 1198368000.0))),
           ('organism_age_unit', 'year'),
           ('organism_age_value', '38')),)),
        ('files',
         ((('_type', 'file'),
           ('content-type', 'application/gzip; dcp-type=data'),
           ('content_description', ('~null',)),
           ('crc32c', '1d998e49'),
           ('document_id', '0c5ac7c0-817e-40d4-b1b1-34c3d5cfecdb'),
           ('drs_path',
            '7b07f99e-4a8a-4ad0-bd4f-db0d7a00c7bb?version=2018-11-02T113344.698028Z'),
           ('file_format', 'fastq.gz'),
           ('file_type', 'sequence_file'),
           ('indexed', 0),
           ('is_intermediate', 9223372036854774784),
           ('lane_index', 9223372036854774784),
           ('lane_index_', None),
           ('name', 'SRR3562915_1.fastq.gz'),
           ('read_index', 'read1'),
           ('related_files', ()),
           ('sha256',
            '77337cb51b2e584b5ae1b99db6c163b988cbc5b894dda2f5d22424978c3bfc7a'),
           ('size', 195142097),
           ('size_', 195142097),
           ('source', '~null'),
           ('uuid', '7b07f99e-4a8a-4ad0-bd4f-db0d7a00c7bb'),
           ('version', '2018-11-02T113344.698028Z')),
          (('_type', 'file'),
           ('content-type', 'application/gzip; dcp-type=data'),
           ('content_description', ('~null',)),
           ('crc32c', '54bb9c82'),
           ('document_id', '70d1af4a-82c8-478a-8960-e9028b3616ca'),
           ('drs_path',
            '74897eb7-0701-4e4f-9e6b-8b9521b2816b?version=2018-11-02T113344.450442Z'),
           ('file_format', 'fastq.gz'),
           ('file_type', 'sequence_file'),
           ('indexed', 0),
           ('is_intermediate', 9223372036854774784),
           ('lane_index', 9223372036854774784),
           ('lane_index_', None),
           ('name', 'SRR3562915_2.fastq.gz'),
           ('read_index', 'read2'),
           ('related_files', ()),
           ('sha256',
            '465a230aa127376fa641f8b8f8cad3f08fef37c8aafc67be454f0f0e4e63d68d'),
           ('size', 190330156),
           ('size_', 190330156),
           ('source', '~null'),
           ('uuid', '74897eb7-0701-4e4f-9e6b-8b9521b2816b'),
           ('version', '2018-11-02T113344.450442Z')))),
        ('imaging_protocols', ()),
        ('library_preparation_protocols',
         ((('document_id', '9c32cf70-3ed7-4720-badc-5ee71e8a38af'),
           ('library_construction_approach', 'Smart-seq2'),
           ('nucleic_acid_source', 'single cell')),)),
        ('organoids', ()),
        ('projects',
         ((('_type', 'project'),
           ('array_express_accessions', ('~null',)),
           ('contact_names', ('Laura,,Huerta', 'Martin, Enge', 'Matthew,,Green')),
           ('contributors',
            ((('contact_name', 'Laura,,Huerta'),
              ('corresponding_contributor', 0),
              ('email', 'lauhuema@ebi.ac.uk'),
              ('institution', 'Farmers Trucks'),
              ('laboratory', 'John Dear'),
              ('project_role', 'external curator')),
             (('contact_name', 'Martin, Enge'),
              ('corresponding_contributor', 9223372036854774784),
              ('email', 'martin.enge@gmail.com'),
              ('institution', 'University'),
              ('laboratory', '~null'),
              ('project_role', '~null')),
             (('contact_name', 'Matthew,,Green'),
              ('corresponding_contributor', 0),
              ('email', 'hewgreen@ebi.ac.uk'),
              ('institution', 'Farmers Trucks'),
              ('laboratory', 'John Dear'),
              ('project_role', 'Human Cell Atlas wrangler')))),
           ('document_id', 'e8642221-4c2c-4fd7-b926-a68bce363c88'),
           ('geo_series_accessions', ('~null',)),
           ('insdc_project_accessions', ('~null',)),
           ('insdc_study_accessions', ('~null',)),
           ('institutions', ('Farmers Trucks', 'University')),
           ('laboratory', ('John Dear',)),
           ('project_description',
            'As organisms age, cells accumulate genetic and epigenetic changes '
            'that eventually lead to impaired organ function or catastrophic '
            'failure such as cancer. Here we describe a single-cell '
            'transcriptome analysis of 2544 human pancreas cells from donors, '
            'spanning six decades of life. We find that islet cells from older '
            'donors have increased levels of disorder as measured both by noise '
            'in the transcriptome and by the number of cells which display '
            'inappropriate hormone expression, revealing a transcriptional '
            'instability associated with aging. By analyzing the spectrum of '
            'somatic mutations in single cells from previously-healthy donors, '
            'we find a specific age-dependent mutational signature characterized '
            'by C to A and C to G transversions, indicators of oxidative stress, '
            'which is absent in single cells from human brain tissue or in a '
            'tumor cell line. Cells carrying a high load of such mutations also '
            'express higher levels of stress and senescence markers, including '
            'FOS, JUN, and the cytoplasmic superoxide dismutase SOD1, markers '
            'previously linked to pancreatic diseases with substantial '
            'age-dependent risk, such as type 2 diabetes mellitus and '
            'adenocarcinoma. Thus, our single-cell approach unveils gene '
            'expression changes and somatic mutations acquired in aging human '
            'tissue, and identifies molecular pathways induced by these genetic '
            'changes that could influence human disease. Also, our results '
            'demonstrate the feasibility of using single-cell RNA-seq data from '
            'primary cells to derive meaningful insights into the genetic '
            'processes that operate on aging human tissue and to determine which '
            'molecular mechanisms are coordinated with these processes. '
            'Examination of single cells from primary human pancreas tissue'),
           ('project_short_name', 'Single of human pancreas'),
           ('project_title', 'Single cell transcriptome patterns.'),
           ('publication_titles',
            ('Single-Cell Analysis of Human Pancreas Reveals Transcriptional '
             'Signatures of Aging and Somatic Mutation Patterns.',)),
           ('publications',
            ((('publication_title',
               'Single-Cell Analysis of Human Pancreas Reveals Transcriptional '
               'Signatures of Aging and Somatic Mutation Patterns.'),
              ('publication_url',
               'https://www.ncbi.nlm.nih.gov/pubmed/28965763')),)),
           ('supplementary_links',
            ('https://www.ebi.ac.uk/gxa/sc/experiments/E-GEOD-81547/Results',))),)),
        ('sample_specimens',
         ((('_source', 'specimen_from_organism'),
           ('_type', 'specimen'),
           ('biomaterial_id', 'DID_scRSq06_pancreas'),
           ('disease', ('normal',)),
           ('document_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
           ('has_input_biomaterial', '~null'),
           ('organ', 'pancreas'),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', '~null'),
           ('storage_method', '~null')),)),
        ('samples',
         ((('biomaterial_id', 'DID_scRSq06_pancreas'),
           ('document_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
           ('effective_organ', 'pancreas'),
           ('entity_type', 'specimens'),
           ('model_organ', '~null'),
           ('model_organ_part', '~null'),
           ('organ', 'pancreas'),
           ('organ_part', ('islet of Langerhans',))),)),
        ('sequencing_inputs',
         ((('biomaterial_id', 'GSM2172585 1'),
           ('document_id', '412898c5-5b9b-4907-b07c-e9b89666e204'),
           ('sequencing_input_type', 'cell_suspension')),)),
        ('sequencing_processes',
         ((('document_id', '771ddaf6-3a4f-4314-97fe-6294ff8e25a4'),),)),
        ('sequencing_protocols',
         ((('document_id', '61e629ed-0135-4492-ac8a-5c4ab3ccca8a'),
           ('instrument_manufacturer_model', 'Illumina NextSeq 500'),
           ('paired_end', 1)),)),
        ('specimens',
         ((('_source', 'specimen_from_organism'),
           ('_type', 'specimen'),
           ('biomaterial_id', 'DID_scRSq06_pancreas'),
           ('disease', ('normal',)),
           ('document_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
           ('has_input_biomaterial', '~null'),
           ('organ', 'pancreas'),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', '~null'),
           ('storage_method', '~null')),)))),
      ('entity_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
      ('source',
-      (('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:'))))),
+      (('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:42'))))),
?                                                                      ++

    ('_type', 'doc')),
   (('_id', 'aaa96233-bf27-44c7-82df-b4dc15ad4d9d'),
    ('_index', 'azul_v2_dev_test_bundles_aggregate'),
    ('_score', 1.0),
    ('_source',
     (('bundles',
       ((('uuid', 'aaa96233-bf27-44c7-82df-b4dc15ad4d9d'),
         ('version', '2018-11-02T113344.698028Z')),)),
      ('contents',
       (('analysis_protocols', ()),
        ('cell_lines', ()),
        ('cell_suspensions',
         ((('biomaterial_id', ('GSM2172585 1',)),
           ('document_id', ('412898c5-5b9b-4907-b07c-e9b89666e204',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('selected_cell_type', ('~null',)),
           ('total_estimated_cells', 1),
           ('total_estimated_cells_', 1)),)),
        ('contributor_matrices', ()),
        ('donors',
         ((('biological_sex', ('female',)),
           ('biomaterial_id', ('DID_scRSq06',)),
           ('development_stage', ('~null',)),
           ('diseases', ('normal',)),
           ('document_id', ('7b07b9d0-cc0e-4098-9f64-f4a569f7d746',)),
           ('donor_count', 1),
           ('donor_count_', 1),
           ('genus_species', ('Australopithecus',)),
           ('organism_age', ('38 year',)),
           ('organism_age_range',
            ((('gte', 1198368000.0), ('lte', 1198368000.0)),)),
           ('organism_age_unit', ('year',)),
           ('organism_age_value', ('38',))),)),
        ('files',
         ((('_type', 'file'),
           ('content-type', 'application/gzip; dcp-type=data'),
           ('content_description', ('~null',)),
           ('crc32c', '1d998e49'),
           ('document_id', '0c5ac7c0-817e-40d4-b1b1-34c3d5cfecdb'),
           ('drs_path',
            '7b07f99e-4a8a-4ad0-bd4f-db0d7a00c7bb?version=2018-11-02T113344.698028Z'),
           ('file_format', 'fastq.gz'),
           ('file_type', 'sequence_file'),
           ('indexed', 0),
           ('is_intermediate', 9223372036854774784),
           ('lane_index', 9223372036854774784),
           ('lane_index_', None),
           ('name', 'SRR3562915_1.fastq.gz'),
           ('read_index', 'read1'),
           ('related_files', ()),
           ('sha256',
            '77337cb51b2e584b5ae1b99db6c163b988cbc5b894dda2f5d22424978c3bfc7a'),
           ('size', 195142097),
           ('size_', 195142097),
           ('source', '~null'),
           ('uuid', '7b07f99e-4a8a-4ad0-bd4f-db0d7a00c7bb'),
           ('version', '2018-11-02T113344.698028Z')),
          (('_type', 'file'),
           ('content-type', 'application/gzip; dcp-type=data'),
           ('content_description', ('~null',)),
           ('crc32c', '54bb9c82'),
           ('document_id', '70d1af4a-82c8-478a-8960-e9028b3616ca'),
           ('drs_path',
            '74897eb7-0701-4e4f-9e6b-8b9521b2816b?version=2018-11-02T113344.450442Z'),
           ('file_format', 'fastq.gz'),
           ('file_type', 'sequence_file'),
           ('indexed', 0),
           ('is_intermediate', 9223372036854774784),
           ('lane_index', 9223372036854774784),
           ('lane_index_', None),
           ('name', 'SRR3562915_2.fastq.gz'),
           ('read_index', 'read2'),
           ('related_files', ()),
           ('sha256',
            '465a230aa127376fa641f8b8f8cad3f08fef37c8aafc67be454f0f0e4e63d68d'),
           ('size', 190330156),
           ('size_', 190330156),
           ('source', '~null'),
           ('uuid', '74897eb7-0701-4e4f-9e6b-8b9521b2816b'),
           ('version', '2018-11-02T113344.450442Z')))),
        ('imaging_protocols', ()),
        ('library_preparation_protocols',
         ((('library_construction_approach', ('Smart-seq2',)),
           ('nucleic_acid_source', ('single cell',))),)),
        ('matrices', ()),
        ('metadata',
         ((('bundle_uuid', 'aaa96233-bf27-44c7-82df-b4dc15ad4d9d'),
           ('bundle_version', '2018-11-02T113344.698028Z'),
           ('cell_suspension.biomaterial_core.biomaterial_description',
            'Single cell from human pancreas'),
           ('cell_suspension.biomaterial_core.biomaterial_id', 'GSM2172585 1'),
           ('cell_suspension.biomaterial_core.insdc_biomaterial', 'SRS1459312'),
           ('cell_suspension.biomaterial_core.ncbi_taxon_id', '9606'),
           ('cell_suspension.genus_species.ontology', 'NCBITaxon:9606'),
           ('cell_suspension.genus_species.ontology_label', 'Homo sapiens'),
           ('cell_suspension.genus_species.text', 'Homo sapiens'),
           ('cell_suspension.provenance.document_id',
            '412898c5-5b9b-4907-b07c-e9b89666e204'),
           ('cell_suspension.total_estimated_cells', '1'),
           ('dissociation_protocol.dissociation_method.ontology', 'EFO:0009108'),
           ('dissociation_protocol.dissociation_method.ontology_label',
            'fluorescence-activated cell sorting'),
           ('dissociation_protocol.dissociation_method.text',
            'fluorescence-activated cell sorting'),
           ('dissociation_protocol.protocol_core.publication_doi',
            'https://doi.org/10.1101/108043'),
           ('dissociation_protocol.provenance.document_id',
            '31e708d3-79df-49b8-a3df-b1d694963468'),
           ('donor_organism.biomaterial_core.biomaterial_id', 'DID_scRSq06'),
           ('donor_organism.biomaterial_core.ncbi_taxon_id', '9606'),
           ('donor_organism.death.cause_of_death', 'stroke'),
           ('donor_organism.diseases.ontology', 'PATO:0000461'),
           ('donor_organism.diseases.ontology_label', 'normal'),
           ('donor_organism.diseases.text', 'normal'),
           ('donor_organism.genus_species.ontology', 'NCBITaxon:9606'),
           ('donor_organism.genus_species.ontology_label', 'Australopithecus'),
           ('donor_organism.genus_species.text', 'Australopithecus'),
           ('donor_organism.human_specific.body_mass_index', '29.5'),
           ('donor_organism.human_specific.ethnicity.ontology', 'hancestro:0005'),
           ('donor_organism.human_specific.ethnicity.ontology_label', 'European'),
           ('donor_organism.human_specific.ethnicity.text', 'European'),
           ('donor_organism.is_living', 'no'),
           ('donor_organism.organism_age', '38'),
           ('donor_organism.organism_age_unit.ontology', 'UO:0000036'),
           ('donor_organism.organism_age_unit.ontology_label', 'year'),
           ('donor_organism.organism_age_unit.text', 'year'),
           ('donor_organism.provenance.document_id',
            '7b07b9d0-cc0e-4098-9f64-f4a569f7d746'),
           ('donor_organism.sex', 'female'),
           ('enrichment_protocol.enrichment_method.ontology', 'EFO:0009108'),
           ('enrichment_protocol.enrichment_method.ontology_label',
            'fluorescence-activated cell sorting'),
           ('enrichment_protocol.enrichment_method.text', 'FACS'),
           ('enrichment_protocol.markers', 'HPx1+ HPi2+ CD133/1+ CD133/2+'),
           ('enrichment_protocol.protocol_core.publication_doi',
            'https://doi.org/10.1101/108043'),
           ('enrichment_protocol.provenance.document_id',
            '5bd4ba68-4c0e-4d22-840d-afc025e7badc'),
           ('file_crc32c', '1d998e49'),
           ('file_format', 'fastq.gz'),
           ('file_name', 'SRR3562915_1.fastq.gz'),
           ('file_sha256',
            '77337cb51b2e584b5ae1b99db6c163b988cbc5b894dda2f5d22424978c3bfc7a'),
           ('file_size', 195142097),
           ('file_uuid', '7b07f99e-4a8a-4ad0-bd4f-db0d7a00c7bb'),
           ('file_version', '2018-11-02T113344.698028Z'),
           ('library_preparation_protocol.end_bias', 'full length'),
           ('library_preparation_protocol.input_nucleic_acid_molecule.ontology',
            'OBI:0000869'),
           ('library_preparation_protocol.input_nucleic_acid_molecule.text',
            'polyA RNA'),
           ('library_preparation_protocol.library_construction_approach.ontology',
            'EFO:0008931'),
           ('library_preparation_protocol.library_construction_approach.ontology_label',
            'Smart-seq2'),
           ('library_preparation_protocol.library_construction_approach.text',
            'Smart-seq2'),
           ('library_preparation_protocol.library_construction_kit.manufacturer',
            'Illumina'),
           ('library_preparation_protocol.library_construction_kit.retail_name',
            'Nextera XT kit'),
           ('library_preparation_protocol.nucleic_acid_source', 'single cell'),
           ('library_preparation_protocol.primer', 'poly-dT'),
           ('library_preparation_protocol.provenance.document_id',
            '9c32cf70-3ed7-4720-badc-5ee71e8a38af'),
           ('library_preparation_protocol.strand', 'unstranded'),
           ('process.provenance.document_id',
            '4674255d-5ecd-4860-9b8d-beae98772cd9||4c28e079-59af-4bd3-8c8b-763ea0beba98||771ddaf6-3a4f-4314-97fe-6294ff8e25a4'),
           ('project.geo_series', 'GSE81547'),
           ('project.insdc_project', 'SRP075496'),
           ('project.project_core.project_short_name',
            'Single of human pancreas'),
           ('project.project_core.project_title',
            'Single cell transcriptome patterns.'),
           ('project.provenance.document_id',
            'e8642221-4c2c-4fd7-b926-a68bce363c88'),
           ('project.supplementary_links',
            'https://www.ebi.ac.uk/gxa/sc/experiments/E-GEOD-81547/Results'),
           ('sequence_file.insdc_run', 'SRR3562915'),
           ('sequence_file.provenance.document_id',
            '0c5ac7c0-817e-40d4-b1b1-34c3d5cfecdb'),
           ('sequence_file.read_index', 'read1'),
           ('sequence_file.read_length', '75'),
           ('sequencing_protocol.instrument_manufacturer_model.ontology',
            'EFO:0008566'),
           ('sequencing_protocol.instrument_manufacturer_model.ontology_label',
            'Illumina NextSeq 500'),
           ('sequencing_protocol.instrument_manufacturer_model.text',
            'Illumina NextSeq 500'),
           ('sequencing_protocol.paired_end', 'True'),
           ('sequencing_protocol.provenance.document_id',
            '61e629ed-0135-4492-ac8a-5c4ab3ccca8a'),
           ('sequencing_protocol.sequencing_approach.ontology', 'EFO:0008896'),
           ('sequencing_protocol.sequencing_approach.ontology_label', 'RNA-Seq'),
           ('sequencing_protocol.sequencing_approach.text', 'RNA-Seq'),
           ('specimen_from_organism.biomaterial_core.biomaterial_id',
            'DID_scRSq06_pancreas'),
           ('specimen_from_organism.biomaterial_core.ncbi_taxon_id', '9606'),
           ('specimen_from_organism.diseases.ontology', 'PATO:0000461'),
           ('specimen_from_organism.diseases.ontology_label', 'normal'),
           ('specimen_from_organism.diseases.text', 'normal'),
           ('specimen_from_organism.genus_species.ontology', 'NCBITaxon:9606'),
           ('specimen_from_organism.genus_species.ontology_label',
            'Australopithecus'),
           ('specimen_from_organism.genus_species.text', 'Australopithecus'),
           ('specimen_from_organism.organ.ontology', 'UBERON:0001264'),
           ('specimen_from_organism.organ.ontology_label', 'pancreas'),
           ('specimen_from_organism.organ.text', 'pancreas'),
           ('specimen_from_organism.organ_part.ontology', 'UBERON:0000006'),
           ('specimen_from_organism.organ_part.ontology_label',
            'islet of Langerhans'),
           ('specimen_from_organism.organ_part.text', 'islet of Langerhans'),
           ('specimen_from_organism.provenance.document_id',
            'a21dc760-a500-4236-bcff-da34a0e873d2')),
          (('bundle_uuid', 'aaa96233-bf27-44c7-82df-b4dc15ad4d9d'),
           ('bundle_version', '2018-11-02T113344.698028Z'),
           ('cell_suspension.biomaterial_core.biomaterial_description',
            'Single cell from human pancreas'),
           ('cell_suspension.biomaterial_core.biomaterial_id', 'GSM2172585 1'),
           ('cell_suspension.biomaterial_core.insdc_biomaterial', 'SRS1459312'),
           ('cell_suspension.biomaterial_core.ncbi_taxon_id', '9606'),
           ('cell_suspension.genus_species.ontology', 'NCBITaxon:9606'),
           ('cell_suspension.genus_species.ontology_label', 'Homo sapiens'),
           ('cell_suspension.genus_species.text', 'Homo sapiens'),
           ('cell_suspension.provenance.document_id',
            '412898c5-5b9b-4907-b07c-e9b89666e204'),
           ('cell_suspension.total_estimated_cells', '1'),
           ('dissociation_protocol.dissociation_method.ontology', 'EFO:0009108'),
           ('dissociation_protocol.dissociation_method.ontology_label',
            'fluorescence-activated cell sorting'),
           ('dissociation_protocol.dissociation_method.text',
            'fluorescence-activated cell sorting'),
           ('dissociation_protocol.protocol_core.publication_doi',
            'https://doi.org/10.1101/108043'),
           ('dissociation_protocol.provenance.document_id',
            '31e708d3-79df-49b8-a3df-b1d694963468'),
           ('donor_organism.biomaterial_core.biomaterial_id', 'DID_scRSq06'),
           ('donor_organism.biomaterial_core.ncbi_taxon_id', '9606'),
           ('donor_organism.death.cause_of_death', 'stroke'),
           ('donor_organism.diseases.ontology', 'PATO:0000461'),
           ('donor_organism.diseases.ontology_label', 'normal'),
           ('donor_organism.diseases.text', 'normal'),
           ('donor_organism.genus_species.ontology', 'NCBITaxon:9606'),
           ('donor_organism.genus_species.ontology_label', 'Australopithecus'),
           ('donor_organism.genus_species.text', 'Australopithecus'),
           ('donor_organism.human_specific.body_mass_index', '29.5'),
           ('donor_organism.human_specific.ethnicity.ontology', 'hancestro:0005'),
           ('donor_organism.human_specific.ethnicity.ontology_label', 'European'),
           ('donor_organism.human_specific.ethnicity.text', 'European'),
           ('donor_organism.is_living', 'no'),
           ('donor_organism.organism_age', '38'),
           ('donor_organism.organism_age_unit.ontology', 'UO:0000036'),
           ('donor_organism.organism_age_unit.ontology_label', 'year'),
           ('donor_organism.organism_age_unit.text', 'year'),
           ('donor_organism.provenance.document_id',
            '7b07b9d0-cc0e-4098-9f64-f4a569f7d746'),
           ('donor_organism.sex', 'female'),
           ('enrichment_protocol.enrichment_method.ontology', 'EFO:0009108'),
           ('enrichment_protocol.enrichment_method.ontology_label',
            'fluorescence-activated cell sorting'),
           ('enrichment_protocol.enrichment_method.text', 'FACS'),
           ('enrichment_protocol.markers', 'HPx1+ HPi2+ CD133/1+ CD133/2+'),
           ('enrichment_protocol.protocol_core.publication_doi',
            'https://doi.org/10.1101/108043'),
           ('enrichment_protocol.provenance.document_id',
            '5bd4ba68-4c0e-4d22-840d-afc025e7badc'),
           ('file_crc32c', '54bb9c82'),
           ('file_format', 'fastq.gz'),
           ('file_name', 'SRR3562915_2.fastq.gz'),
           ('file_sha256',
            '465a230aa127376fa641f8b8f8cad3f08fef37c8aafc67be454f0f0e4e63d68d'),
           ('file_size', 190330156),
           ('file_uuid', '74897eb7-0701-4e4f-9e6b-8b9521b2816b'),
           ('file_version', '2018-11-02T113344.450442Z'),
           ('library_preparation_protocol.end_bias', 'full length'),
           ('library_preparation_protocol.input_nucleic_acid_molecule.ontology',
            'OBI:0000869'),
           ('library_preparation_protocol.input_nucleic_acid_molecule.text',
            'polyA RNA'),
           ('library_preparation_protocol.library_construction_approach.ontology',
            'EFO:0008931'),
           ('library_preparation_protocol.library_construction_approach.ontology_label',
            'Smart-seq2'),
           ('library_preparation_protocol.library_construction_approach.text',
            'Smart-seq2'),
           ('library_preparation_protocol.library_construction_kit.manufacturer',
            'Illumina'),
           ('library_preparation_protocol.library_construction_kit.retail_name',
            'Nextera XT kit'),
           ('library_preparation_protocol.nucleic_acid_source', 'single cell'),
           ('library_preparation_protocol.primer', 'poly-dT'),
           ('library_preparation_protocol.provenance.document_id',
            '9c32cf70-3ed7-4720-badc-5ee71e8a38af'),
           ('library_preparation_protocol.strand', 'unstranded'),
           ('process.provenance.document_id',
            '4674255d-5ecd-4860-9b8d-beae98772cd9||4c28e079-59af-4bd3-8c8b-763ea0beba98||771ddaf6-3a4f-4314-97fe-6294ff8e25a4'),
           ('project.geo_series', 'GSE81547'),
           ('project.insdc_project', 'SRP075496'),
           ('project.project_core.project_short_name',
            'Single of human pancreas'),
           ('project.project_core.project_title',
            'Single cell transcriptome patterns.'),
           ('project.provenance.document_id',
            'e8642221-4c2c-4fd7-b926-a68bce363c88'),
           ('project.supplementary_links',
            'https://www.ebi.ac.uk/gxa/sc/experiments/E-GEOD-81547/Results'),
           ('sequence_file.insdc_run', 'SRR3562915'),
           ('sequence_file.provenance.document_id',
            '70d1af4a-82c8-478a-8960-e9028b3616ca'),
           ('sequence_file.read_index', 'read2'),
           ('sequence_file.read_length', '75'),
           ('sequencing_protocol.instrument_manufacturer_model.ontology',
            'EFO:0008566'),
           ('sequencing_protocol.instrument_manufacturer_model.ontology_label',
            'Illumina NextSeq 500'),
           ('sequencing_protocol.instrument_manufacturer_model.text',
            'Illumina NextSeq 500'),
           ('sequencing_protocol.paired_end', 'True'),
           ('sequencing_protocol.provenance.document_id',
            '61e629ed-0135-4492-ac8a-5c4ab3ccca8a'),
           ('sequencing_protocol.sequencing_approach.ontology', 'EFO:0008896'),
           ('sequencing_protocol.sequencing_approach.ontology_label', 'RNA-Seq'),
           ('sequencing_protocol.sequencing_approach.text', 'RNA-Seq'),
           ('specimen_from_organism.biomaterial_core.biomaterial_id',
            'DID_scRSq06_pancreas'),
           ('specimen_from_organism.biomaterial_core.ncbi_taxon_id', '9606'),
           ('specimen_from_organism.diseases.ontology', 'PATO:0000461'),
           ('specimen_from_organism.diseases.ontology_label', 'normal'),
           ('specimen_from_organism.diseases.text', 'normal'),
           ('specimen_from_organism.genus_species.ontology', 'NCBITaxon:9606'),
           ('specimen_from_organism.genus_species.ontology_label',
            'Australopithecus'),
           ('specimen_from_organism.genus_species.text', 'Australopithecus'),
           ('specimen_from_organism.organ.ontology', 'UBERON:0001264'),
           ('specimen_from_organism.organ.ontology_label', 'pancreas'),
           ('specimen_from_organism.organ.text', 'pancreas'),
           ('specimen_from_organism.organ_part.ontology', 'UBERON:0000006'),
           ('specimen_from_organism.organ_part.ontology_label',
            'islet of Langerhans'),
           ('specimen_from_organism.organ_part.text', 'islet of Langerhans'),
           ('specimen_from_organism.provenance.document_id',
            'a21dc760-a500-4236-bcff-da34a0e873d2')))),
        ('organoids', ()),
        ('projects',
         ((('_type', ('project',)),
           ('array_express_accessions', ('~null',)),
           ('document_id', ('e8642221-4c2c-4fd7-b926-a68bce363c88',)),
           ('geo_series_accessions', ('~null',)),
           ('insdc_project_accessions', ('~null',)),
           ('insdc_study_accessions', ('~null',)),
           ('institutions', ('Farmers Trucks', 'University')),
           ('laboratory', ('John Dear',)),
           ('project_short_name', ('Single of human pancreas',)),
           ('project_title', ('Single cell transcriptome patterns.',)),
           ('publication_titles',
            ('Single-Cell Analysis of Human Pancreas Reveals Transcriptional '
             'Signatures of Aging and Somatic Mutation Patterns.',)),
           ('supplementary_links',
            ('https://www.ebi.ac.uk/gxa/sc/experiments/E-GEOD-81547/Results',))),)),
        ('sample_specimens',
         ((('_source', ('specimen_from_organism',)),
           ('_type', ('specimen',)),
           ('biomaterial_id', ('DID_scRSq06_pancreas',)),
           ('disease', ('normal',)),
           ('document_id', ('a21dc760-a500-4236-bcff-da34a0e873d2',)),
           ('has_input_biomaterial', ('~null',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', ('~null',)),
           ('storage_method', ('~null',))),)),
        ('samples',
         ((('biomaterial_id', ('DID_scRSq06_pancreas',)),
           ('document_id', ('a21dc760-a500-4236-bcff-da34a0e873d2',)),
           ('effective_organ', ('pancreas',)),
           ('entity_type', ('specimens',)),
           ('model_organ', ('~null',)),
           ('model_organ_part', ('~null',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',))),)),
        ('sequencing_inputs',
         ((('biomaterial_id', ('GSM2172585 1',)),
           ('document_id', ('412898c5-5b9b-4907-b07c-e9b89666e204',)),
           ('sequencing_input_type', ('cell_suspension',))),)),
        ('sequencing_processes',
         ((('document_id', ('771ddaf6-3a4f-4314-97fe-6294ff8e25a4',)),),)),
        ('sequencing_protocols',
         ((('instrument_manufacturer_model', ('Illumina NextSeq 500',)),
           ('paired_end', (1,))),)),
        ('specimens',
         ((('_source', ('specimen_from_organism',)),
           ('_type', ('specimen',)),
           ('biomaterial_id', ('DID_scRSq06_pancreas',)),
           ('disease', ('normal',)),
           ('document_id', ('a21dc760-a500-4236-bcff-da34a0e873d2',)),
           ('has_input_biomaterial', ('~null',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', ('~null',)),
           ('storage_method', ('~null',))),)))),
      ('entity_id', 'aaa96233-bf27-44c7-82df-b4dc15ad4d9d'),
      ('num_contributions', 1),
      ('sources',
-      ((('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:')),)),
+      ((('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:42')),)),
?                                                                       ++

      ('total_estimated_cells', 1))),
    ('_type', 'doc')),
   (('_id',
     'aaa96233-bf27-44c7-82df-b4dc15ad4d9d_aaa96233-bf27-44c7-82df-b4dc15ad4d9d_2018-11-02T113344.698028Z_exists'),
    ('_index', 'azul_v2_dev_test_bundles'),
    ('_score', 1.0),
    ('_source',
     (('bundle_deleted', False),
      ('bundle_uuid', 'aaa96233-bf27-44c7-82df-b4dc15ad4d9d'),
      ('bundle_version', '2018-11-02T113344.698028Z'),
      ('contents',
       (('analysis_protocols', ()),
        ('cell_lines', ()),
        ('cell_suspensions',
         ((('biomaterial_id', 'GSM2172585 1'),
           ('document_id', '412898c5-5b9b-4907-b07c-e9b89666e204'),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('selected_cell_type', ('~null',)),
           ('total_estimated_cells', 1),
           ('total_estimated_cells_', 1)),)),
        ('contributor_matrices', ()),
        ('donors',
         ((('biological_sex', 'female'),
           ('biomaterial_id', 'DID_scRSq06'),
           ('development_stage', '~null'),
           ('diseases', ('normal',)),
           ('document_id', '7b07b9d0-cc0e-4098-9f64-f4a569f7d746'),
           ('genus_species', ('Australopithecus',)),
           ('organism_age', '38 year'),
           ('organism_age_range', (('gte', 1198368000.0), ('lte', 1198368000.0))),
           ('organism_age_unit', 'year'),
           ('organism_age_value', '38')),)),
        ('files',
         ((('_type', 'file'),
           ('content-type', 'application/gzip; dcp-type=data'),
           ('content_description', ('~null',)),
           ('crc32c', '1d998e49'),
           ('document_id', '0c5ac7c0-817e-40d4-b1b1-34c3d5cfecdb'),
           ('drs_path',
            '7b07f99e-4a8a-4ad0-bd4f-db0d7a00c7bb?version=2018-11-02T113344.698028Z'),
           ('file_format', 'fastq.gz'),
           ('file_type', 'sequence_file'),
           ('indexed', 0),
           ('is_intermediate', 9223372036854774784),
           ('lane_index', 9223372036854774784),
           ('lane_index_', None),
           ('name', 'SRR3562915_1.fastq.gz'),
           ('read_index', 'read1'),
           ('related_files', ()),
           ('sha256',
            '77337cb51b2e584b5ae1b99db6c163b988cbc5b894dda2f5d22424978c3bfc7a'),
           ('size', 195142097),
           ('size_', 195142097),
           ('source', '~null'),
           ('uuid', '7b07f99e-4a8a-4ad0-bd4f-db0d7a00c7bb'),
           ('version', '2018-11-02T113344.698028Z')),
          (('_type', 'file'),
           ('content-type', 'application/gzip; dcp-type=data'),
           ('content_description', ('~null',)),
           ('crc32c', '54bb9c82'),
           ('document_id', '70d1af4a-82c8-478a-8960-e9028b3616ca'),
           ('drs_path',
            '74897eb7-0701-4e4f-9e6b-8b9521b2816b?version=2018-11-02T113344.450442Z'),
           ('file_format', 'fastq.gz'),
           ('file_type', 'sequence_file'),
           ('indexed', 0),
           ('is_intermediate', 9223372036854774784),
           ('lane_index', 9223372036854774784),
           ('lane_index_', None),
           ('name', 'SRR3562915_2.fastq.gz'),
           ('read_index', 'read2'),
           ('related_files', ()),
           ('sha256',
            '465a230aa127376fa641f8b8f8cad3f08fef37c8aafc67be454f0f0e4e63d68d'),
           ('size', 190330156),
           ('size_', 190330156),
           ('source', '~null'),
           ('uuid', '74897eb7-0701-4e4f-9e6b-8b9521b2816b'),
           ('version', '2018-11-02T113344.450442Z')))),
        ('imaging_protocols', ()),
        ('library_preparation_protocols',
         ((('document_id', '9c32cf70-3ed7-4720-badc-5ee71e8a38af'),
           ('library_construction_approach', 'Smart-seq2'),
           ('nucleic_acid_source', 'single cell')),)),
        ('matrices', ()),
        ('metadata',
         ((('bundle_uuid', 'aaa96233-bf27-44c7-82df-b4dc15ad4d9d'),
           ('bundle_version', '2018-11-02T113344.698028Z'),
           ('cell_suspension.biomaterial_core.biomaterial_description',
            'Single cell from human pancreas'),
           ('cell_suspension.biomaterial_core.biomaterial_id', 'GSM2172585 1'),
           ('cell_suspension.biomaterial_core.insdc_biomaterial', 'SRS1459312'),
           ('cell_suspension.biomaterial_core.ncbi_taxon_id', '9606'),
           ('cell_suspension.genus_species.ontology', 'NCBITaxon:9606'),
           ('cell_suspension.genus_species.ontology_label', 'Homo sapiens'),
           ('cell_suspension.genus_species.text', 'Homo sapiens'),
           ('cell_suspension.provenance.document_id',
            '412898c5-5b9b-4907-b07c-e9b89666e204'),
           ('cell_suspension.total_estimated_cells', '1'),
           ('dissociation_protocol.dissociation_method.ontology', 'EFO:0009108'),
           ('dissociation_protocol.dissociation_method.ontology_label',
            'fluorescence-activated cell sorting'),
           ('dissociation_protocol.dissociation_method.text',
            'fluorescence-activated cell sorting'),
           ('dissociation_protocol.protocol_core.publication_doi',
            'https://doi.org/10.1101/108043'),
           ('dissociation_protocol.provenance.document_id',
            '31e708d3-79df-49b8-a3df-b1d694963468'),
           ('donor_organism.biomaterial_core.biomaterial_id', 'DID_scRSq06'),
           ('donor_organism.biomaterial_core.ncbi_taxon_id', '9606'),
           ('donor_organism.death.cause_of_death', 'stroke'),
           ('donor_organism.diseases.ontology', 'PATO:0000461'),
           ('donor_organism.diseases.ontology_label', 'normal'),
           ('donor_organism.diseases.text', 'normal'),
           ('donor_organism.genus_species.ontology', 'NCBITaxon:9606'),
           ('donor_organism.genus_species.ontology_label', 'Australopithecus'),
           ('donor_organism.genus_species.text', 'Australopithecus'),
           ('donor_organism.human_specific.body_mass_index', '29.5'),
           ('donor_organism.human_specific.ethnicity.ontology', 'hancestro:0005'),
           ('donor_organism.human_specific.ethnicity.ontology_label', 'European'),
           ('donor_organism.human_specific.ethnicity.text', 'European'),
           ('donor_organism.is_living', 'no'),
           ('donor_organism.organism_age', '38'),
           ('donor_organism.organism_age_unit.ontology', 'UO:0000036'),
           ('donor_organism.organism_age_unit.ontology_label', 'year'),
           ('donor_organism.organism_age_unit.text', 'year'),
           ('donor_organism.provenance.document_id',
            '7b07b9d0-cc0e-4098-9f64-f4a569f7d746'),
           ('donor_organism.sex', 'female'),
           ('enrichment_protocol.enrichment_method.ontology', 'EFO:0009108'),
           ('enrichment_protocol.enrichment_method.ontology_label',
            'fluorescence-activated cell sorting'),
           ('enrichment_protocol.enrichment_method.text', 'FACS'),
           ('enrichment_protocol.markers', 'HPx1+ HPi2+ CD133/1+ CD133/2+'),
           ('enrichment_protocol.protocol_core.publication_doi',
            'https://doi.org/10.1101/108043'),
           ('enrichment_protocol.provenance.document_id',
            '5bd4ba68-4c0e-4d22-840d-afc025e7badc'),
           ('file_crc32c', '1d998e49'),
           ('file_format', 'fastq.gz'),
           ('file_name', 'SRR3562915_1.fastq.gz'),
           ('file_sha256',
            '77337cb51b2e584b5ae1b99db6c163b988cbc5b894dda2f5d22424978c3bfc7a'),
           ('file_size', 195142097),
           ('file_uuid', '7b07f99e-4a8a-4ad0-bd4f-db0d7a00c7bb'),
           ('file_version', '2018-11-02T113344.698028Z'),
           ('library_preparation_protocol.end_bias', 'full length'),
           ('library_preparation_protocol.input_nucleic_acid_molecule.ontology',
            'OBI:0000869'),
           ('library_preparation_protocol.input_nucleic_acid_molecule.text',
            'polyA RNA'),
           ('library_preparation_protocol.library_construction_approach.ontology',
            'EFO:0008931'),
           ('library_preparation_protocol.library_construction_approach.ontology_label',
            'Smart-seq2'),
           ('library_preparation_protocol.library_construction_approach.text',
            'Smart-seq2'),
           ('library_preparation_protocol.library_construction_kit.manufacturer',
            'Illumina'),
           ('library_preparation_protocol.library_construction_kit.retail_name',
            'Nextera XT kit'),
           ('library_preparation_protocol.nucleic_acid_source', 'single cell'),
           ('library_preparation_protocol.primer', 'poly-dT'),
           ('library_preparation_protocol.provenance.document_id',
            '9c32cf70-3ed7-4720-badc-5ee71e8a38af'),
           ('library_preparation_protocol.strand', 'unstranded'),
           ('process.provenance.document_id',
            '4674255d-5ecd-4860-9b8d-beae98772cd9||4c28e079-59af-4bd3-8c8b-763ea0beba98||771ddaf6-3a4f-4314-97fe-6294ff8e25a4'),
           ('project.geo_series', 'GSE81547'),
           ('project.insdc_project', 'SRP075496'),
           ('project.project_core.project_short_name',
            'Single of human pancreas'),
           ('project.project_core.project_title',
            'Single cell transcriptome patterns.'),
           ('project.provenance.document_id',
            'e8642221-4c2c-4fd7-b926-a68bce363c88'),
           ('project.supplementary_links',
            'https://www.ebi.ac.uk/gxa/sc/experiments/E-GEOD-81547/Results'),
           ('sequence_file.insdc_run', 'SRR3562915'),
           ('sequence_file.provenance.document_id',
            '0c5ac7c0-817e-40d4-b1b1-34c3d5cfecdb'),
           ('sequence_file.read_index', 'read1'),
           ('sequence_file.read_length', '75'),
           ('sequencing_protocol.instrument_manufacturer_model.ontology',
            'EFO:0008566'),
           ('sequencing_protocol.instrument_manufacturer_model.ontology_label',
            'Illumina NextSeq 500'),
           ('sequencing_protocol.instrument_manufacturer_model.text',
            'Illumina NextSeq 500'),
           ('sequencing_protocol.paired_end', 'True'),
           ('sequencing_protocol.provenance.document_id',
            '61e629ed-0135-4492-ac8a-5c4ab3ccca8a'),
           ('sequencing_protocol.sequencing_approach.ontology', 'EFO:0008896'),
           ('sequencing_protocol.sequencing_approach.ontology_label', 'RNA-Seq'),
           ('sequencing_protocol.sequencing_approach.text', 'RNA-Seq'),
           ('specimen_from_organism.biomaterial_core.biomaterial_id',
            'DID_scRSq06_pancreas'),
           ('specimen_from_organism.biomaterial_core.ncbi_taxon_id', '9606'),
           ('specimen_from_organism.diseases.ontology', 'PATO:0000461'),
           ('specimen_from_organism.diseases.ontology_label', 'normal'),
           ('specimen_from_organism.diseases.text', 'normal'),
           ('specimen_from_organism.genus_species.ontology', 'NCBITaxon:9606'),
           ('specimen_from_organism.genus_species.ontology_label',
            'Australopithecus'),
           ('specimen_from_organism.genus_species.text', 'Australopithecus'),
           ('specimen_from_organism.organ.ontology', 'UBERON:0001264'),
           ('specimen_from_organism.organ.ontology_label', 'pancreas'),
           ('specimen_from_organism.organ.text', 'pancreas'),
           ('specimen_from_organism.organ_part.ontology', 'UBERON:0000006'),
           ('specimen_from_organism.organ_part.ontology_label',
            'islet of Langerhans'),
           ('specimen_from_organism.organ_part.text', 'islet of Langerhans'),
           ('specimen_from_organism.provenance.document_id',
            'a21dc760-a500-4236-bcff-da34a0e873d2')),
          (('bundle_uuid', 'aaa96233-bf27-44c7-82df-b4dc15ad4d9d'),
           ('bundle_version', '2018-11-02T113344.698028Z'),
           ('cell_suspension.biomaterial_core.biomaterial_description',
            'Single cell from human pancreas'),
           ('cell_suspension.biomaterial_core.biomaterial_id', 'GSM2172585 1'),
           ('cell_suspension.biomaterial_core.insdc_biomaterial', 'SRS1459312'),
           ('cell_suspension.biomaterial_core.ncbi_taxon_id', '9606'),
           ('cell_suspension.genus_species.ontology', 'NCBITaxon:9606'),
           ('cell_suspension.genus_species.ontology_label', 'Homo sapiens'),
           ('cell_suspension.genus_species.text', 'Homo sapiens'),
           ('cell_suspension.provenance.document_id',
            '412898c5-5b9b-4907-b07c-e9b89666e204'),
           ('cell_suspension.total_estimated_cells', '1'),
           ('dissociation_protocol.dissociation_method.ontology', 'EFO:0009108'),
           ('dissociation_protocol.dissociation_method.ontology_label',
            'fluorescence-activated cell sorting'),
           ('dissociation_protocol.dissociation_method.text',
            'fluorescence-activated cell sorting'),
           ('dissociation_protocol.protocol_core.publication_doi',
            'https://doi.org/10.1101/108043'),
           ('dissociation_protocol.provenance.document_id',
            '31e708d3-79df-49b8-a3df-b1d694963468'),
           ('donor_organism.biomaterial_core.biomaterial_id', 'DID_scRSq06'),
           ('donor_organism.biomaterial_core.ncbi_taxon_id', '9606'),
           ('donor_organism.death.cause_of_death', 'stroke'),
           ('donor_organism.diseases.ontology', 'PATO:0000461'),
           ('donor_organism.diseases.ontology_label', 'normal'),
           ('donor_organism.diseases.text', 'normal'),
           ('donor_organism.genus_species.ontology', 'NCBITaxon:9606'),
           ('donor_organism.genus_species.ontology_label', 'Australopithecus'),
           ('donor_organism.genus_species.text', 'Australopithecus'),
           ('donor_organism.human_specific.body_mass_index', '29.5'),
           ('donor_organism.human_specific.ethnicity.ontology', 'hancestro:0005'),
           ('donor_organism.human_specific.ethnicity.ontology_label', 'European'),
           ('donor_organism.human_specific.ethnicity.text', 'European'),
           ('donor_organism.is_living', 'no'),
           ('donor_organism.organism_age', '38'),
           ('donor_organism.organism_age_unit.ontology', 'UO:0000036'),
           ('donor_organism.organism_age_unit.ontology_label', 'year'),
           ('donor_organism.organism_age_unit.text', 'year'),
           ('donor_organism.provenance.document_id',
            '7b07b9d0-cc0e-4098-9f64-f4a569f7d746'),
           ('donor_organism.sex', 'female'),
           ('enrichment_protocol.enrichment_method.ontology', 'EFO:0009108'),
           ('enrichment_protocol.enrichment_method.ontology_label',
            'fluorescence-activated cell sorting'),
           ('enrichment_protocol.enrichment_method.text', 'FACS'),
           ('enrichment_protocol.markers', 'HPx1+ HPi2+ CD133/1+ CD133/2+'),
           ('enrichment_protocol.protocol_core.publication_doi',
            'https://doi.org/10.1101/108043'),
           ('enrichment_protocol.provenance.document_id',
            '5bd4ba68-4c0e-4d22-840d-afc025e7badc'),
           ('file_crc32c', '54bb9c82'),
           ('file_format', 'fastq.gz'),
           ('file_name', 'SRR3562915_2.fastq.gz'),
           ('file_sha256',
            '465a230aa127376fa641f8b8f8cad3f08fef37c8aafc67be454f0f0e4e63d68d'),
           ('file_size', 190330156),
           ('file_uuid', '74897eb7-0701-4e4f-9e6b-8b9521b2816b'),
           ('file_version', '2018-11-02T113344.450442Z'),
           ('library_preparation_protocol.end_bias', 'full length'),
           ('library_preparation_protocol.input_nucleic_acid_molecule.ontology',
            'OBI:0000869'),
           ('library_preparation_protocol.input_nucleic_acid_molecule.text',
            'polyA RNA'),
           ('library_preparation_protocol.library_construction_approach.ontology',
            'EFO:0008931'),
           ('library_preparation_protocol.library_construction_approach.ontology_label',
            'Smart-seq2'),
           ('library_preparation_protocol.library_construction_approach.text',
            'Smart-seq2'),
           ('library_preparation_protocol.library_construction_kit.manufacturer',
            'Illumina'),
           ('library_preparation_protocol.library_construction_kit.retail_name',
            'Nextera XT kit'),
           ('library_preparation_protocol.nucleic_acid_source', 'single cell'),
           ('library_preparation_protocol.primer', 'poly-dT'),
           ('library_preparation_protocol.provenance.document_id',
            '9c32cf70-3ed7-4720-badc-5ee71e8a38af'),
           ('library_preparation_protocol.strand', 'unstranded'),
           ('process.provenance.document_id',
            '4674255d-5ecd-4860-9b8d-beae98772cd9||4c28e079-59af-4bd3-8c8b-763ea0beba98||771ddaf6-3a4f-4314-97fe-6294ff8e25a4'),
           ('project.geo_series', 'GSE81547'),
           ('project.insdc_project', 'SRP075496'),
           ('project.project_core.project_short_name',
            'Single of human pancreas'),
           ('project.project_core.project_title',
            'Single cell transcriptome patterns.'),
           ('project.provenance.document_id',
            'e8642221-4c2c-4fd7-b926-a68bce363c88'),
           ('project.supplementary_links',
            'https://www.ebi.ac.uk/gxa/sc/experiments/E-GEOD-81547/Results'),
           ('sequence_file.insdc_run', 'SRR3562915'),
           ('sequence_file.provenance.document_id',
            '70d1af4a-82c8-478a-8960-e9028b3616ca'),
           ('sequence_file.read_index', 'read2'),
           ('sequence_file.read_length', '75'),
           ('sequencing_protocol.instrument_manufacturer_model.ontology',
            'EFO:0008566'),
           ('sequencing_protocol.instrument_manufacturer_model.ontology_label',
            'Illumina NextSeq 500'),
           ('sequencing_protocol.instrument_manufacturer_model.text',
            'Illumina NextSeq 500'),
           ('sequencing_protocol.paired_end', 'True'),
           ('sequencing_protocol.provenance.document_id',
            '61e629ed-0135-4492-ac8a-5c4ab3ccca8a'),
           ('sequencing_protocol.sequencing_approach.ontology', 'EFO:0008896'),
           ('sequencing_protocol.sequencing_approach.ontology_label', 'RNA-Seq'),
           ('sequencing_protocol.sequencing_approach.text', 'RNA-Seq'),
           ('specimen_from_organism.biomaterial_core.biomaterial_id',
            'DID_scRSq06_pancreas'),
           ('specimen_from_organism.biomaterial_core.ncbi_taxon_id', '9606'),
           ('specimen_from_organism.diseases.ontology', 'PATO:0000461'),
           ('specimen_from_organism.diseases.ontology_label', 'normal'),
           ('specimen_from_organism.diseases.text', 'normal'),
           ('specimen_from_organism.genus_species.ontology', 'NCBITaxon:9606'),
           ('specimen_from_organism.genus_species.ontology_label',
            'Australopithecus'),
           ('specimen_from_organism.genus_species.text', 'Australopithecus'),
           ('specimen_from_organism.organ.ontology', 'UBERON:0001264'),
           ('specimen_from_organism.organ.ontology_label', 'pancreas'),
           ('specimen_from_organism.organ.text', 'pancreas'),
           ('specimen_from_organism.organ_part.ontology', 'UBERON:0000006'),
           ('specimen_from_organism.organ_part.ontology_label',
            'islet of Langerhans'),
           ('specimen_from_organism.organ_part.text', 'islet of Langerhans'),
           ('specimen_from_organism.provenance.document_id',
            'a21dc760-a500-4236-bcff-da34a0e873d2')))),
        ('organoids', ()),
        ('projects',
         ((('_type', 'project'),
           ('array_express_accessions', ('~null',)),
           ('contact_names', ('Laura,,Huerta', 'Martin, Enge', 'Matthew,,Green')),
           ('contributors',
            ((('contact_name', 'Laura,,Huerta'),
              ('corresponding_contributor', 0),
              ('email', 'lauhuema@ebi.ac.uk'),
              ('institution', 'Farmers Trucks'),
              ('laboratory', 'John Dear'),
              ('project_role', 'external curator')),
             (('contact_name', 'Martin, Enge'),
              ('corresponding_contributor', 9223372036854774784),
              ('email', 'martin.enge@gmail.com'),
              ('institution', 'University'),
              ('laboratory', '~null'),
              ('project_role', '~null')),
             (('contact_name', 'Matthew,,Green'),
              ('corresponding_contributor', 0),
              ('email', 'hewgreen@ebi.ac.uk'),
              ('institution', 'Farmers Trucks'),
              ('laboratory', 'John Dear'),
              ('project_role', 'Human Cell Atlas wrangler')))),
           ('document_id', 'e8642221-4c2c-4fd7-b926-a68bce363c88'),
           ('geo_series_accessions', ('~null',)),
           ('insdc_project_accessions', ('~null',)),
           ('insdc_study_accessions', ('~null',)),
           ('institutions', ('Farmers Trucks', 'University')),
           ('laboratory', ('John Dear',)),
           ('project_description',
            'As organisms age, cells accumulate genetic and epigenetic changes '
            'that eventually lead to impaired organ function or catastrophic '
            'failure such as cancer. Here we describe a single-cell '
            'transcriptome analysis of 2544 human pancreas cells from donors, '
            'spanning six decades of life. We find that islet cells from older '
            'donors have increased levels of disorder as measured both by noise '
            'in the transcriptome and by the number of cells which display '
            'inappropriate hormone expression, revealing a transcriptional '
            'instability associated with aging. By analyzing the spectrum of '
            'somatic mutations in single cells from previously-healthy donors, '
            'we find a specific age-dependent mutational signature characterized '
            'by C to A and C to G transversions, indicators of oxidative stress, '
            'which is absent in single cells from human brain tissue or in a '
            'tumor cell line. Cells carrying a high load of such mutations also '
            'express higher levels of stress and senescence markers, including '
            'FOS, JUN, and the cytoplasmic superoxide dismutase SOD1, markers '
            'previously linked to pancreatic diseases with substantial '
            'age-dependent risk, such as type 2 diabetes mellitus and '
            'adenocarcinoma. Thus, our single-cell approach unveils gene '
            'expression changes and somatic mutations acquired in aging human '
            'tissue, and identifies molecular pathways induced by these genetic '
            'changes that could influence human disease. Also, our results '
            'demonstrate the feasibility of using single-cell RNA-seq data from '
            'primary cells to derive meaningful insights into the genetic '
            'processes that operate on aging human tissue and to determine which '
            'molecular mechanisms are coordinated with these processes. '
            'Examination of single cells from primary human pancreas tissue'),
           ('project_short_name', 'Single of human pancreas'),
           ('project_title', 'Single cell transcriptome patterns.'),
           ('publication_titles',
            ('Single-Cell Analysis of Human Pancreas Reveals Transcriptional '
             'Signatures of Aging and Somatic Mutation Patterns.',)),
           ('publications',
            ((('publication_title',
               'Single-Cell Analysis of Human Pancreas Reveals Transcriptional '
               'Signatures of Aging and Somatic Mutation Patterns.'),
              ('publication_url',
               'https://www.ncbi.nlm.nih.gov/pubmed/28965763')),)),
           ('supplementary_links',
            ('https://www.ebi.ac.uk/gxa/sc/experiments/E-GEOD-81547/Results',))),)),
        ('sample_specimens',
         ((('_source', 'specimen_from_organism'),
           ('_type', 'specimen'),
           ('biomaterial_id', 'DID_scRSq06_pancreas'),
           ('disease', ('normal',)),
           ('document_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
           ('has_input_biomaterial', '~null'),
           ('organ', 'pancreas'),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', '~null'),
           ('storage_method', '~null')),)),
        ('samples',
         ((('biomaterial_id', 'DID_scRSq06_pancreas'),
           ('document_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
           ('effective_organ', 'pancreas'),
           ('entity_type', 'specimens'),
           ('model_organ', '~null'),
           ('model_organ_part', '~null'),
           ('organ', 'pancreas'),
           ('organ_part', ('islet of Langerhans',))),)),
        ('sequencing_inputs',
         ((('biomaterial_id', 'GSM2172585 1'),
           ('document_id', '412898c5-5b9b-4907-b07c-e9b89666e204'),
           ('sequencing_input_type', 'cell_suspension')),)),
        ('sequencing_processes',
         ((('document_id', '771ddaf6-3a4f-4314-97fe-6294ff8e25a4'),),)),
        ('sequencing_protocols',
         ((('document_id', '61e629ed-0135-4492-ac8a-5c4ab3ccca8a'),
           ('instrument_manufacturer_model', 'Illumina NextSeq 500'),
           ('paired_end', 1)),)),
        ('specimens',
         ((('_source', 'specimen_from_organism'),
           ('_type', 'specimen'),
           ('biomaterial_id', 'DID_scRSq06_pancreas'),
           ('disease', ('normal',)),
           ('document_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
           ('has_input_biomaterial', '~null'),
           ('organ', 'pancreas'),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', '~null'),
           ('storage_method', '~null')),)))),
      ('entity_id', 'aaa96233-bf27-44c7-82df-b4dc15ad4d9d'),
      ('source',
-      (('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:'))))),
+      (('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:42'))))),
?                                                                      ++

    ('_type', 'doc')),
   (('_id', 'e8642221-4c2c-4fd7-b926-a68bce363c88'),
    ('_index', 'azul_v2_dev_test_projects_aggregate'),
    ('_score', 1.0),
    ('_source',
     (('bundles',
       ((('uuid', 'aaa96233-bf27-44c7-82df-b4dc15ad4d9d'),
         ('version', '2018-11-02T113344.698028Z')),)),
      ('contents',
       (('analysis_protocols', ()),
        ('cell_lines', ()),
        ('cell_suspensions',
         ((('biomaterial_id', ('GSM2172585 1',)),
           ('document_id', ('412898c5-5b9b-4907-b07c-e9b89666e204',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('selected_cell_type', ('~null',)),
           ('total_estimated_cells', 1),
           ('total_estimated_cells_', 1)),)),
        ('contributor_matrices', ()),
        ('donors',
         ((('biological_sex', ('female',)),
           ('biomaterial_id', ('DID_scRSq06',)),
           ('development_stage', ('~null',)),
           ('diseases', ('normal',)),
           ('document_id', ('7b07b9d0-cc0e-4098-9f64-f4a569f7d746',)),
           ('donor_count', 1),
           ('donor_count_', 1),
           ('genus_species', ('Australopithecus',)),
           ('organism_age', ('38 year',)),
           ('organism_age_range',
            ((('gte', 1198368000.0), ('lte', 1198368000.0)),)),
           ('organism_age_unit', ('year',)),
           ('organism_age_value', ('38',))),)),
        ('files',
         ((('content_description', ('~null',)),
           ('count', 2),
           ('file_format', 'fastq.gz'),
           ('is_intermediate', 9223372036854774784),
           ('matrix_cell_count', 9223372036854774784),
           ('matrix_cell_count_', None),
           ('size', 385472253),
           ('size_', 385472253),
           ('source', ('~null',))),)),
        ('imaging_protocols', ()),
        ('library_preparation_protocols',
         ((('library_construction_approach', ('Smart-seq2',)),
           ('nucleic_acid_source', ('single cell',))),)),
        ('matrices', ()),
        ('organoids', ()),
        ('projects',
         ((('_type', 'project'),
           ('array_express_accessions', ('~null',)),
           ('contact_names', ('Laura,,Huerta', 'Martin, Enge', 'Matthew,,Green')),
           ('contributors',
            ((('contact_name', 'Laura,,Huerta'),
              ('corresponding_contributor', 0),
              ('email', 'lauhuema@ebi.ac.uk'),
              ('institution', 'Farmers Trucks'),
              ('laboratory', 'John Dear'),
              ('project_role', 'external curator')),
             (('contact_name', 'Martin, Enge'),
              ('corresponding_contributor', 9223372036854774784),
              ('email', 'martin.enge@gmail.com'),
              ('institution', 'University'),
              ('laboratory', '~null'),
              ('project_role', '~null')),
             (('contact_name', 'Matthew,,Green'),
              ('corresponding_contributor', 0),
              ('email', 'hewgreen@ebi.ac.uk'),
              ('institution', 'Farmers Trucks'),
              ('laboratory', 'John Dear'),
              ('project_role', 'Human Cell Atlas wrangler')))),
           ('document_id', 'e8642221-4c2c-4fd7-b926-a68bce363c88'),
           ('geo_series_accessions', ('~null',)),
           ('insdc_project_accessions', ('~null',)),
           ('insdc_study_accessions', ('~null',)),
           ('institutions', ('Farmers Trucks', 'University')),
           ('laboratory', ('John Dear',)),
           ('project_description',
            'As organisms age, cells accumulate genetic and epigenetic changes '
            'that eventually lead to impaired organ function or catastrophic '
            'failure such as cancer. Here we describe a single-cell '
            'transcriptome analysis of 2544 human pancreas cells from donors, '
            'spanning six decades of life. We find that islet cells from older '
            'donors have increased levels of disorder as measured both by noise '
            'in the transcriptome and by the number of cells which display '
            'inappropriate hormone expression, revealing a transcriptional '
            'instability associated with aging. By analyzing the spectrum of '
            'somatic mutations in single cells from previously-healthy donors, '
            'we find a specific age-dependent mutational signature characterized '
            'by C to A and C to G transversions, indicators of oxidative stress, '
            'which is absent in single cells from human brain tissue or in a '
            'tumor cell line. Cells carrying a high load of such mutations also '
            'express higher levels of stress and senescence markers, including '
            'FOS, JUN, and the cytoplasmic superoxide dismutase SOD1, markers '
            'previously linked to pancreatic diseases with substantial '
            'age-dependent risk, such as type 2 diabetes mellitus and '
            'adenocarcinoma. Thus, our single-cell approach unveils gene '
            'expression changes and somatic mutations acquired in aging human '
            'tissue, and identifies molecular pathways induced by these genetic '
            'changes that could influence human disease. Also, our results '
            'demonstrate the feasibility of using single-cell RNA-seq data from '
            'primary cells to derive meaningful insights into the genetic '
            'processes that operate on aging human tissue and to determine which '
            'molecular mechanisms are coordinated with these processes. '
            'Examination of single cells from primary human pancreas tissue'),
           ('project_short_name', 'Single of human pancreas'),
           ('project_title', 'Single cell transcriptome patterns.'),
           ('publication_titles',
            ('Single-Cell Analysis of Human Pancreas Reveals Transcriptional '
             'Signatures of Aging and Somatic Mutation Patterns.',)),
           ('publications',
            ((('publication_title',
               'Single-Cell Analysis of Human Pancreas Reveals Transcriptional '
               'Signatures of Aging and Somatic Mutation Patterns.'),
              ('publication_url',
               'https://www.ncbi.nlm.nih.gov/pubmed/28965763')),)),
           ('supplementary_links',
            ('https://www.ebi.ac.uk/gxa/sc/experiments/E-GEOD-81547/Results',))),)),
        ('sample_specimens',
         ((('_source', ('specimen_from_organism',)),
           ('_type', ('specimen',)),
           ('biomaterial_id', ('DID_scRSq06_pancreas',)),
           ('disease', ('normal',)),
           ('document_id', ('a21dc760-a500-4236-bcff-da34a0e873d2',)),
           ('has_input_biomaterial', ('~null',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', ('~null',)),
           ('storage_method', ('~null',))),)),
        ('samples',
         ((('biomaterial_id', ('DID_scRSq06_pancreas',)),
           ('document_id', ('a21dc760-a500-4236-bcff-da34a0e873d2',)),
           ('effective_organ', ('pancreas',)),
           ('entity_type', ('specimens',)),
           ('model_organ', ('~null',)),
           ('model_organ_part', ('~null',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',))),)),
        ('sequencing_inputs',
         ((('biomaterial_id', ('GSM2172585 1',)),
           ('document_id', ('412898c5-5b9b-4907-b07c-e9b89666e204',)),
           ('sequencing_input_type', ('cell_suspension',))),)),
        ('sequencing_processes',
         ((('document_id', ('771ddaf6-3a4f-4314-97fe-6294ff8e25a4',)),),)),
        ('sequencing_protocols',
         ((('instrument_manufacturer_model', ('Illumina NextSeq 500',)),
           ('paired_end', (1,))),)),
        ('specimens',
         ((('_source', ('specimen_from_organism',)),
           ('_type', ('specimen',)),
           ('biomaterial_id', ('DID_scRSq06_pancreas',)),
           ('disease', ('normal',)),
           ('document_id', ('a21dc760-a500-4236-bcff-da34a0e873d2',)),
           ('has_input_biomaterial', ('~null',)),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', ('~null',)),
           ('storage_method', ('~null',))),)))),
      ('entity_id', 'e8642221-4c2c-4fd7-b926-a68bce363c88'),
      ('num_contributions', 1),
      ('sources',
-      ((('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:')),)),
+      ((('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:42')),)),
?                                                                       ++

      ('total_estimated_cells', 1))),
    ('_type', 'doc')),
   (('_id',
     'e8642221-4c2c-4fd7-b926-a68bce363c88_aaa96233-bf27-44c7-82df-b4dc15ad4d9d_2018-11-02T113344.698028Z_exists'),
    ('_index', 'azul_v2_dev_test_projects'),
    ('_score', 1.0),
    ('_source',
     (('bundle_deleted', False),
      ('bundle_uuid', 'aaa96233-bf27-44c7-82df-b4dc15ad4d9d'),
      ('bundle_version', '2018-11-02T113344.698028Z'),
      ('contents',
       (('analysis_protocols', ()),
        ('cell_lines', ()),
        ('cell_suspensions',
         ((('biomaterial_id', 'GSM2172585 1'),
           ('document_id', '412898c5-5b9b-4907-b07c-e9b89666e204'),
           ('organ', ('pancreas',)),
           ('organ_part', ('islet of Langerhans',)),
           ('selected_cell_type', ('~null',)),
           ('total_estimated_cells', 1),
           ('total_estimated_cells_', 1)),)),
        ('contributor_matrices', ()),
        ('donors',
         ((('biological_sex', 'female'),
           ('biomaterial_id', 'DID_scRSq06'),
           ('development_stage', '~null'),
           ('diseases', ('normal',)),
           ('document_id', '7b07b9d0-cc0e-4098-9f64-f4a569f7d746'),
           ('genus_species', ('Australopithecus',)),
           ('organism_age', '38 year'),
           ('organism_age_range', (('gte', 1198368000.0), ('lte', 1198368000.0))),
           ('organism_age_unit', 'year'),
           ('organism_age_value', '38')),)),
        ('files',
         ((('_type', 'file'),
           ('content-type', 'application/gzip; dcp-type=data'),
           ('content_description', ('~null',)),
           ('crc32c', '1d998e49'),
           ('document_id', '0c5ac7c0-817e-40d4-b1b1-34c3d5cfecdb'),
           ('drs_path',
            '7b07f99e-4a8a-4ad0-bd4f-db0d7a00c7bb?version=2018-11-02T113344.698028Z'),
           ('file_format', 'fastq.gz'),
           ('file_type', 'sequence_file'),
           ('indexed', 0),
           ('is_intermediate', 9223372036854774784),
           ('lane_index', 9223372036854774784),
           ('lane_index_', None),
           ('name', 'SRR3562915_1.fastq.gz'),
           ('read_index', 'read1'),
           ('related_files', ()),
           ('sha256',
            '77337cb51b2e584b5ae1b99db6c163b988cbc5b894dda2f5d22424978c3bfc7a'),
           ('size', 195142097),
           ('size_', 195142097),
           ('source', '~null'),
           ('uuid', '7b07f99e-4a8a-4ad0-bd4f-db0d7a00c7bb'),
           ('version', '2018-11-02T113344.698028Z')),
          (('_type', 'file'),
           ('content-type', 'application/gzip; dcp-type=data'),
           ('content_description', ('~null',)),
           ('crc32c', '54bb9c82'),
           ('document_id', '70d1af4a-82c8-478a-8960-e9028b3616ca'),
           ('drs_path',
            '74897eb7-0701-4e4f-9e6b-8b9521b2816b?version=2018-11-02T113344.450442Z'),
           ('file_format', 'fastq.gz'),
           ('file_type', 'sequence_file'),
           ('indexed', 0),
           ('is_intermediate', 9223372036854774784),
           ('lane_index', 9223372036854774784),
           ('lane_index_', None),
           ('name', 'SRR3562915_2.fastq.gz'),
           ('read_index', 'read2'),
           ('related_files', ()),
           ('sha256',
            '465a230aa127376fa641f8b8f8cad3f08fef37c8aafc67be454f0f0e4e63d68d'),
           ('size', 190330156),
           ('size_', 190330156),
           ('source', '~null'),
           ('uuid', '74897eb7-0701-4e4f-9e6b-8b9521b2816b'),
           ('version', '2018-11-02T113344.450442Z')))),
        ('imaging_protocols', ()),
        ('library_preparation_protocols',
         ((('document_id', '9c32cf70-3ed7-4720-badc-5ee71e8a38af'),
           ('library_construction_approach', 'Smart-seq2'),
           ('nucleic_acid_source', 'single cell')),)),
        ('matrices', ()),
        ('organoids', ()),
        ('projects',
         ((('_type', 'project'),
           ('array_express_accessions', ('~null',)),
           ('contact_names', ('Laura,,Huerta', 'Martin, Enge', 'Matthew,,Green')),
           ('contributors',
            ((('contact_name', 'Laura,,Huerta'),
              ('corresponding_contributor', 0),
              ('email', 'lauhuema@ebi.ac.uk'),
              ('institution', 'Farmers Trucks'),
              ('laboratory', 'John Dear'),
              ('project_role', 'external curator')),
             (('contact_name', 'Martin, Enge'),
              ('corresponding_contributor', 9223372036854774784),
              ('email', 'martin.enge@gmail.com'),
              ('institution', 'University'),
              ('laboratory', '~null'),
              ('project_role', '~null')),
             (('contact_name', 'Matthew,,Green'),
              ('corresponding_contributor', 0),
              ('email', 'hewgreen@ebi.ac.uk'),
              ('institution', 'Farmers Trucks'),
              ('laboratory', 'John Dear'),
              ('project_role', 'Human Cell Atlas wrangler')))),
           ('document_id', 'e8642221-4c2c-4fd7-b926-a68bce363c88'),
           ('geo_series_accessions', ('~null',)),
           ('insdc_project_accessions', ('~null',)),
           ('insdc_study_accessions', ('~null',)),
           ('institutions', ('Farmers Trucks', 'University')),
           ('laboratory', ('John Dear',)),
           ('project_description',
            'As organisms age, cells accumulate genetic and epigenetic changes '
            'that eventually lead to impaired organ function or catastrophic '
            'failure such as cancer. Here we describe a single-cell '
            'transcriptome analysis of 2544 human pancreas cells from donors, '
            'spanning six decades of life. We find that islet cells from older '
            'donors have increased levels of disorder as measured both by noise '
            'in the transcriptome and by the number of cells which display '
            'inappropriate hormone expression, revealing a transcriptional '
            'instability associated with aging. By analyzing the spectrum of '
            'somatic mutations in single cells from previously-healthy donors, '
            'we find a specific age-dependent mutational signature characterized '
            'by C to A and C to G transversions, indicators of oxidative stress, '
            'which is absent in single cells from human brain tissue or in a '
            'tumor cell line. Cells carrying a high load of such mutations also '
            'express higher levels of stress and senescence markers, including '
            'FOS, JUN, and the cytoplasmic superoxide dismutase SOD1, markers '
            'previously linked to pancreatic diseases with substantial '
            'age-dependent risk, such as type 2 diabetes mellitus and '
            'adenocarcinoma. Thus, our single-cell approach unveils gene '
            'expression changes and somatic mutations acquired in aging human '
            'tissue, and identifies molecular pathways induced by these genetic '
            'changes that could influence human disease. Also, our results '
            'demonstrate the feasibility of using single-cell RNA-seq data from '
            'primary cells to derive meaningful insights into the genetic '
            'processes that operate on aging human tissue and to determine which '
            'molecular mechanisms are coordinated with these processes. '
            'Examination of single cells from primary human pancreas tissue'),
           ('project_short_name', 'Single of human pancreas'),
           ('project_title', 'Single cell transcriptome patterns.'),
           ('publication_titles',
            ('Single-Cell Analysis of Human Pancreas Reveals Transcriptional '
             'Signatures of Aging and Somatic Mutation Patterns.',)),
           ('publications',
            ((('publication_title',
               'Single-Cell Analysis of Human Pancreas Reveals Transcriptional '
               'Signatures of Aging and Somatic Mutation Patterns.'),
              ('publication_url',
               'https://www.ncbi.nlm.nih.gov/pubmed/28965763')),)),
           ('supplementary_links',
            ('https://www.ebi.ac.uk/gxa/sc/experiments/E-GEOD-81547/Results',))),)),
        ('sample_specimens',
         ((('_source', 'specimen_from_organism'),
           ('_type', 'specimen'),
           ('biomaterial_id', 'DID_scRSq06_pancreas'),
           ('disease', ('normal',)),
           ('document_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
           ('has_input_biomaterial', '~null'),
           ('organ', 'pancreas'),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', '~null'),
           ('storage_method', '~null')),)),
        ('samples',
         ((('biomaterial_id', 'DID_scRSq06_pancreas'),
           ('document_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
           ('effective_organ', 'pancreas'),
           ('entity_type', 'specimens'),
           ('model_organ', '~null'),
           ('model_organ_part', '~null'),
           ('organ', 'pancreas'),
           ('organ_part', ('islet of Langerhans',))),)),
        ('sequencing_inputs',
         ((('biomaterial_id', 'GSM2172585 1'),
           ('document_id', '412898c5-5b9b-4907-b07c-e9b89666e204'),
           ('sequencing_input_type', 'cell_suspension')),)),
        ('sequencing_processes',
         ((('document_id', '771ddaf6-3a4f-4314-97fe-6294ff8e25a4'),),)),
        ('sequencing_protocols',
         ((('document_id', '61e629ed-0135-4492-ac8a-5c4ab3ccca8a'),
           ('instrument_manufacturer_model', 'Illumina NextSeq 500'),
           ('paired_end', 1)),)),
        ('specimens',
         ((('_source', 'specimen_from_organism'),
           ('_type', 'specimen'),
           ('biomaterial_id', 'DID_scRSq06_pancreas'),
           ('disease', ('normal',)),
           ('document_id', 'a21dc760-a500-4236-bcff-da34a0e873d2'),
           ('has_input_biomaterial', '~null'),
           ('organ', 'pancreas'),
           ('organ_part', ('islet of Langerhans',)),
           ('preservation_method', '~null'),
           ('storage_method', '~null')),)))),
      ('entity_id', 'e8642221-4c2c-4fd7-b926-a68bce363c88'),
      ('source',
-      (('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:'))))),
+      (('id', '4b737739-4dc9-5d4b-9989-a4942047c91c'), ('spec', 'test:42'))))),
?                                                                      ++

    ('_type', 'doc')))

======================================================================
hannes-ucsc commented 3 years ago

The drop commit on the PR branch makes demoing this unnecessary.

image