ebi-gene-expression-group / ingest-tracker

Scripts to gather and prepare datasets and metadata for analysis.

Add file provenance field and track down erroneous IDs #5

Closed hewgreen closed 5 years ago

hewgreen commented 5 years ago

Report from Anja

There are some weird accessions, such as E-GEOD-00760, E-GEOD-9999992 and E-MTAB-94848373. Where are those coming from?


Columns C and D should show you the location of the files if they exist, e.g. /nfs/production3/ma/home/arrayexpress/ae2_production/data/EXPERIMENT/GEOD/E-GEOD-9999992/E-GEOD-9999992.idf.txt

However, E-GEOD-00760 and E-MTAB-94848373 are missing a loc (so I’ll track them down). I also need to ensure this field is filled even if the idf/sdrf are absent, and add a column to show the loc where the directory was discovered.
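
A rough sketch of how the provenance fields could be filled, not the tracker's actual code. It assumes the layout seen in the example path above (`<base>/EXPERIMENT/<PIPELINE>/<accession>/<accession>.idf.txt`), that the sdrf sits next to the idf as `<accession>.sdrf.txt`, and that the pipeline code is four capital letters. The function name `provenance_row` and the dictionary keys are hypothetical.

```python
import re
from pathlib import Path

# Assumed accession shape: "E-", a four-letter pipeline code, "-", digits.
ACCESSION_RE = re.compile(r"^E-([A-Z]{4})-(\d+)$")


def provenance_row(accession, base_dir):
    """Return the location fields for one accession (hypothetical helper).

    'discover_loc' is set whenever the experiment directory exists,
    even if the idf and/or sdrf files are absent.
    """
    match = ACCESSION_RE.match(accession)
    if match is None:
        raise ValueError(f"unexpected accession format: {accession!r}")

    pipeline = match.group(1)
    exp_dir = Path(base_dir) / "EXPERIMENT" / pipeline / accession
    idf = exp_dir / f"{accession}.idf.txt"
    sdrf = exp_dir / f"{accession}.sdrf.txt"  # assumed file name

    return {
        "accession": accession,
        # Empty string means "not found on disk".
        "discover_loc": str(exp_dir) if exp_dir.is_dir() else "",
        "idf_loc": str(idf) if idf.is_file() else "",
        "sdrf_loc": str(sdrf) if sdrf.is_file() else "",
    }
```

With this shape, an accession whose row has an empty `discover_loc` was never found under the expected directory at all, which is the case that needs tracking down.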

hewgreen commented 5 years ago

Added 'discover loc' column