Open javfg opened 3 days ago
The homologues step writes files into two subfolders:
homologues
target-inputs/homologue/gene-dictionary and target-inputs/homologue/homologies
target-inputs/homologue/gene-dictionary
target-inputs/homologue/homologies
The files in the gene-dictionary folder are JSON files fetched from:
gene-dictionary
https://ftp.ensembl.org/pub/release-${ensembl_version}/json/${species}/${species}.json
with ${species} being: caenorhabditis_elegans, canis_lupus_familiaris, cavia_porcellus, danio_rerio, drosophila_melanogaster, macaca_mulatta, mus_musculus, oryctolagus_cuniculus, pan_troglodytes, rattus_norvegicus, sus_scrofa, xenopus_tropicalis, homo_sapiens.
${species}
caenorhabditis_elegans
canis_lupus_familiaris
cavia_porcellus
danio_rerio
drosophila_melanogaster
macaca_mulatta
mus_musculus
oryctolagus_cuniculus
pan_troglodytes
rattus_norvegicus
sus_scrofa
xenopus_tropicalis
homo_sapiens
PIS was running:
jq -r '.genes[] | [.id, .name] | @tsv'
on the files, extracting those two fields and building a TSV file with them.
This is the function to apply the transformation: https://github.com/opentargets/platform-etl-backend/blob/11a1f67ce194d079603543f5f96b76c9963e35e8/src/main/scala/io/opentargets/etl/backend/target/Target.scala#L312
Description
The
homologues
step writes files into two subfolders:target-inputs/homologue/gene-dictionary
andtarget-inputs/homologue/homologies
The files in the
gene-dictionary
folder are JSON files fetched from:with
${species}
being:caenorhabditis_elegans
,canis_lupus_familiaris
,cavia_porcellus
,danio_rerio
,drosophila_melanogaster
,macaca_mulatta
,mus_musculus
,oryctolagus_cuniculus
,pan_troglodytes
,rattus_norvegicus
,sus_scrofa
,xenopus_tropicalis
,homo_sapiens
.Transformations PIS was doing
PIS was running:
on the files, extracting those two fields and building a TSV file with them.
Tasks