RNAcentral / rnacentral-webcode

RNAcentral website source code
https://rnacentral.org
Apache License 2.0
32 stars 9 forks source link

Import more HGNC xrefs #106

Open AntonPetrov opened 7 years ago

AntonPetrov commented 7 years ago

Need to systematically look through all unmapped HGNC xrefs and try to add as many as possible.

Examples:

Getting started:

docker exec -it container_id bash
source rnacentral/local/virtualenvs/RNAcentral/bin/activate
cd rnacentral/rnacentral-webcode/rnacentral/
curl -OL ftp://ftp.ebi.ac.uk/pub/databases/genenames/new/json/locus_groups/non-coding_RNA.json
python manage.py map_hgnc -i non-coding_RNA.json -t
AntonPetrov commented 7 years ago

Updates from HGNC

FAM30B - no RefSeq NR but there is XR_001751734

The following all have Ensembl IDs associated with the HGNC entry: ADIRF-AS1 - ENSG00000272734 LINC01902 - ENSG00000283503 LINC01958 - ENSG00000283436 LINC02006 - ENSG00000238755 LINC02009 - ENSG00000283646 PSPC1-AS2 - ENSG00000226352 RN7SL3 - ENSG00000278771 SNHG14 - ENSG00000224078 SRP54-AS1 - ENSG00000258704 ZFHX2-AS1 - ENSG00000157306

The following list are lncRNA genes where there isn't sufficient good quality sequence for these to be added into RNAcentral: CDKN1A-AS1 LRRC3DN MT-LIPCAR PRINS PTCSC1 RNVU1-11 SIRT1-AS SMCR6 TP53COR1 TTTY13B YAM1 DACOR1 DALIR DLG2-AS1 DLX6-AS2 GCASPC LINC00268 LINC00328 LINC00527 LINC00537 LINC00914 LINC01157 LINC01617

The following three no longer exist as approved symbols: