greenelab / django-genes

A Django package to represent genes
BSD 3-Clause "New" or "Revised" License

"NEWENTRY" line in Homo_sapiens.gene_info should be filtered out #3

Closed dongbohu closed 7 years ago

dongbohu commented 7 years ago

The genes table in the database is populated by data in this file: ftp://ftp.ncbi.nih.gov/gene/DATA/GENE_INFO/Mammalia/Homo_sapiens.gene_info.gz

As of 08/17/2017, this file includes the following NEWENTRY line:

9606    192343  NEWENTRY    -   -   -   -   -   Record to support submission of GeneRIFs for a gene not in Gene (human; man).   other   -   -   -   -   20170812    -

This line should be filtered out in the following management command: genes/management/commands/genes_load_geneinfo.py
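
A minimal sketch of the kind of filter this could be, not the actual code in genes_load_geneinfo.py. It assumes the file is tab-delimited with the gene symbol in the third column (as in the line quoted above) and that header lines start with "#". The function name iter_gene_rows and the abbreviated A1BG row are hypothetical, used only for illustration.

```python
import csv
import io

# Placeholder symbol NCBI uses for the GeneRIF-support record quoted above.
NEWENTRY_SYMBOL = "NEWENTRY"


def iter_gene_rows(handle):
    """Yield data rows from a gene_info-style file, skipping NEWENTRY records.

    Assumes tab-delimited input with the symbol in column index 2
    (tax_id, GeneID, Symbol, ...). Hypothetical helper, not part of
    django-genes.
    """
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        # Skip blank lines and the "#..." header line.
        if not row or row[0].startswith("#"):
            continue
        # Skip the placeholder record that is not a real gene.
        if len(row) > 2 and row[2] == NEWENTRY_SYMBOL:
            continue
        yield row


# Usage with an in-memory sample: one illustrative (abbreviated) gene row
# followed by the NEWENTRY row from this issue.
sample = io.StringIO(
    "#tax_id\tGeneID\tSymbol\n"
    "9606\t1\tA1BG\t-\t-\t-\t-\t-\t-\t-\t-\t-\t-\t-\t-\t-\n"
    "9606\t192343\tNEWENTRY\t-\t-\t-\t-\t-\t"
    "Record to support submission of GeneRIFs for a gene not in Gene "
    "(human; man).\tother\t-\t-\t-\t-\t20170812\t-\n"
)
rows = list(iter_gene_rows(sample))
```

In a management command, the same check could be applied to each line read from the decompressed Homo_sapiens.gene_info file before a gene record is created or updated.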

To see how this issue affects the web interface, please refer to the following issue in adage-server: https://github.com/greenelab/adage-server/issues/198