During the ingest pipeline, annotate the list of variants for rsid or top loci.
Nearest gene: This would be especially useful for the "top loci" page. Since these are a tiny fraction of the overall file, performance and memory usage are not critical. The PheWeb lookup code would be a good candidate for inclusion.
https://github.com/statgen/pheweb/blob/master/pheweb/load/add_genes.py
RSID: Ideally, users would want us to add this information for every variant possible. The Locuszoom-db code has high-performance loaders using LMDB or Rocksdb. The size of the data (and server memory limits) may create additional deployment considerations for these features.
https://github.com/statgen/locuszoom-db/blob/master/locuszoom/db/snp_lmdb.py
During the ingest pipeline, annotate the list of variants for rsid or top loci.
Nearest gene: This would be especially useful for the "top loci" page. Since these are a tiny fraction of the overall file, performance and memory usage are not critical. The PheWeb lookup code would be a good candidate for inclusion. https://github.com/statgen/pheweb/blob/master/pheweb/load/add_genes.py
RSID: Ideally, users would want us to add this information for every variant possible. The Locuszoom-db code has high-performance loaders using LMDB or Rocksdb. The size of the data (and server memory limits) may create additional deployment considerations for these features. https://github.com/statgen/locuszoom-db/blob/master/locuszoom/db/snp_lmdb.py