To keep track:
Avec le changement d'URL de neXtProt (ressource qui n'est plus maintenue) et du mode de requêtage de Uniprot, il faut entièrement revoir la partie ID_Mapping du code source: filehttps://github.com/vloux/ProteoRE/tree/master/tools/proteore_data_manager/resource_building.py
line 133 # 3. ID mapping
Rem: la construction du dictionnaire d'ID Human est un merge qui construit un nouveau "header" avec des 2 colonnes additionnelles à créer => "UniProt-AC_reviewed" et "neXtProt"
cf Line 148 #header
if human : tab = [["UniProt-AC","UniProt-AC_reviewed","UniProt-ID","GeneID","RefSeq","GI","PDB","GO","PIR","MIM","UniGene","Ensembl_Gene","Ensembl_Transcript","Ensembl_Protein","neXtProt","BioGrid","STRING","KEGG",'Gene_Name']]
else : tab = [["UniProt-AC","UniProt-AC_reviewed","UniProt-ID","GeneID","RefSeq","GI","PDB","GO","PIR","MIM","UniGene","Ensembl_Gene","Ensembl_Transcript","Ensembl_Protein","BioGrid","STRING","KEGG",'Gene_Name']]
To keep track: Avec le changement d'URL de neXtProt (ressource qui n'est plus maintenue) et du mode de requêtage de Uniprot, il faut entièrement revoir la partie ID_Mapping du code source: filehttps://github.com/vloux/ProteoRE/tree/master/tools/proteore_data_manager/resource_building.py
line 133 # 3. ID mapping Rem: la construction du dictionnaire d'ID Human est un merge qui construit un nouveau "header" avec des 2 colonnes additionnelles à créer => "UniProt-AC_reviewed" et "neXtProt" cf Line 148 #header if human : tab = [["UniProt-AC","UniProt-AC_reviewed","UniProt-ID","GeneID","RefSeq","GI","PDB","GO","PIR","MIM","UniGene","Ensembl_Gene","Ensembl_Transcript","Ensembl_Protein","neXtProt","BioGrid","STRING","KEGG",'Gene_Name']] else : tab = [["UniProt-AC","UniProt-AC_reviewed","UniProt-ID","GeneID","RefSeq","GI","PDB","GO","PIR","MIM","UniGene","Ensembl_Gene","Ensembl_Transcript","Ensembl_Protein","BioGrid","STRING","KEGG",'Gene_Name']]
La construction de l'index requiert :
1.2.1 https://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/idmapping/by_organism/[HUMAN_9606_idmapping.dat.gz (à voir à quoi il sert ?? lire https://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/idmapping/README) 1.2.2 https://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/idmapping/by_organism/HUMAN_9606_idmapping_selected.tab.gz (utilisé pour construire le dictionnaire des ID Uniprot)
Enfin, qques infos sur l'issue échanges avec David qui a construit le programme "resource_building.py": https://github.com/vloux/ProteoRE/issues/236