BioDataFuse / pyBiodatafuse

Python package for biodatafuse project.
MIT License

MolMeDB, output dict with repeated values #153

Closed by tabbassidaloii 4 weeks ago

tabbassidaloii commented 1 month ago

The same compound appears twice in the output of get_gene_compound_inhibitor. The two entries differ only in source_pmid, so they could be merged into a single entry that carries both PMIDs:

 {'compound_name': 'Tetrodotoxin',
  'InChIKey': 'CFMYXEVWODSLAX-UHFFFAOYSA-N',
  'SMILES': 'NC1=NC(O)C2C3OC4(O)OC(C(O)C2(N1)C4O)C3(O)CO',
  'compound_cid': '4490623',
  'molmedb_id': 'MM17151',
  'source_pmid': '1325372',
  'chebi_id': nan,
  'drugbank_id': nan,
  'uniprot_trembl_id': 'P35499'},
 {'compound_name': 'Tetrodotoxin',
  'InChIKey': 'CFMYXEVWODSLAX-UHFFFAOYSA-N',
  'SMILES': 'NC1=NC(O)C2C3OC4(O)OC(C(O)C2(N1)C4O)C3(O)CO',
  'compound_cid': '4490623',
  'molmedb_id': 'MM17151',
  'source_pmid': '10533585',
  'chebi_id': nan,
  'drugbank_id': nan,
  'uniprot_trembl_id': 'P35499'}

The input gene was SCN4A.
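
For illustration, the merge suggested above could look roughly like the sketch below. It is not taken from pyBiodatafuse and may differ from how the package handles this. The helper name `merge_source_pmids`, the choice of `molmedb_id` plus `uniprot_trembl_id` as the identity of an entry, and the `", "` separator are all assumptions made for this example.

```python
import math


def _is_missing(value):
    """Return True for None or a float NaN (the output above uses nan for gaps)."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def merge_source_pmids(records,
                       key_fields=("molmedb_id", "uniprot_trembl_id"),
                       merge_field="source_pmid",
                       sep=", "):
    """Collapse records sharing the same key_fields into one record.

    Hypothetical helper: values of merge_field are collected in first-seen
    order, de-duplicated, and joined with sep. All other fields are taken
    from the first record seen for each key.
    """
    merged = {}  # key tuple -> (entry dict, list of collected PMIDs)
    for rec in records:
        key = tuple(rec.get(field) for field in key_fields)
        if key not in merged:
            merged[key] = (dict(rec), [])
        _, pmids = merged[key]
        pmid = rec.get(merge_field)
        if not _is_missing(pmid) and pmid not in pmids:
            pmids.append(pmid)

    result = []
    for entry, pmids in merged.values():
        # Keep NaN when no PMID was available, mirroring the other empty fields.
        entry[merge_field] = sep.join(str(p) for p in pmids) if pmids else float("nan")
        result.append(entry)
    return result


# The two Tetrodotoxin entries from the output above (some fields omitted).
records = [
    {"compound_name": "Tetrodotoxin", "molmedb_id": "MM17151",
     "source_pmid": "1325372", "uniprot_trembl_id": "P35499"},
    {"compound_name": "Tetrodotoxin", "molmedb_id": "MM17151",
     "source_pmid": "10533585", "uniprot_trembl_id": "P35499"},
]
merged_records = merge_source_pmids(records)
print(merged_records[0]["source_pmid"])  # prints: 1325372, 10533585
```

Keying on both the compound ID and the protein ID keeps the same compound listed separately for different targets. Whether that is the desired behaviour depends on how the output is consumed downstream.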

YojanaGadiya commented 1 month ago

Fixed in PR #157