Qiong-Yang / FastEI

Ultra-fast and Accurate Spectrum Matching
Apache License 2.0
24 stars 4 forks source link

example code logic / error #5

Closed thegodone closed 11 months ago

thegodone commented 11 months ago

I've testing the code and basically I have this output of the query molecules but the answer is quite strange :

here the smiles in the test folder image

here the answer based on the indexes found : image

and If I look at only the first molecule (ie I[0]):

image

I don't see any accurate matching at all, can you tell me what can be the issue ? here the matching index I have [2117858, 83971, 1358966, 1944009, 46292, 213015, 901693, 253884, 173348, 1360939]

matching_index=[I[0][0],I[1][0],I[2][0],I[3][0],I[4][0],I[5][1],I[6][8],I[7][14],I[8][17],I[9][99]]
print(matching_index)
Qiong-Yang commented 11 months ago

Because of some directory and file naming changes, I update example.py with the data from https://doi.org/10.5281/zenodo.7907140. You can firstly download the relevant data (Word2vec.model, HNSW_index.bin, The expanded in-silico library.db) from https://doi.org/10.5281/zenodo.7907140, and put them to the data directory (data/Word2vec.model, HNSW_index.bin, The expanded in-silico library.db), and then run the code (example.py).

Qiong-Yang commented 11 months ago

example_run

thegodone commented 11 months ago

I see the error now, it was this line model_file ="../model/references_word2vec.model" by replacing by data/... it works now.

Thank you