A simple, user-friendly website where users can input molecules (a string) and use RDkit, an open source Python package, to calculate various descriptors.
Include a function to the website which compares a set of molecules with another set of molecules, giving pairwise scores, aka. Tanimoto similarity scores (0.0: 0% not similar, 1.0: 100% identical).
Two textboxes; one for "template" SMILES-structures, and another for the set which will be compared to the the template structures.
Output should be a .csv-file for each set which is compared.
e.g. template is compromosed of molecule A, B,. Comparison-set is molecule x, y, z.
output is two csv-files:
csv-file 1: A-x, A-y, A-z
csv file 2: B-x, B-y, B-z
each row in a .csv-file should be compromised of two columns:
If molecule A (CCN) is compared with molecule X (CCC), Y (NCN), then the corresponding CSV-file will look like:
What is this function?
Include a function to the website which compares a set of molecules with another set of molecules, giving pairwise scores, aka. Tanimoto similarity scores (0.0: 0% not similar, 1.0: 100% identical).
Function is: rdkit.DataStructs.cDataStructs.TanimotoSimilarity(). (https://www.rdkit.org/docs/source/rdkit.DataStructs.cDataStructs.html) Useful tutorial: https://www.youtube.com/watch?v=3qzZbaUzo9M
How should it look like?
Two textboxes; one for "template" SMILES-structures, and another for the set which will be compared to the the template structures. Output should be a .csv-file for each set which is compared.
e.g. template is compromosed of molecule A, B,. Comparison-set is molecule x, y, z.
output is two csv-files: