Open mengmeng34 opened 2 years ago
One possible way is to convert SDF format to SMILES then using RDKit to generate molecular graph which is favored by Graphormer.
Thank you for your reply.
This is indeed a feasible method, but the process of converting SDF format to SMILES will inevitably lose some edge information. In order to achieve better results, I am considering whether I can extract the atom and edge information in SDF format by myself, and then write my own functions to calculate the data in batched_data, such as in-degree, out-degree, etc. I don't know which method is more feasible.
Looking forward to discussing with you.
What atom/edge information do you need? RDKit could provide basic features when processing the SMILES.
Sorry I don't know much about RDKit, I need to find more information to investigate the feasibility of this approach.
Hello, the graph representation of the drug molecules I downloaded is in SDF format or JSON format. What kind of preprocessing do I need to do with them in order to use them as input to the graphormer? Looking forward to your reply.