Dear
Hello, I want to use my own dataset to train a model, but I'm not sure how to preprocess the input data. The function I have is:
python
def surfdata_prepare(ply_file, frag_kept_sdf):
'''
use the sdf_file as the center
'''
protein_dict = read_ply(ply_file)
keep_frag_mol = read_sdf(frag_kept_sdf)[0]
ligand_dict = parse_rdmol(keep_frag_mol)
data = ProteinLigandData.from_protein_ligand_dicts(
protein_dict = torchify_dict(protein_dict),
ligand_dict = torchify_dict(ligand_dict)
)
return data
This is the function from Delete.py. Can this function be used to construct a training dataset, or is surfdata_prepare insufficient for preprocessing the data for training?
Dear Hello, I want to use my own dataset to train a model, but I'm not sure how to preprocess the input data. The function I have is: python def surfdata_prepare(ply_file, frag_kept_sdf): ''' use the sdf_file as the center ''' protein_dict = read_ply(ply_file) keep_frag_mol = read_sdf(frag_kept_sdf)[0] ligand_dict = parse_rdmol(keep_frag_mol) data = ProteinLigandData.from_protein_ligand_dicts( protein_dict = torchify_dict(protein_dict), ligand_dict = torchify_dict(ligand_dict) ) return data This is the function from Delete.py. Can this function be used to construct a training dataset, or is surfdata_prepare insufficient for preprocessing the data for training?