microsoft / Graphormer

Graphormer is a general-purpose deep learning backbone for molecular modeling.
MIT License
2k stars 324 forks source link

Lack of diversity in the 1ake example prediction by DiG #183

Open PKUfjh opened 2 months ago

PKUfjh commented 2 months ago

I tried to use the newly open-sourced DiG code to make 1ake example prediction for 100 samples, but all conformations are mainly around 1ake experimental structure, deviate away from 4ake structure. I think the result is contradictory with the result in your paper. The result is shown in the figure. overlay_tmscore_1ake To be more specific, the command I used for prediction is

PDBID="1ake"
CKPT_PATH=./checkpoints/checkpoint-520k.pth
FEATURE_PATH=./dataset/${PDBID}.pkl
FASTA_PATH=./dataset/${PDBID}.fasta
OUTDIR=./output/${PDBID}/
mkdir -p ${OUTDIR}
python run_inference.py -c ${CKPT_PATH} -i ${FEATURE_PATH}\
 -s ${FASTA_PATH} -o ${PDBID} --output-prefix ${OUTDIR}\
 -n 100 --use-gpu --use-tqdm
zhengsx commented 2 months ago

the energy of 4ake is much higher than 1ake, thus much more samples are needed to observe structures around 4ake.

PKUfjh commented 2 months ago

No, since 1ake and 4ake has the same sequence, the input is the same with 1ake for 4ake.发自我的iPhone------------------ Original ------------------From: Advaith @.>Date: Tue,Apr 23,2024 6:59 PMTo: microsoft/Graphormer @.>Cc: PKUfjh @.>, Author @.>Subject: Re: [microsoft/Graphormer] Lack of diversity in the 1ake exampleprediction by DiG (Issue #183) Hi, How were you able to obtain the representations for 4ake? I did not find it in the dataset.

—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you authored the thread.Message ID: @.> [ { @.": "http://schema.org", @.": "EmailMessage", "potentialAction": { @.": "ViewAction", "target": "https://github.com/microsoft/Graphormer/issues/183#issuecomment-2072005338", "url": "https://github.com/microsoft/Graphormer/issues/183#issuecomment-2072005338", "name": "View Issue" }, "description": "View this Issue on GitHub", "publisher": { @.***": "Organization", "name": "GitHub", "url": "https://github.com" } } ]

sai-advaith commented 2 months ago

Thanks for the response.

When you get the pdb output generated from the code, were you able to visualize the sidechains too? We get a pdb with backbones only.

PKUfjh commented 2 months ago

Thanks for the response.

When you get the pdb output generated from the code, were you able to visualize the sidechains too? We get a pdb with backbones only.

No, I think the output is only the protein backbones.

Paulie-ai commented 1 month ago

Hi, I used AF2 repr from evoformer directly for pdbid 1ake and not used dataset provided, i find the sample is almost same as ground truth 1ake structure, why? Is my pipeline is differ with paper? single and pair repe is l, 384 and l, l,128 as provided shape. Thanks