oxpig / ANARCI

Antibody Numbering and Antigen Receptor ClassIfication
BSD 3-Clause "New" or "Revised" License
can I use it to do "cdr annotation"? how to do it, tell me some suggestions #56

Closed qiuqingpo closed 12 months ago

qiuqingpo commented 1 year ago

Can I use it to do "CDR annotation"? If so, how? Any suggestions would be appreciated, thank you!

awesome-crab commented 1 year ago

Read the docs! Also, maybe read up on what numbering schemes are for antibodies.

ANARCI will produce a numbering of the antibody sequence you provide. The numbering will differ depending on the numbering scheme you specify. Assuming you use IMGT: https://www.imgt.org/IMGTScientificChart/Numbering/IMGTnumberingCDR_VH.html

The CDRs are always at the same positions in the IMGT numbering scheme, so in IMGT CDR1 will be positions 26 to 39.

qiuqingpo commented 1 year ago

I'm sorry, I'm not a biology major and I don't understand IMGT's instructions very well, but I look forward to ANARCI adding CDR annotation. Thank you.

awesome-crab commented 1 year ago

I am not a biology major either; I just read what I need to know, and some things you will learn along the way. To spell it out: ANARCI tries to detect a domain (in the simplest terms, a variable region) inside the antibody sequence with the help of Hidden Markov Models. The domain is then annotated (numbered) with the numbering scheme you specified when running ANARCI. Depending on the numbering scheme, the CDRs sit at fixed positions. In IMGT, CDR1 always runs from position 27 to position 38, and the other CDRs are fixed in the same way. I wanted to make this clear because I was not sure my previous answer was. Also, I stated earlier that CDR1 in IMGT runs from 26 to 39, which is not correct; one could consider 26 and 39 to be the anchor points. Remember the 4th of June!! It's an important day for students....
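To make the "fixed positions" idea concrete, here is a minimal sketch of slicing the three IMGT CDRs out of an already-numbered domain. The IMGT ranges (CDR1 27-38, CDR2 56-65, CDR3 105-117) come from the IMGT scheme itself. Everything else is an assumption for illustration: the function name `extract_imgt_cdrs` is hypothetical and not part of ANARCI, the input is assumed to be a list of ((position, insertion_code), residue) tuples with "-" for unoccupied positions, and the toy numbering is hand-made rather than real ANARCI output.

```python
# IMGT CDR boundaries (inclusive); these are fixed by the IMGT scheme.
IMGT_CDRS = {"cdr1": (27, 38), "cdr2": (56, 65), "cdr3": (105, 117)}


def extract_imgt_cdrs(numbering):
    """Return the CDR sequences of an IMGT-numbered variable domain.

    `numbering` is ASSUMED to be a list of ((position, insertion_code), residue)
    tuples in sequence order, with "-" marking unoccupied positions.
    This helper is hypothetical; it is not an ANARCI function.
    """
    cdrs = {}
    for name, (start, end) in IMGT_CDRS.items():
        # Keep residues whose position falls inside the CDR, skipping gaps.
        cdrs[name] = "".join(
            aa
            for (pos, _ins), aa in numbering
            if start <= pos <= end and aa != "-"
        )
    return cdrs


# Hand-made toy numbering (NOT real ANARCI output), including one gap
# at position 29 and two insertion codes inside CDR3.
toy = [
    ((26, " "), "S"), ((27, " "), "G"), ((28, " "), "F"), ((29, " "), "-"),
    ((38, " "), "Y"), ((39, " "), "M"),
    ((56, " "), "I"), ((65, " "), "T"), ((66, " "), "N"),
    ((105, " "), "A"), ((111, " "), "R"), ((111, "A"), "D"),
    ((112, "A"), "Y"), ((112, " "), "W"), ((117, " "), "V"), ((118, " "), "W"),
]

print(extract_imgt_cdrs(toy))
```

If I read the ANARCI README correctly, `number(sequence, scheme="imgt")` from the `anarci` package returns a numbering in this shape together with the chain type; treat that as an assumption and check it against your installed version before relying on it.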

qiuqingpo commented 1 year ago

Hello, I looked through the issues and found a get_imgt_cdr function, but it has errors. Will that affect this? https://github.com/oxpig/ANARCI/issues/14

awesome-crab commented 1 year ago

I do not understand your question. Try being more specific.

qiuqingpo commented 1 year ago

Can ANARCI be used to get the CDRs under IMGT? I found the function "get_imgt_cdr".

awesome-crab commented 1 year ago

I already told you: yes. I pointed out above how you can achieve this. I don't know where you found "get_imgt_cdr", but I don't think it is from the ANARCI repository. So maybe go to the repo your function came from?

qiuqingpo commented 1 year ago

Sorry, I only just understood. If I want to use Chothia, Kabat, or Contact for CDR annotation, can you point me to some documentation similar to https://www.imgt.org/IMGTScientificChart/Numbering/IMGTnumberingCDR_VH.html

awesome-crab commented 1 year ago

You could actually google it, you know? Search for "compare imgt kabat cothia" and the 7th result is this: https://plueckthun.bioc.uzh.ch/antibody/Numbering/NumFrame.html

qiuqingpo commented 1 year ago

@awesome-crab Hello, thank you for your warm reply. I checked the code and found that it fetches its database from the IMGT website using RipIMGT.py. The URLs are as follows:

urls = {
    "HV": "http://www.imgt.org/IMGT_GENE-DB/GENElect?query=7.3+IGHV&species=%s",
    "HJ": "http://www.imgt.org/IMGT_GENE-DB/GENElect?query=7.6+IGHJ&species=%s",
    "KV": "http://www.imgt.org/IMGT_GENE-DB/GENElect?query=7.3+IGKV&species=%s",
    "KJ": "http://www.imgt.org/IMGT_GENE-DB/GENElect?query=7.6+IGKJ&species=%s",
    "LV": "http://www.imgt.org/IMGT_GENE-DB/GENElect?query=7.3+IGLV&species=%s",
    "LJ": "http://www.imgt.org/IMGT_GENE-DB/GENElect?query=7.6+IGLJ&species=%s",
    "AV": "http://www.imgt.org/IMGT_GENE-DB/GENElect?query=7.3+TRAV&species=%s",
    "AJ": "http://www.imgt.org/IMGT_GENE-DB/GENElect?query=7.6+TRAJ&species=%s",
    "BV": "http://www.imgt.org/IMGT_GENE-DB/GENElect?query=7.3+TRBV&species=%s",
    "BJ": "http://www.imgt.org/IMGT_GENE-DB/GENElect?query=7.6+TRBJ&species=%s",
    "GV": "http://www.imgt.org/IMGT_GENE-DB/GENElect?query=7.3+TRGV&species=%s",
    "GJ": "http://www.imgt.org/IMGT_GENE-DB/GENElect?query=7.6+TRGJ&species=%s",
    "DV": "http://www.imgt.org/IMGT_GENE-DB/GENElect?query=7.3+TRDV&species=%s",
    "DJ": "http://www.imgt.org/IMGT_GENE-DB/GENElect?query=7.6+TRDJ&species=%s",
    #"HC": "http://www.imgt.org/IMGT_GENE-DB/GENElect?query=7.3+IGHC&species=%s",
    #"KC": "http://www.imgt.org/IMGT_GENE-DB/GENElect?query=7.3+IGKC&species=%s",
    #"LC": "http://www.imgt.org/IMGT_GENE-DB/GENElect?query=7.3+IGLC&species=%s",
}

The IMGT website is updated very often. Do I need to reinstall ANARCI each time? Can ANARCI fetch the IMGT data automatically? Thank you!

awesome-crab commented 1 year ago

Yeah, sorry, I am not going to answer you anymore. It seems you do not understand English well enough to understand the answers I gave. Good luck.

qiuqingpo commented 1 year ago

@awesome-crab Sorry, my English is poor; I am Chinese. Your answer is very helpful to me. Could you read the source code of "RipIMGT.py"? Do I need to reinstall ANARCI when the website is updated?

awesome-crab commented 1 year ago

It seems you do not understand what you are doing; also, I have no idea what you are doing with RipIMGT.py. Why do you ask a question about code you found on IMGT in the GitHub repo for ANARCI? Ask your question where you got your code?!? This is ridiculous.

qiuqingpo commented 1 year ago

@awesome-crab Please check ANARCI's setup.py: it needs to run "RUN_pipeline.sh", which in turn contains the shell command "python3 $DIR/RipIMGT.py". That script downloads data from the IMGT website.
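As a rough illustration of what such a download step would request, the sketch below fills the %s placeholder in two of the URL templates quoted earlier in this thread. It only builds the query strings and does not contact IMGT. The species string "Homo+sapiens" and the helper name `build_queries` are assumptions for illustration, not taken from RipIMGT.py.

```python
# Two of the URL templates quoted earlier in the thread; %s marks the species.
urls = {
    "HV": "http://www.imgt.org/IMGT_GENE-DB/GENElect?query=7.3+IGHV&species=%s",
    "HJ": "http://www.imgt.org/IMGT_GENE-DB/GENElect?query=7.6+IGHJ&species=%s",
}


def build_queries(species):
    """Fill the species placeholder in every template (hypothetical helper)."""
    return {gene: template % species for gene, template in urls.items()}


# "Homo+sapiens" is an ASSUMED species spelling, used here only as an example.
queries = build_queries("Homo+sapiens")
for gene, url in queries.items():
    print(gene, url)
```

Because the templates point at a live website, whatever they return is a snapshot from the moment the script ran.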

awesome-crab commented 1 year ago

tian men square massacre1984

qiuqingpo commented 12 months ago

@awesome-crab I asked a new question in https://github.com/oxpig/ANARCI/issues/63. You are welcome to answer there. Thank you very much again.

broncio123 commented 12 months ago

Sorry guys, I will close this issue, as the answer provided by @awesome-crab is actually good enough and this question is not software-related. Issue #63 has also already been addressed; please see the answer there.