VerisimilitudeX / DNAnalyzer

Revolutionizing DNA analysis and making it accessible to all through innovative ML-powered analysis and interpretive tools.
Other
132 stars, 56 forks

Protein finding #156

Closed. Nv7-GitHub closed this issue 1 year ago

Nv7-GitHub commented 2 years ago

Being able to find a protein, or test whether it exists in a DNA sequence, is useful in DNA analysis. I will implement it as an optional parameter to the CLI.

VerisimilitudeX commented 2 years ago

I got the full human genome sequence from the Human Genome Project. I'll send you the file via Discord later today. The thing is, we need to implement an efficient string-searching algorithm. We could potentially look into the Needleman-Wunsch algorithm. This feature would pave the way for ML analysis in the future.

Nv7-GitHub commented 2 years ago

Actually, it seems Needleman-Wunsch isn't for exact string matching. Instead, it rates how close a match two sequences are when aligned against each other (I think? The Wikipedia page was kind of confusing).
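
For reference, here is a minimal sketch of the scoring part of Needleman-Wunsch, to show why it rates similarity rather than finding exact matches. The class name, method name, and scoring values (match +1, mismatch -1, gap -1) are assumptions chosen for illustration. None of this comes from the DNAnalyzer codebase.

```java
public class NeedlemanWunschSketch {
    // Illustrative scoring scheme (an assumption, not a project setting).
    static final int MATCH = 1;
    static final int MISMATCH = -1;
    static final int GAP = -1;

    /** Returns the best global alignment score between sequences a and b. */
    public static int score(String a, String b) {
        int n = a.length();
        int m = b.length();
        int[][] dp = new int[n + 1][m + 1];

        // Aligning a prefix against an empty string costs one gap per character.
        for (int i = 1; i <= n; i++) dp[i][0] = i * GAP;
        for (int j = 1; j <= m; j++) dp[0][j] = j * GAP;

        for (int i = 1; i <= n; i++) {
            for (int j = 1; j <= m; j++) {
                boolean same = a.charAt(i - 1) == b.charAt(j - 1);
                int diagonal = dp[i - 1][j - 1] + (same ? MATCH : MISMATCH);
                int gapInB = dp[i - 1][j] + GAP; // a[i-1] aligned to a gap
                int gapInA = dp[i][j - 1] + GAP; // b[j-1] aligned to a gap
                dp[i][j] = Math.max(diagonal, Math.max(gapInB, gapInA));
            }
        }
        return dp[n][m];
    }

    public static void main(String[] args) {
        System.out.println(score("GATTACA", "GATTACA")); // identical: prints 7
        System.out.println(score("ACGT", "AGT"));        // one deletion: prints 2
    }
}
```

The table takes time and memory proportional to the product of the two lengths, so running it against a whole genome would need a different approach than this sketch.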

VerisimilitudeX commented 2 years ago

Huh, weird. It'll still be somewhat useful later on actually.

Nv7-GitHub commented 2 years ago

Yeah, although it's pretty complicated and would probably require more research and testing.

VerisimilitudeX commented 2 years ago

@Nv7-GitHub you already implemented this feature, right?

Nv7-GitHub commented 2 years ago

Yeah, I implemented a simple search, although not the Needleman-Wunsch algorithm.
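
As a rough illustration of what a simple exact search could look like, the sketch below scans a sequence for every occurrence of a query using `String.indexOf`. This is not the code that was actually merged; the class and method names are hypothetical, and it assumes the sequence and the query use the same alphabet (both DNA bases, or both amino acids).

```java
import java.util.ArrayList;
import java.util.List;

public class SimpleSearchSketch {
    /**
     * Returns every 0-based index where query occurs in sequence,
     * including overlapping occurrences. Hypothetical helper.
     */
    public static List<Integer> findAll(String sequence, String query) {
        List<Integer> hits = new ArrayList<>();
        if (query.isEmpty()) {
            return hits; // treat an empty query as "no matches"
        }
        int from = sequence.indexOf(query);
        while (from >= 0) {
            hits.add(from);
            // Restart one position later so overlapping matches are found.
            from = sequence.indexOf(query, from + 1);
        }
        return hits;
    }

    public static void main(String[] args) {
        System.out.println(findAll("ATGATGA", "ATGA")); // prints [0, 3]
    }
}
```

An existence check is then just `!findAll(sequence, query).isEmpty()`, or `sequence.contains(query)` when the positions are not needed.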

VerisimilitudeX commented 2 years ago

Okay cool, thanks!
