Closed Nv7-GitHub closed 1 year ago
I got the complete human genome sequence from the Human Genome Project. I'll send you the file via Discord later today. The thing is, we need to implement an efficient string-searching algorithm; we could potentially look into the Needleman-Wunsch algorithm. This feature would pave the way for ML analysis in the future.
Actually, it seems like Needleman-Wunsch isn't for exact string matching. Instead, it can rate segments of DNA based on how closely they match (perhaps? The Wikipedia page was kind of confusing).
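For reference, here is a minimal sketch of how Needleman-Wunsch rates two sequences. It is not code from DNAnalyzer: the function name `needleman_wunsch_score` and the scoring values (match +1, mismatch -1, gap -1) are assumptions chosen for illustration. It computes only the global alignment score, not the alignment itself.

```python
def needleman_wunsch_score(a, b, match=1, mismatch=-1, gap=-1):
    """Return the global alignment score of sequences a and b.

    Hypothetical helper for illustration; the scoring values are assumed.
    """
    # Row 0: aligning an empty prefix of a against b[:j] costs j gaps.
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        # Column 0: aligning a[:i] against an empty prefix costs i gaps.
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            # Align a[i-1] with b[j-1] (match or mismatch).
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Or place a gap in b (up) or a gap in a (left).
            up = prev[j] + gap
            left = curr[j - 1] + gap
            curr.append(max(diag, up, left))
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    print(needleman_wunsch_score("GATTACA", "GATTACA"))
    print(needleman_wunsch_score("GCATGCG", "GATTACA"))
```

A higher score means a closer match, so identical sequences score their full length under these assumed values. The full table takes time proportional to the product of the two lengths, which is why it is a poor fit for exact search over a whole genome.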
Huh, weird. It'll still be somewhat useful later on actually.
Yeah, although it's pretty complicated and would probably require some more research and testing.
@Nv7-GitHub you already implemented this feature, right?
Yeah, I implemented a simple search, although not the Needleman-Wunsch algorithm.
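The thread does not show what the simple search looks like, so the following is only a sketch of one plausible approach: an exact substring scan that reports every position where a pattern occurs in a DNA string. The name `find_all` is hypothetical, and this is not the code that was merged.

```python
def find_all(sequence, pattern):
    """Return the start indices of every occurrence of pattern in sequence.

    Hypothetical helper for illustration. Overlapping occurrences are included.
    """
    positions = []
    if not pattern:
        # An empty pattern has no meaningful match positions.
        return positions
    start = sequence.find(pattern)
    while start != -1:
        positions.append(start)
        # Advance by one character so overlapping matches are also found.
        start = sequence.find(pattern, start + 1)
    return positions


if __name__ == "__main__":
    print(find_all("ATGATGA", "ATGA"))
```

An empty result list means the pattern is not present, which covers the "test if it exists" case as well as the "find it" case.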
Okay cool, thanks!
Being able to find a protein, or test whether it exists in the DNA sequence, is useful in DNA analysis. I will implement it as an optional parameter to the CLI.