precisely / bioinformatics

0 stars 0 forks source link

Need to create extended SVN pattern matcher #12

Closed taltman closed 6 years ago

taltman commented 6 years ago

Given a pattern encoded by a content author, a pattern matching engine needs to figure out whether the genotype of a patient's sample matches or not.

taltman commented 6 years ago

@aneilbaboo Will need to be "compiled" to an internal representation, as more than one pattern serialization can be describing the same thing. So first the pattern string is validated and compiled, and then the pattern search engine can take it and assess a given user's genotype sample to see whether a valid match is found.

aneilbaboo commented 6 years ago

@taltman Just for the record, the parser I'm developing doesn't produce a canonical form. It produces an AST that faithfully represents the genotype (or query pattern) represented by the SVN. For example, I don't order things alphabetically. As it turns out, this isn't necessary.

The query algorithm searches for a pattern in a genotype, so different SVN patterns meaning the same thing or behave identically, as different representations of the same genotype.