hoffmangroup / segway

Application for semi-automated genomic annotation.
http://segway.hoffmanlab.org/
GNU General Public License v2.0

Semisupervised Mean Generate from Track data in supervision region. #37

Open EricR86 opened 9 years ago

EricR86 commented 9 years ago

Original report (BitBucket issue) by Sakura Tamaki (Bitbucket: Tamaki_Sakura).


Currently, in semi-supervised mode, the initial Gaussian mean is generated the same way as in unsupervised mode, i.e. from the mean of the whole track data plus random noise.

It might be useful to change this, or at least to add a parameter, so that when the Gaussian mean is initialized in the first step of expectation maximization, the mean for each track and each supervision label comes from the track data inside all supervision regions carrying that label, rather than from the whole-track mean.
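To make the proposal concrete, here is a minimal sketch of the idea in plain Python. It is not Segway's code: the function names, the `(start, end, label)` region tuples, the `noise_sd` parameter, and the fallback to the whole-track mean for labels without supervision regions are all assumptions made for illustration.

```python
import math
import random
from collections import defaultdict


def whole_track_mean(track):
    """Mean of all defined (non-NaN) values in one track."""
    values = [v for v in track if not math.isnan(v)]
    return sum(values) / len(values)


def supervised_label_means(track, regions):
    """Per-label mean of the track data inside supervision regions.

    `regions` is a hypothetical list of (start, end, label) tuples with
    half-open coordinates; NaN entries stand for missing data.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for start, end, label in regions:
        for value in track[start:end]:
            if not math.isnan(value):
                sums[label] += value
                counts[label] += 1
    return {label: sums[label] / counts[label] for label in counts}


def initial_mean(track, regions, label, noise_sd=0.0, rng=random):
    """Initial Gaussian mean for one track and one label.

    Uses the supervised-region mean when the label has supervision data,
    and otherwise falls back to the whole-track mean (an assumption here),
    then adds Gaussian noise with standard deviation `noise_sd`.
    """
    means = supervised_label_means(track, regions)
    base = means.get(label, whole_track_mean(track))
    return base + rng.gauss(0.0, noise_sd)


track = [1.0, 1.0, 5.0, 5.0, float("nan"), 9.0]
regions = [(0, 2, 0), (2, 4, 1)]  # label 0 covers [0, 2), label 1 covers [2, 4)
label_means = supervised_label_means(track, regions)
```

With this toy track, labels 0 and 1 start from the means of their own supervision regions, while a label with no supervision region (say, label 2) starts from the whole-track mean, as it does today.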

EricR86 commented 9 years ago

Original comment by Sakura Tamaki (Bitbucket: Tamaki_Sakura).


EricR86 commented 9 years ago

Original comment by Eric Roberts (Bitbucket: ericr86, GitHub: ericr86).


Pull request #25 was made for this enhancement.

EricR86 commented 9 years ago

Original comment by Sakura Tamaki (Bitbucket: Tamaki_Sakura).


We are now unsure whether this change is really useful, since in our clustering the mean is still very likely to shift elsewhere during training.