Questions about BIC distance calculation

Hi!

This particular derivation of the BIC is meant to detect a change in a particular point in time (it assumes only one change is occurring), so S, S1 and S2 are the sample covariance statistics of the total sample, and the before/after the hypothesized change. What those lines 95-96 are doing are calculating the maximum likelihood ratio statistic of a change occurring at that point, and lines 97-99 substract the proper penalty for BIC.

You can find the proper derivation in:

Chen, Scott, and Ponani Gopalakrishnan. "Speaker, environment and channel change detection and clustering via the bayesian information criterion." Proc. DARPA broadcast news transcription and understanding workshop. Vol. 8. 1998.

I'm not putting a link here because I don't know if that's allowed on the places hosting the article, but it's easy to find it on google by "speaker bayes information chen" or a similar query string :)

aalto-speech / speaker-diarization

Questions about BIC distance calculation #12