Hi,
I implemented the feature described in the title. The implementation follows Section 3.3 of [1], with gradient update rules for the marginals similar to those in Section 3.2. I verified the marginals against the identity that the gradient of the log-partition function with respect to the edge scores equals the edge marginals, and the two agree. Let me know if anything needs to be improved!
[1] Structured Prediction Models via the Matrix-Tree Theorem. Koo et al. 2007.
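
For reviewers who want to reproduce the marginals check independently, here is a minimal pure-Python sketch. It is not the code in this PR: it assumes a multi-root setting where any number of words may attach to the root, and all names (`log_partition`, `fd_marginals`, `brute_force_marginals`, `theta_root`, `theta`) are hypothetical. It computes log Z from the Laplacian determinant, takes central finite differences of log Z with respect to each edge score, and compares the result with marginals obtained by enumerating every tree.

```python
import itertools
import math


def determinant(matrix):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in matrix]
    n, det = len(a), 1.0
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(a[r][c]))
        if a[p][c] == 0.0:
            return 0.0
        if p != c:
            a[c], a[p] = a[p], a[c]
            det = -det  # a row swap flips the sign
        det *= a[c][c]
        for r in range(c + 1, n):
            f = a[r][c] / a[c][c]
            for k in range(c, n):
                a[r][k] -= f * a[c][k]
    return det


def log_partition(theta_root, theta):
    """log Z via the Matrix-Tree Theorem (assumed multi-root variant).

    theta_root[m] scores root -> m; theta[h][m] scores h -> m (diagonal ignored).
    """
    n = len(theta_root)
    lap = [[0.0] * n for _ in range(n)]
    for m in range(n):
        lap[m][m] = math.exp(theta_root[m])
        for h in range(n):
            if h != m:
                w = math.exp(theta[h][m])
                lap[h][m] = -w   # off-diagonal: negated edge weight
                lap[m][m] += w   # diagonal: total incoming weight of m
    return math.log(determinant(lap))


def fd_marginals(theta_root, theta, eps=1e-5):
    """Edge marginals as central finite differences of log Z (head -1 is the root)."""
    n = len(theta_root)

    def bumped(h, m, delta):
        root, edges = list(theta_root), [row[:] for row in theta]
        if h == -1:
            root[m] += delta
        else:
            edges[h][m] += delta
        return log_partition(root, edges)

    return {
        (h, m): (bumped(h, m, eps) - bumped(h, m, -eps)) / (2 * eps)
        for m in range(n) for h in range(-1, n) if h != m
    }


def is_tree(heads):
    """True if every word reaches the root (-1) by following heads, i.e. no cycles."""
    for m in range(len(heads)):
        node, steps = m, 0
        while node != -1:
            node, steps = heads[node], steps + 1
            if steps > len(heads):
                return False
    return True


def brute_force_marginals(theta_root, theta):
    """log Z and edge marginals by enumerating all trees (tiny inputs only)."""
    n = len(theta_root)
    z, mass = 0.0, {}
    for heads in itertools.product(range(-1, n), repeat=n):
        if not is_tree(heads):
            continue
        score = sum(theta_root[m] if h == -1 else theta[h][m]
                    for m, h in enumerate(heads))
        w = math.exp(score)
        z += w
        for m, h in enumerate(heads):
            mass[(h, m)] = mass.get((h, m), 0.0) + w
    return math.log(z), {edge: v / z for edge, v in mass.items()}


if __name__ == "__main__":
    root_scores = [0.3, -0.5, 1.1]
    edge_scores = [[0.0, 0.7, -0.2], [0.4, 0.0, 0.9], [-1.0, 0.25, 0.0]]
    _, exact = brute_force_marginals(root_scores, edge_scores)
    approx = fd_marginals(root_scores, edge_scores)
    print(max(abs(approx[e] - exact[e]) for e in exact))
```

The brute-force enumeration is exponential in sentence length, so it is only useful as a test oracle on three or four words. Finite differences stand in here for whatever analytic or autograd gradient the actual implementation uses.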