PugetSoundClinic-PIT / ProjectTracking

Apache License 2.0
0 stars 0 forks source link

Debug Author - User EM Model #129

Closed evamaxfield closed 9 months ago

evamaxfield commented 9 months ago

I believe I got it working. I think there was something wrong in the way parameters were being ordered in train time vs apply time. We now have a simpler model (only one embedding for the whole chunk of details) vs comparisons for each part vs each other part. I think the field that is messing up the classification the most is the co_authors / co_contributors.

Testing that now but additionally with this new model we should go through and get log proba for each and make sure our annotation is correct. We may need more positive examples too.

evamaxfield commented 9 months ago

Last thing to debug is incorrect annotations with predict_proba help. :)