tarahmarie / dh-trace

MIT License
0 stars 0 forks source link

Ensure directories for svm, sm training set, tweak to explore-svm #37

Closed jdmartin closed 4 months ago

jdmartin commented 4 months ago

Ok, so, some of this was covered in #4, which is likely worth revisiting, and these changes are in response to #11.

Here is the current state of things (with a fewer minor, functional tweaks):

From here, the best thing to do is to cd explore; python3 explore-svm..py

Screenshot 2024-05-12 at 5 07 11 PM

As for what's next, I honestly don't know. I get that the desire is to integrate these scores into the broader application, but I'm not sure there's room for an apples-to-apples comparison. (One model is guessing based on a set of features whether or not two texts were made by the same author, and the other is comparing an unknown text against a known author). But if you had to integrate them, then I guess something like this:

From there, I guess you could pretend that this fits in with the jacquard calculations done elsewhere... but I'm still not convinced they do.

tarahmarie commented 4 months ago

I now know how to do the apples to apples comparison, and it has to do with fitting the regression curve against known authors to find the correct weight. Starting it once I add this to main.