senderle / talediff

A prototype word embedding model based on type-level differential operators.
MIT License

Revisit scaling #2

Open senderle opened 6 years ago

senderle commented 6 years ago

To get a pointwise mutual information matrix: take the Hessian at (1, 1, 1, ...), scale it by a diagonal 1/word-frequency matrix on the left and the right, multiply the whole thing by the number of sentences, and take the logarithm.
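As a rough numpy sketch of that recipe (the Hessian and frequency vector below are toy stand-ins, not the repo's actual objects):

```python
import numpy as np

rng = np.random.default_rng(0)
n_words, n_sentences = 5, 100

# Toy stand-ins: a symmetric co-occurrence-style Hessian and the
# word-frequency vector derived from it.
counts = rng.integers(1, 20, size=(n_words, n_words)).astype(float)
hessian = counts + counts.T
freqs = hessian.sum(axis=1)

# diag(1/freq) @ hessian @ diag(1/freq), times the sentence count,
# then the log -- the recipe above.
inv_freq = 1.0 / freqs
pmi = np.log(n_sentences * (inv_freq[:, None] * hessian * inv_freq[None, :]))
```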

Given how well this already works, there seems to be no reason to take the logarithm, and the previous step is just multiplication by a constant. So the only part that might still matter is the two scaling multiplications. Add an option to do that. It should be as simple as left-multiplying both the Hessian and the projection vectors by the scaling matrix. (The left mul over the projection vectors becomes a right mul over the Hessian when the projection mul is executed.)
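A quick sanity check of that identity, with hypothetical stand-ins for the Hessian, the scaling matrix, and the projection vectors:

```python
import numpy as np

rng = np.random.default_rng(0)
n_words, dim = 5, 8

# Hypothetical stand-ins for the Hessian, the diag(1/freq) scaling,
# and the projection vectors.
hessian = rng.random((n_words, n_words))
inv_freq = 1.0 / rng.integers(1, 50, size=n_words)
proj = rng.standard_normal((n_words, dim))

# Left-multiply both the Hessian and the projection vectors by the
# scaling matrix. Because the projection is executed as hessian @ proj,
# the left mul over proj lands on the Hessian's right side:
piecewise = (inv_freq[:, None] * hessian) @ (inv_freq[:, None] * proj)
direct = (inv_freq[:, None] * hessian * inv_freq[None, :]) @ proj
assert np.allclose(piecewise, direct)
```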

senderle commented 6 years ago

I can't remember where I left off in my experiments with Jacobian scaling, but it would be interesting to do the same thing there, i.e. apply both a left and a right mul. I don't have a totally clear idea what that would mean, tbh, but maybe it's worth trying? It would also be kind of hard to do correctly, because it involves inserting the Jacobian multiplication between the vector and the projection, but the Jacobian is calculated at the same time as the Hessian and isn't available until that calculation is complete. Since this code calculates the whole thing piecemeal, that would require two passes through the data.

It's also just now occurring to me -- how dumb am I? -- that the Jacobian is just a word-count vector over the whole corpus!!! Seriously, dang. So scaling by diag(1/freq) is, up to a constant, the same as scaling by the inverse of the Jacobian, which means... these two approaches are kind of equivalent...
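For what it's worth, a quick numerical check of that observation. It assumes (my guess, nothing stated above) a generating function of the form f(x) = sum over sentences of the product of x_w over tokens; under that assumption, the gradient at (1, ..., 1) is exactly the corpus word-count vector:

```python
import numpy as np

# Tiny corpus: sentences as lists of word ids (word 0 repeats in the
# last sentence, so counts are per-token, with multiplicity).
sentences = [[0, 1, 2], [1, 2], [0, 0, 3]]
n_words = 4

def f(x):
    # Assumed generating function: sum over sentences of prod of x_w.
    return sum(np.prod(x[s]) for s in sentences)

# Central-difference gradient of f at the all-ones point.
eps = 1e-6
ones = np.ones(n_words)
grad = np.array([
    (f(ones + eps * np.eye(n_words)[i]) - f(ones - eps * np.eye(n_words)[i]))
    / (2 * eps)
    for i in range(n_words)
])

# The gradient matches the raw word counts.
word_counts = np.bincount([w for s in sentences for w in s], minlength=n_words)
assert np.allclose(grad, word_counts)
```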