the-Fish2 / Optimizing-GloVe

0 stars 0 forks source link

only 300 vectors #1

Closed the-Fish2 closed 2 years ago

the-Fish2 commented 2 years ago

solution: make it an array initially? more compressed than list issue - pca transform + matplot require list, array respectively

the-Fish2 commented 2 years ago

Perhaps do a random sampling

the-Fish2 commented 2 years ago

also make like a universal variable for all the places where I write 300

the-Fish2 commented 2 years ago
the-Fish2 commented 2 years ago

*correction: 3.47 days

also A TON of repetition therefore dp store an array of arrays for connections between x and y and then add to matrix else yeah

the-Fish2 commented 2 years ago

maybe also fill in the matrix with diagonal of like 1s and then it can't be the cos similarity answer?? idk- but then won't need to iterate through cluster

the-Fish2 commented 2 years ago

try running birch on 2-D?

if birch is more efficient than my clustering it works out easier (waaaaaaay faster; 0.2 vs 30 secs)

the-Fish2 commented 2 years ago

MAYBE THIS MATRIX SHOULD BE MORE LIKE A GRAPH. Like, if distance(w1, w2) = 10, and distance(w2, w3) = 6, then distance(w1, w3) is NOT less than 4 at best maybe store 'min bound's ? liiiike (depending on cosine dist similarity)

the-Fish2 commented 2 years ago

or ignore points that are included in a cluster with other points

the-Fish2 commented 2 years ago

afkljawrjwek fine run birch gives 44 dim

the-Fish2 commented 2 years ago

current: 1000 vectors