Direct optimization of Gravity Law

murrayds commented 3 years ago

YY wants to see what the embedding would be like if we learned an embedding that directly optimized the gravity law. That is, something like CMDS, except we try to learn an embedding that preserves gravity-like distances between locations.

Is this something that could be easily implemented and tested?

yy commented 3 years ago

I think @jisungyoon tried it? Just wanted to know where it ended up.

skojaku commented 3 years ago

I believe we already have tested two versions of direct optimization of the gravity model, namely Levy's matrix factorization and word2vec with the negative sampling exponent = 1. Both methods optimize the gravity model with the distance and mass being the dot similarity and stationary distribution, respectively. The difference is that one does so using the stochastic gradient while the other using SVD.

@jisungyoon showed that both approaches perform comparably or less than the word2vec with the original parameter setting. These points are not clearly written in the manuscript so would worthwhile to write a paragraph on this in the math (or discussion?). In addition to this, should we show something in addition to performance like the same visualization of the embedding? I feel that repeating the whole analysis is optional.

murrayds commented 3 years ago

@skojaku I think what YY means is that, at present, we are learning gravity law relationships using word2vec, which is true because word2vec is a gravity model. Similarly, we argue that Levy's factorization is equivalent to a gravity law, because Levy's factorization is equivalent to word2vec, which in turn is equivalent to a gravity model.

So currently, we have two embeddings:

Data -> Word2vec -> Gravity Law, and
Data -> Levy's factorization -> Word2vec -> Gravity Law

I think what YY is asking is if we get rid of word2vec and Levy's factorization entirely. That is, why should we approximate something that is equivalent to a gravity law, when we can just optimize for the gravity law directly using something like CMDS? I however don't know how feasible this is.

@yy is this closer to what you were thinking?

yy commented 3 years ago

Yeah. essentailly like force-directed layout where the length of springs are determined by the flow according to the gravity law. Does it make sense?

skojaku commented 3 years ago

Let me check my understanding to the question.

The gravity model is a quite general model and we showed the equivalent between the word2vec and a specific gravity model with distance being dot similarity and mass being a stationary distribution. A question we want to answer is if we fit another gravity model that has a different definition of distance and mass, does it explain the trajectory better than the gravity model equivalent to the word2vec?

yy commented 3 years ago

No it's a simpler question. Assuming a gravity equation, given a measured flux between two places and their mass, we can calculate the expected distance. We assign this distance to every pair of location, obtaining the distance matrix. Then we can simply try to optimize their embedding (MDS). What would this produce?

murrayds commented 3 years ago

we can calculate the expected distance

If we calculate an expected distance, why would we do the embedding? We'd be using an expected distance to learn an embedding that approximates the original expected distance?

yy commented 3 years ago

yeah that's what I'm trying to get at here. If we think naïvely, why not fix the gravity law first? We can measure the flux and masses, then we have the expected pairwise distance. Why not try to embed everything based on this distance? Why should we do the word2vec or any other approaches? Can we easily answer this question?