Closed alexloginov closed 9 years ago
I wrote the implementation for pagerank. @pankajgupta I will take a look at this.
Note: I've tried to override GraphReader and put StoredGraphDir.BothInOut (assumed, that problem could be in missing InboundNodes references), but it didn't help.
A quick note: release 5.1.0 has a new implementation of all link analysis algorithms including PageRank and Hits. However, when running the above graph on the new version, there is a divide by zero error occurring yielding infinity for nodes with no outbound neighbors. Good catch @alexloginov. I will fix the implementation and submit a patch.
@bmckown Thank you for a very quick response. I will be glad to get this patch!!!
Looking forward to the PR, @bmckown :)
On Wed, Jun 17, 2015 at 7:53 AM, alexloginov notifications@github.com wrote:
@bmckown https://github.com/bmckown Thank you for a very quick response. I will be glad to get this patch!!!
— Reply to this email directly or view it on GitHub https://github.com/twitter/cassovary/issues/186#issuecomment-112830415.
Can you verify post merge, it works for you @alexloginov ?
@bmckown, @pankajgupta Verified on 5 different small graph. Everything works correctly. Today i'm going to test it against real-world data (52M edges, 5M nodes) and will write results.
@bmckown, @pankajgupta Issue fixed. Checked on pretty big graph (52M edges, 5M nodes) - everything works like a charm!
Very good work guys, i'm very appreciated!
@pankajgupta when fixed version will be available in TWTR maven or maven-central? Right now i'm using self-packaged version, but hold 23M jar file as unmanaged dependency is a bit messy way.
Thank you, again! Alexander
@alexloginov I have released 5.1.3 to Maven -- might take a day to appear in maven central.
It is now in maven central. Closing this bug.
@pankajgupta Awesome! Thanks you very much!
Hi Everybody!
I'm trying to calculate PageRank for a graph toy_7nodes_adj_StringIds.txt using cassovary-core_2.11:5.0.0
My code is pretty simple:
But I was very surprised with results:
So it is not 1 in total. After this i decided to print out ranks for each node separately:
Than, i've tried next graph:
but result is still about 0.2.
Graph like:
Work perfectly and give a sum ~1 Also, I tried to generate RandomGraph and run PageRank, total sum of ranks about 1 as well. But with real graph it just not working (or may be i'm doing something wrong).
Could you please assist me?