Refactor filterEdges

After profiling with VisualVM, I found that filterEdges in Graph contains unnecessary checks that consume many CPU cycles.

The points to improve in Graph.filterEdges are the following (develop branch):

By the design of the rowKey and qualifier, a degree edge can only exist at the very beginning of the cells in HBase. The degree-edge check should therefore be done on one edge only, not on all fetched edges. (https://github.com/kakao/s2graph/blob/develop/s2core/src/main/scala/com/kakao/s2graph/core/Graph.scala#L506)
The duplicate policy check is unnecessary for labels with the strong consistencyLevel. (https://github.com/kakao/s2graph/blob/develop/s2core/src/main/scala/com/kakao/s2graph/core/Graph.scala#L524)
BigDecimal.hashCode is expensive. Since only the vertexId is considered in this scope, the only possible data types are string or long, so switching from BigDecimal.hashCode to BigDecimal.longValue.hashCode would improve performance. (https://github.com/kakao/s2graph/blob/develop/s2core/src/main/scala/com/kakao/s2graph/core/Graph.scala#L501)
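
To make the first point concrete, here is a minimal sketch of checking only the head of the fetched edges. The Edge case class, its isDegree flag, and the DegreeCheckSketch object are hypothetical stand-ins for illustration, not the actual s2graph types, and the sketch assumes a degree edge can only ever be the first element.

```scala
// Hypothetical, simplified edge model; the real s2graph Edge has many more fields.
final case class Edge(srcId: Long, tgtId: Long, isDegree: Boolean)

object DegreeCheckSketch {
  // Tests every fetched edge, which is the cost this issue wants to avoid.
  def dropDegreeCheckAll(edges: Seq[Edge]): Seq[Edge] =
    edges.filterNot(_.isDegree)

  // Tests only the first edge. This is correct only under the assumption
  // that a degree edge can appear nowhere but at the head of the result.
  def dropDegreeCheckHead(edges: Seq[Edge]): Seq[Edge] =
    edges.headOption match {
      case Some(head) if head.isDegree => edges.tail
      case _                           => edges
    }
}
```

Both functions return the same result whenever that assumption holds, but the second does a single check per fetch instead of one per edge.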

Personally, I am not a fan of micro-optimization, but filterEdges goes through every fetched edge, so a little optimization of this method seems worthwhile.

In benchmarks, I see that many CPU cycles are wasted in Graph.toHashKey and in just checking for exclude/include.
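
As a rough illustration of the hashing change, the sketch below compares hashing a BigDecimal id directly with hashing its long value. HashKeySketch and its method names are hypothetical and are not the actual Graph.toHashKey code. It assumes the id is integral and fits in a Long.

```scala
object HashKeySketch {
  // Generic hash of the BigDecimal itself, reported as expensive in this issue.
  def bigDecimalHash(id: BigDecimal): Int = id.hashCode

  // Hash of the underlying Long value. Assumes the id is integral and in
  // Long range; fractional or out-of-range ids would collide more often.
  def longValueHash(id: BigDecimal): Int = id.longValue.hashCode
}
```

Equal ids still produce equal hashes with longValueHash, so it stays safe as a hash key. Only the distribution for non-integral values would get worse, which should not matter if ids are always long-backed.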

kakao / s2graph

Refactor filterEdges #130