yanboliang / spark-vlbfgs

Vector-free L-BFGS implementation for Spark MLlib
Apache License 2.0
46 stars 17 forks source link

Fix rdd parallelize problem in sc.parallelize method #6

Closed WeichenXu123 closed 7 years ago

WeichenXu123 commented 7 years ago

What changes were proposed in this pull request?

When initialize DistributedVector zero-vector, at sc.parallelize, add sleep(1000) into each map task so that it will help spread task into the whole cluster, or the tasks may be aggregates in a few cluster nodes, in my test.

This is a walk-round way for now, in the future if there is better way I will update it.

How was this patch tested?

Manual.