linkedin / photon-ml

A scalable machine learning library on Apache Spark
Other
792 stars 185 forks source link

Optimize per-group evaluation I/O #475

Closed cmjiang closed 4 years ago

cmjiang commented 4 years ago

Change per-group evaluation output data frame coalesce to repartition to speed up the I/O.