Open tanglizhe1105 opened 8 years ago
It seems the psCount should not more than excutor number, but if not set psCount, may can not processing large scala corpus, such as doc more than million, and word more than million!
Thx
I have decided the reason of this bug, that is the model size(such as array size of IntArrayWithIntKey, or row of IntMatrixWithIntKey) could not less than --pscount. detail please see here
Vocabulary: 25970 Docs: 1000 Tokens: 106776 Topics: 1000
cluster has 20 servers, each server has 8 core cpub, 48GB mem. when set --psCount 20, lda work well set --psCount 40 , lda also work well but try to set --psCount 60, some tasks of Parameter server jop will failure.
log as following: