adding random seed functionality

inpho / vsm

Vector Space Model Framework developed for InPhO

Other

35 stars 14 forks source link

Okay, I made very similar changes before I read my email. You can now pass a seed to random_corpus and LdaCgsSeq.train. Note that there's also a demostration function in ldacgsseq named demo_LdaCgsSeq which builds a random corpus for you and trains on it; this now takes additional parameters corpus_seed and model_seed.

The changes have not been made to LdaCgsMulti, as the business about RNG over multiple threads is a bit funny. The default behavior is to pickle the random state and so pass identical copies of the random state to each of the threads. This is unacceptable. The current workaround is to insist that each thread reseed. Ideally there should be a global RNG for all threads which does not thwart the performance gains of the parallelism.

Thank you very much for the changes.

inpho / vsm

adding random seed functionality #84