blei-lab / hdp

Hierarchical Dirichlet processes. Topic models where the data determine the number of topics. This implements Gibbs sampling.
GNU General Public License v2.0
150 stars 47 forks source link

Training set #3

Open harshanshyam opened 7 years ago

harshanshyam commented 7 years ago

Please give a proper training set and a good readme. Thanks in advance

Abigale001 commented 6 years ago

You can download nytimes and pubmed dataset in the lightlda project. There are scripts to download them.