Open danpovey opened 3 years ago
Cool! Is there any specific reasons to use this repo for the sampling? I find there is similar function in Pytorch: torch.distributions.categorical.Categorical
Oh, I didn't know about that. Then that should be fine.
OK, thanks! I’ll try to use this sampling idea in the unsupervised training.
@zhu-han this repo contains the sampling code I mentioned to you. The "iterative" aspect of it is not needed here, we just treat it as a simple way to sample from a distribution.
Below, is some icefall code called unsupervised.py, that I was going to use to sample CTC transcripts for use in unsupervised training. I believe sampling is more correct than taking the top-one, and will avoid it collapsing to blank.