The same confusion applies to the function CreateBinaryTree() in word2vec.c, where
the array representation of the tree uses vocab_size*2+1 elements, which I understand
is essentially (len(vocab)-1)*2+1? That makes sense, since a full binary tree with
n leaf nodes needs only n-1 internal nodes (2n-1 nodes in total)?
Many thanks if someone can clarify this a little bit.
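The node count in question can be checked with a small sketch. This is not the code from word2vec.c; it is a generic Huffman-style merge in Python, and the function name huffman_node_counts is made up for illustration. The example counts mirror the vocab.txt listing quoted in this thread.

```python
import heapq

def huffman_node_counts(counts):
    """Merge the two smallest nodes until one root remains.

    Returns (number of leaves, number of internal nodes created).
    """
    heap = list(counts)
    heapq.heapify(heap)
    internal = 0
    while len(heap) > 1:
        a = heapq.heappop(heap)
        b = heapq.heappop(heap)
        heapq.heappush(heap, a + b)  # each merge creates one internal node
        internal += 1
    return len(counts), internal

# Example word counts (</s> included), one leaf per vocabulary entry.
counts = [0, 12, 11, 10, 8, 5, 5, 5, 5]
leaves, internal = huffman_node_counts(counts)
total = leaves + internal
print(leaves, internal, total)
```

With n = 9 leaves this gives n-1 = 8 internal nodes and 2n-1 = 17 nodes in total, so an array of vocab_size*2+1 = 19 elements would have two more slots than the tree itself needs.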
Original comment by ma.li...@gmail.com
on 25 Aug 2013 at 3:24
Sorry, the expected output of the saved vocab.txt should be:
===============
</s> 0
and 12
the 11
four 10
in 8
used 5
war 5
one 5
nine 5
===============
It was a typo in the last line.
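For readers comparing their own output, here is a rough sketch of how a listing of this shape could be produced. It assumes the vocabulary is built by counting tokens, dropping those below min_count, and sorting by descending count with </s> kept first. The function build_vocab, its eos_count parameter, and the sample tokens are all hypothetical, and the order of tied counts may differ from what word2vec.c writes.

```python
from collections import Counter

def build_vocab(tokens, min_count=1, eos_count=0):
    """Return (word, count) pairs: </s> first, then words by descending count."""
    counts = Counter(tokens)
    # Keep only words that reach the frequency threshold.
    kept = [(w, c) for w, c in counts.items() if c >= min_count]
    # Stable sort: words with equal counts stay in first-seen order.
    kept.sort(key=lambda wc: wc[1], reverse=True)
    return [("</s>", eos_count)] + kept

# Hypothetical input, not the corpus used in this thread.
tokens = "the the and and and in".split()
for word, count in build_vocab(tokens, min_count=1):
    print(word, count)
```

With min_count=1 every token seen at least once stays in the list; raising it to 2 would drop "in" in this sample.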
Original comment by ma.li...@gmail.com
on 26 Aug 2013 at 5:12
I am also looking for the reason. Some words are missing from the trained
model, even though they should be in the vocabulary since I set min_count =
1.
(I am working on CentOS.)
Original comment by zeng...@gmail.com
on 27 Jan 2014 at 3:05
Original issue reported on code.google.com by
ma.li...@gmail.com
on 25 Aug 2013 at 2:38