Open francesco-mollica opened 2 years ago
Why add (t/f) in this formula for discards:
t = 0.0001 f = np.array(list(self.word_frequency.values())) / self.token_count self.discards = np.sqrt(t / f) + (t / f)
Why add (t/f) in this formula for discards: