ContinuumIO / topik

A Topic Modeling toolbox
BSD 3-Clause "New" or "Revised" License
93 stars 24 forks source link

Exclude empty documents and log their occurrence. #68

Open gpfreitas opened 8 years ago

gpfreitas commented 8 years ago

We should exclude empty documents because they generate useless output at best, and crashes at worst.

However, we must not silently drop the document, as it may be useful for the user to know that there is an empty document in the database.