ajenhl / tacl

Tool for performing basic text analysis on the CBETA corpus
GNU General Public License v3.0

Add 'zero fill' option to report #38

Closed ajenhl closed 9 years ago

ajenhl commented 9 years ago

Add an option to tacl report that, when given a catalogue file, adds rows to the supplied results giving a count of 0 for every text-siglum-n-gram combination that is missing from the results but whose n-gram has a non-zero count in at least one witness with the same label.
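
To make the intended behaviour concrete, here is a minimal sketch of the zero-fill step in plain Python. It is not tacl's implementation: the function name `zero_fill`, the row keys (`ngram`, `text`, `siglum`, `label`, `count`) and the `witnesses_by_label` mapping are all assumptions made for illustration, standing in for whatever the results file, catalogue and corpus actually provide.

```python
from collections import defaultdict


def zero_fill(rows, witnesses_by_label):
    """Return rows plus zero-count rows for missing witness/n-gram pairs.

    rows: list of dicts with the (assumed) keys 'ngram', 'text',
        'siglum', 'label' and 'count'.
    witnesses_by_label: dict mapping a label to a list of
        (text, siglum) tuples (hypothetical structure; it stands in
        for the catalogue and corpus lookup).
    """
    # n-grams with a non-zero count in at least one witness, per label.
    ngrams_by_label = defaultdict(set)
    # Combinations that already have a row in the results.
    seen = set()
    for row in rows:
        seen.add((row["ngram"], row["text"], row["siglum"]))
        if row["count"] > 0:
            ngrams_by_label[row["label"]].add(row["ngram"])

    filled = list(rows)
    for label, ngrams in ngrams_by_label.items():
        for text, siglum in witnesses_by_label.get(label, []):
            for ngram in sorted(ngrams):
                if (ngram, text, siglum) not in seen:
                    filled.append({
                        "ngram": ngram,
                        "text": text,
                        "siglum": siglum,
                        "label": label,
                        "count": 0,
                    })
    return filled
```

In this sketch a witness only receives zero rows for n-grams attested under its own label, which is one reading of "with the same label" above.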

ajenhl commented 9 years ago

This option must be given both a catalogue file and the corpus, as the latter is required to get the list of witnesses to each text.

ajenhl commented 9 years ago

Note that this requires changes to, at least, the min-texts and max-texts handling, which must not count witnesses that have a count of 0.
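
The kind of change meant here might look like the following sketch, which filters results by the number of texts an n-gram occurs in while ignoring zero-count rows. It is an illustration under assumptions, not tacl's code: the function name `prune_by_text_count`, its `minimum` and `maximum` parameters and the row keys are hypothetical, and it assumes a text counts once if any of its witnesses has a non-zero count.

```python
from collections import defaultdict


def prune_by_text_count(rows, minimum=None, maximum=None):
    """Keep rows whose n-gram occurs in an allowed number of texts.

    rows: list of dicts with the (assumed) keys 'ngram', 'text' and
        'count'. Rows with a count of 0 do not contribute to the
        per-n-gram text total, but are kept or dropped together with
        the other rows for the same n-gram.
    """
    # Texts in which each n-gram actually occurs (count > 0).
    texts_by_ngram = defaultdict(set)
    for row in rows:
        if row["count"] > 0:
            texts_by_ngram[row["ngram"]].add(row["text"])

    def keep(ngram):
        total = len(texts_by_ngram.get(ngram, ()))
        if minimum is not None and total < minimum:
            return False
        if maximum is not None and total > maximum:
            return False
        return True

    return [row for row in rows if keep(row["ngram"])]
```

Without the `count > 0` check, every zero-filled n-gram would appear to occur in every text with the same label, and both limits would stop discriminating between n-grams.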