ajenhl / tacl

Tool for performing basic text analysis on the CBETA corpus
GNU General Public License v3.0
30 stars 9 forks source link

Incorrect results using --extend before --reduce #18

Closed ajenhl closed 10 years ago

ajenhl commented 10 years ago

Running tacl report --extend on unreduced results, and then reducing, generates incorrect results (some results are not removed when they ought to be). Running --reduce before --extend generates (apparently) correct results.

ajenhl commented 10 years ago

I am not convinced there is a bug here, being unable to create a test case where the results of extend then reduce are incorrect. Performing a reduce followed by an extend and a reduce is highly likely to generate incorrect results, due to the double reduce.