russHyde / dupree

{dupree} helps identify code blocks that have a high level of similarity in a set of R files
https://russhyde.github.io/dupree/
Other
37 stars 0 forks source link

Set default 'min_block_size' to 40 (20 was too lenient) #68

Closed russHyde closed 4 years ago

russHyde commented 4 years ago

Updated min_block_size argument for all dupree functions and the preprocess_code_blocks function; initial default was 20 but this pulled out too many blocks and too many trivial overlaps between blocks.

The README was updated, since it was originally written to compare block-size=40 to (previous default) block size of 20. Now compares block-size=50 to (current default) 40.