Open ankostis opened 1 year ago
I agree that similarity to the test set as a whole would be useful to display. The improved bookkeeping I have planned for #34 will make this much easier to implement -- I'll address that first then see if I can get something working for aggregate similarity.
Currently (v0.4.5) the tool reports match ratios between all pairs of test <--> ref files. I will focus here on match ratios for test files, but the same applies to ref files, reversed.
Let's assume these are the reported match-ratios for test files:
What I'm missing is a new summary section with the grand-total matches for each test file vs the whole ref codebase, i.e. how many lines are copies, regardless of which specific ref file matched them, something like this:
When expanding the ratios in these sections I would expect to see only the "left" diff pane with the copied test-code, like a code-coverage report, reporting the number of matches for each LoC, like this:
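To make the request concrete, here is a minimal sketch of the aggregation I have in mind. It is not the tool's actual code or API: the function name `aggregate_line_matches`, the file names, and the line-range input format are all hypothetical, and the tool may well track matches at a finer granularity than whole lines. The point is only that the total is the union of matched lines across all ref files, not the sum of the pairwise ratios.

```python
from collections import Counter


def aggregate_line_matches(n_lines, matches_by_ref):
    """Return (per-line match counts, overall copied ratio) for one test file.

    n_lines: number of lines in the test file.
    matches_by_ref: hypothetical input mapping a ref-file name to a list of
        (start, end) line ranges in the test file (0-based, end exclusive)
        that were flagged as copied from that ref file.
    """
    counts = Counter()
    for ranges in matches_by_ref.values():
        covered = set()
        for start, end in ranges:
            # Clamp to the file bounds so bad ranges cannot inflate the count.
            covered.update(range(max(start, 0), min(end, n_lines)))
        # Each ref file contributes at most 1 to any given line.
        counts.update(covered)

    per_line = [counts[i] for i in range(n_lines)]
    copied = sum(1 for c in per_line if c > 0)
    ratio = copied / n_lines if n_lines else 0.0
    return per_line, ratio


# Made-up example: a 10-line test file matched by two ref files.
per_line, ratio = aggregate_line_matches(
    10,
    {
        "ref_a.py": [(0, 4)],
        "ref_b.py": [(2, 6), (8, 9)],
    },
)
print(per_line)  # [1, 1, 2, 2, 1, 1, 0, 0, 1, 0]
print(ratio)     # 0.7
```

In this made-up case the pairwise ratios would be 0.4 and 0.5, but the aggregate is 0.7 because lines 2 and 3 are matched by both ref files. The per-line list is what the coverage-style pane would display next to each LoC.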
Does that make sense?
Workaround
Currently I have to concatenate all ref files into a single one with a command like:
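For anyone who prefers a script, a rough Python sketch of the same concatenation step follows. The function name `concatenate_refs`, the `*.py` pattern, and the directory names are assumptions for illustration only; adjust them to your own layout.

```python
from pathlib import Path


def concatenate_refs(ref_dir, out_file, pattern="*.py"):
    """Concatenate every file under ref_dir matching pattern into out_file.

    Returns the number of source files written. All names here are
    illustrative; nothing in this function comes from the tool itself.
    """
    ref_dir = Path(ref_dir)
    out_file = Path(out_file)
    out_file.parent.mkdir(parents=True, exist_ok=True)

    # Sort for a stable order, and skip the output file in case it lives
    # inside ref_dir.
    sources = sorted(
        p
        for p in ref_dir.rglob(pattern)
        if p.is_file() and p.resolve() != out_file.resolve()
    )

    with out_file.open("w", encoding="utf-8") as out:
        for src in sources:
            text = src.read_text(encoding="utf-8", errors="replace")
            out.write(text)
            # Make sure the next file starts on its own line.
            if not text.endswith("\n"):
                out.write("\n")
    return len(sources)


if __name__ == "__main__":
    # Hypothetical folder names.
    if Path("refs").is_dir():
        n = concatenate_refs("refs", "refs_merged/all_refs.py")
        print(f"merged {n} ref files")
```

The merged file gives one test <--> ref ratio per test file, at the cost of no longer knowing which original ref file a match came from.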
... and then run against the new ref-folder: