jplag / JPlag

State-of-the-Art Source Code Plagiarism & Collusion Detection
https://jplag.github.io/JPlag/
GNU General Public License v3.0
1.42k stars 312 forks source link

Feature requeset: same name submissions #1848

Open euberdeveloper opened 3 months ago

euberdeveloper commented 3 months ago

I have a case in which I have submissions (or more precisely old submissions) with the same name.

This piece of code blocks them, but this is problematic, I have to do a directory refactoring just to be able to run JPlag.

My directory structure looks like that:

2023
    ms1
        gr1
        gr2 
        gr3
    ms2
        gr1 
        gr2
        gr3
    ms3
        gr1
        gr2
        gr3
    ms4
        gr1
        gr2
        gr3

2022
    ms1
        gr1
        gr2 
        gr3
    ms2
        gr1
        gr2
        gr3
    ms3
        gr1
        gr2
        gr3
    ms4
        gr1
        gr2
        gr3

2021
    ms1
        gr1 
        gr2 
        gr3
    ms2
        gr1
        gr2
        gr3
    ms3
        gr1
        gr2
        gr3
    ms4
        gr1
        gr2
        gr3

If I am checking in the current submissions "2023/ms1" (milestone1 of all groups in 2023), I would put as old submissions ["2022/ms1", "2021/ms1", "2020/ms1"] (milestone1 of all groups of the previous years 2022,2021,2020).

Hence, ms1 would be "duplicate" for all the previous years. A solution would be, in case the path is different but the name is the same, to take the shorter parent path to make them different, in this case taking as namess ["2022/ms1", "2021/ms1] instead of ["ms1","ms1"]

Kr0nox commented 3 months ago

This sounds like it is related to #1445