rjust / defects4j

A Database of Real Faults and an Experimental Infrastructure to Enable Controlled Experiments in Software Engineering Research
MIT License
717 stars 299 forks source link

Prevent duplicating active_bugs.csv header (#474) #552

Closed agodio closed 4 months ago

agodio commented 7 months ago

If a project has already been promoted to the main Defects4J database, if (-e "$CORE_DIR/Project/$PID.pm") { evaluates to true at initialize-project-and-collect-issues.pl:180, active_bugs.csv ends up with a duplicated header: one from the just mined bugs ($COMMIT_DB_FILE) and the other from the complement computed by grep -vFf $COMMIT_DB_FILE.orig $COMMIT_DB_FILE, since $COMMIT_DB_FILE.orig lacks such header.

rjust commented 4 months ago

Thanks for fixing this!