Closed tvelden closed 12 years ago
The revised code has been added. The current statistics:
For Field1: Initially Total Number of Papers was: 14599 Number of Deleted papers: 1701 Remaining Number of Papers: 12898
For Field2: Initially Total Number of Papers was: 65003 Number of Deleted papers: 6864 Remaining Number of Papers: 58139
Need to revise logic in reduce.py such that papers are deleted only if no or one single authors is left after removing all 1-paper authors from data set.