tvelden / communities

Network analysis of scientific community structures
3 stars 3 forks source link

initial data reduction step #6

Closed tvelden closed 12 years ago

tvelden commented 12 years ago

Need to revise logic in reduce.py such that papers are deleted only if no or one single authors is left after removing all 1-paper authors from data set.

sa738 commented 12 years ago

The revised code has been added. The current statistics:

For Field1: Initially Total Number of Papers was: 14599 Number of Deleted papers: 1701 Remaining Number of Papers: 12898

For Field2: Initially Total Number of Papers was: 65003 Number of Deleted papers: 6864 Remaining Number of Papers: 58139