medialab / bibliotools3.0

modification of bibliotools 2.2 from Sébastian Grauwin
Apache License 2.0
12 stars 4 forks source link

Threshold by average and quartile #10

Open tommv opened 9 years ago

tommv commented 9 years ago

Extracting comparable networks (in the sense of having roughly the same number of nodes) from time-spans containing a highly diverse number of bibliographical notices demands to set different filtering thresholds. Lower for time-spans containing fewer nodes; higher for time-spans containing more nodes. A way of doing this in a more systematic way may be to use average and quartiles. Instead of filtering all the nodes with an occurence count ("occ") lower than N or the edges with a weight lower than N ("weight"), we could filter all the nodes and edges with an occurence or weight lower than the average (or the 1st quartile or the 3rd quartile).