Feature request: Additional calculations regarding network analysis

Golden-Retriever-1 commented 4 years ago

It would be nice to add some measures that are often used in (social) network analysis and are worth looking at during data analysis.

**Is your feature request related to a problem? Please describe.**

I request this feature because many applications in (social) network analysis differentiate between two ways of interpreting edge weights. Depending on the interpretation, the weights denote either:

* Costs or real distances (e.g., the distance in miles between two cities): the analyst calculates the network's statistics with the weights as they are, since the distance between two nodes should be the cheapest or quickest one, or
* Value or strength (e.g., votes, interactions): here the analyst would decide to invert the weights, since the distance between two nodes should be the most valuable one.

Is it possible to add an option for choosing which interpretation of weights is used when calculating the network's centrality (Wasserman & Faust, 1994)? In addition, it would be nice to have information centrality and eccentricity centrality as well as degree prestige, PageRank prestige, and proximity prestige (Wasserman & Faust, 1994).

**Describe the solution you'd like**

An option could be added that lets the user choose whether network statistics are calculated using the weights or the inverse weights. Since it is your software and I do not know the architecture, it would be easier for you to decide how to implement that option. The additional centrality measures could be added to the centrality output table. The prestige measures could be made selectable in the same way as the centrality measures, with their output provided in a table as well.

**Describe alternatives you've considered**

The alternative would be to use other software to calculate the prestige measures and the missing centrality measures. Hence, this feature request is about improving the network analysis calculations.
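To make the two weight interpretations concrete, here is a minimal Python sketch. It is not JASP code and says nothing about how the module computes centrality; the function names, the `invert` flag, and the toy weights are all made up for illustration. It assumes an undirected network with strictly positive weights and computes closeness centrality once with the weights used as distances and once with the inverse weights used as distances.

```python
import heapq

def shortest_path_lengths(adj, source):
    """Dijkstra's algorithm; adj maps node -> {neighbour: edge length >= 0}."""
    dist = {source: 0.0}
    queue = [(0.0, source)]
    while queue:
        d, node = heapq.heappop(queue)
        if d > dist.get(node, float("inf")):
            continue  # stale queue entry
        for nbr, length in adj[node].items():
            nd = d + length
            if nd < dist.get(nbr, float("inf")):
                dist[nbr] = nd
                heapq.heappush(queue, (nd, nbr))
    return dist

def closeness(weights, invert=False):
    """Closeness = (reachable nodes) / (sum of shortest-path lengths).

    invert=False: weights are costs/distances and are used as edge lengths.
    invert=True:  weights are strengths, so 1/weight is used as edge length.
    Assumption for this sketch: all weights are strictly positive.
    """
    adj = {u: {v: (1.0 / w if invert else float(w)) for v, w in nbrs.items()}
           for u, nbrs in weights.items()}
    result = {}
    for node in adj:
        dist = shortest_path_lengths(adj, node)
        total = sum(d for other, d in dist.items() if other != node)
        result[node] = (len(dist) - 1) / total if total > 0 else 0.0
    return result

# Hypothetical toy network (symmetric, i.e. undirected).
weights = {
    "A": {"B": 4, "C": 1},
    "B": {"A": 4, "C": 4},
    "C": {"A": 1, "B": 4},
}

as_cost = closeness(weights)                   # weights read as distances
as_strength = closeness(weights, invert=True)  # weights read as strengths

print(min(as_cost, key=as_cost.get))          # least central node under the cost reading
print(max(as_strength, key=as_strength.get))  # most central node under the strength reading
```

In this toy network node B is the least central node when the weights are read as costs and the most central node when they are read as strengths, which is why the choice of interpretation matters for the reported centrality.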

FransMeerhoff commented 4 years ago

@vandenman

As a reminder, can you please look at this issue and please respond?

Cheers, Frans

vandenman commented 4 years ago

A key difference between Wasserman & Faust (1994) and the networks currently implemented in the network analysis module is whether the edges are stochastic. In Wasserman & Faust (1994), edges are observed. In the network analysis module, edges are estimated from the data. The question is whether the statistics for observed networks also make sense for stochastic ones. Currently, there is evidence to the contrary (e.g., see Dablander & Hinne, 2019).

Nevertheless, it would be nice to augment the network analysis module with the network analyses where the edges are observed. I'll put this on the to-do list, but I'm afraid I cannot give a time estimate.

References:

Dablander, F., & Hinne, M. (2019). Node centrality measures are a poor substitute for causal inference. Scientific Reports, 9(1), 1-13.

Golden-Retriever-1 commented 4 years ago

I appreciate your efforts. However, your claim above does not cover all applications; whether it holds depends heavily on the application. No generalizations, please. It would be good to add this point to the JASP manual/documentation to make users aware of it. Some other software tools for network analysis do not even specify the details of their calculations.

I thank you for adding the topic to the to-do list. I am looking forward to its implementation.

Regarding Dablander & Hinne (2019, p. 8): I keep the descriptive nature of SNA. I describe the interactions occurring among a set of agents in condition A and observe the interactions in a network of the same agents in another condition B. The causal influence on the network comes from conditions A and B acting on those agents, while the descriptive nature of SNA is kept as a whole. Hence, I compare the centrality measures between the two conditions and not(!) within one network. Causality enters only through having two conditions that lead to two different networks, each of which is described by SNA.

The reference is therefore limited in its relevance and depends on context. Of course, a smaller sample of agents leaves more possibilities for causal interdependencies between them, even when they are influenced by external factors, than a network with more agents, which can additionally be influenced by external factors (which in both cases are not included in the network model). Hence, the calculations of Dablander and Hinne (2019) are just a gimmick that reflects the nature of the specific networks, and the conclusions are very narrow. The conclusions are not generalizable to all applications, which is why I wonder! The authors make their claims regarding "directed acyclic graphs (DAGs)" within one network! Generalizing from this seems aimed at attracting attention, since reason tells us that causality is more difficult to find in a bigger network, because factors (including ones not in the model) can per se influence the mathematical description of a network. Smaller networks offer fewer points of attack for causal influences, whether or not those influences are included in a model. Dablander and Hinne (2019) map this matter of fact. The conclusions drawn are not generalizable and depend heavily on the context of use and application as well as on the model specification. Of course, you are more likely to fail when you look for causality in centrality measures in a big network than in a smaller one.

The causal factors of unknown, complex influence that are varied and/or influence agents in their behaviour are not included in the model of the social network; the social network only includes the observed dependent variables. Of course, correlations drop for some centrality measures when you increase the network size of the observed settings!

Better leave the specific context to the user.

jasp-stats / jasp-issues

Feature request: Additional calculations regarding network analysis #593