Multiplex communities and the emergence of international conflict

Authors: Caleb Pomeroy ^aff001; Niheer Dasandi ^aff002; Slava Jankin Mikhaylov ^aff003
Authors place of work: Department of Political Science, The Ohio State University, Columbus, Ohio, United States of America ^aff001; School of Government, University of Birmingham, Birmingham, United Kingdom ^aff002; Data Science Lab, Hertie School, Berlin, Germany ^aff003
Published in the journal: PLoS ONE 14(10)
Category: Research Article
doi: https://doi.org/10.1371/journal.pone.0223040

Summary

Advances in community detection reveal new insights into multiplex and multilayer networks. Less work, however, investigates the relationship between these communities and outcomes in social systems. We leverage these advances to shed light on the relationship between the cooperative mesostructure of the international system and the onset of interstate conflict. We detect communities based upon weaker signals of affinity expressed in United Nations votes and speeches, as well as stronger signals observed across multiple layers of bilateral cooperation. Communities of diplomatic affinity display an expected negative relationship with conflict onset. Ties in communities based upon observed cooperation, however, display no effect under a standard model specification and a positive relationship with conflict under an alternative specification. These results align with some extant hypotheses but also point to a paucity in our understanding of the relationship between community structure and behavioral outcomes in networks.

Keywords:

Network analysis – Community structure – Graphs – Telecommunications – Democracy – Speech signal processing – Vector spaces – International relations

Introduction

Community structure is a fundamental feature of complex networks. The community detection task consists of the identification of subgraphs where vertices exhibit dense within-group ties relative to out-group ties [1]. These mesostructural patterns shed light on physical, biological, and social networks, with applications ranging from disease surveillance to paper citations [2–9]. Early work on modularity developed a principled assessment of the quality of network divisions [10–12], and the current battery of detection tools permits investigation of multilayer, multiplex, and time-dependent networks, including algorithms that can accommodate signed edges and heterogeneously structured networks [13–18].

For computational social scientists, this methodological expansion permits investigation of theoretical questions that previously posed modeling challenges at the mesostructural level. The enduring debate on the relationship between interconnectedness and conflict in International Relations (IR) is an exemplary case. On the one hand, Jean-Jacques Rousseau believed that “…interdependence breeds not accommodation and harmony, but suspicion and incompatibility” ([19] page 321). More recently, Kenneth Waltz argued that “the fiercest civil wars and the bloodiest international ones have been fought within arenas populated by highly similar people whose affairs had become quite closely knit together” ([20] page 138). On the other hand, Immanuel Kant emphasized “that the growth of interconnectedness demonstrated the existence of the unique human capacity for establishing systems of cooperation…” [21]. More contemporary liberal IR theorists also stress the pacifying effects of interdependence [22, 23].

Empirical investigations of this question typically conceptualize interdependence at the dyad level—such as trade flow ratios between states v_i and v_j—and infer relationships to conflict via (generalized) linear models, e.g. [24, 25]. This conceptualization implies, for example, that if three states v_i, v_j, and v_k enjoy a closed triadic cooperation agreement and v_i reneges on the agreement, the exit of state v_i from the commitment to v_j is independent of the exit of state v_i from the commitment to v_k. This introduces statistical issues associated with the use of dyads to study k-adic phenomena [26] and misses the fundamental mechanism of theoretic interest, or as Lupu & Traag ([27] page 1012) put it: “…[scholars] have assumed independence in order to study interdependence”. Indeed, it has been suggested that until we “create and test more complex models, we are not likely to make theoretical progress in sorting out this question” ([28] page 56).

We draw upon two recent developments relevant to the question of interconnectedness and conflict. First, in international politics, an emerging literature deploys community detection algorithms to examine the role of trade, democracy, and intergovernmental organization dependencies [27, 29], as well as separate attention to alliances [13] and UN votes [30]. The common intuition underlying each of these studies is that the community structure of the international system is an underdeveloped predictor of behavioral outcomes. Second, recent findings in the broader network cooperation literature suggest that community structure helps to explain the emergence and maintenance of cooperation on graphs [31, 32] and that multilayer and multiplex structure fosters cooperative stability [33, 34]. These findings are important for network analytic approaches to international politics, because in contrast to laboratory settings with well-mixed populations, states are indeed embedded in multiple layers of potentially interdependent relations. The network cooperation literature, however, has less to say about the relationship between multiplex community structure and other behavioral outcomes, such as conflict.

This paper employs advances in multilayer community detection to locate dense clusters of states and then inferentially models these communities against the emergence of conflict in the international system. Previous work finds pacifying effects of community membership in the traditional Kantian-inspired foci of trade, democracy, and intergovernmental organization networks [27, 29]. We innovate through attention to data beyond these networks in order to better define the scope of the beneficial effects of community membership on conflict: does the broader cooperative mesostructure of the international system display similar effects, or are previous findings contingent on Kantian-based networks in particular? We consider weaker signals of expressed affinity in the United Nations (UN), as well as stronger signals of observed bilateral cooperation agreements. For the former, we employ layers of UN votes and speeches. For the latter, we search across network layers of science, military, commodity, fishery, and telecommunication cooperation agreements.

The results suggest the following. First, diplomatic cohesion in UN votes and speeches associates negatively with conflict onset. That is, the presence of an affinity community tie in a given dyad correlates with a decrease in conflict likelihood within that dyad. This result provides an extension to a previous finding based upon UN votes alone through the addition of diplomatic speeches in a multilayer setting [30]. Second, states embedded in cooperation communities appear no more or less likely to engage in conflict under a standard model specification and are more likely to engage in conflict under an alternative specification. This finding contrasts with the often implicit assumption that cooperation community membership reduces the likelihood of conflict amongst members. Furthermore, states who bridge multiple cooperation communities are significantly more likely to experience conflict. These findings lend some support for extant hypotheses but also point to a paucity in current knowledge about the relationship between community structure and behavioral outcomes in social systems.

Results

Community detection procedure

We follow recent work in conceptualizing the international system as a multilayer network [29]: a network representation where nodes are connected across layers of different tie sets [35–37]. Whereas a single mode representation is especially useful for the isolation of specific theoretical mechanisms (e.g. a trade tie’s impact on Y), we instead aim to capture broader cooperative structure that might exist across layers of the international system. Yet, because innumerable slices of relationships exist in international politics, the resulting communities can quickly become uninterpretable. We therefore focus on two types of multilayer graphs based upon data previously scrutinized by network analysts in IR, namely bilateral cooperation agreements and position affinity expressed in the UN. The former represent stronger signals of observed country-country relations, whereas the latter represent weaker, correlational signals of affinity in expressed preferences.

For strong signal communities, we employ five cooperation topics from the World Treaty Index: science, military, commodities, fisheries, and telecommunications [38, 39]. Previous research finds that network dynamics in part drive bilateral agreement formation and evolution on these topics [40]. These topics represent key areas of coordination [41, 42] and help to avoid redundancy across layers due to their relative orthogonality. For example, state motivations behind fishery agreement formation differ from motivations behind science agreement formation [43]. This topical diversity increases confidence that detected communities represent groups of intensive cooperators across issue areas.

For each year, we take the multilayer graph G t = ( V , E ) = { G t 1 , … , G t k }, i ∈ {1, …, k} where G t i = ( V , E ) is a single elementary network layer that corresponds to one of the five distinct topics. Each layer contains an aligned node set V = V with an undirected and unweighted edge e_ij = e_ji = (v_i, v_j) ∈ E between nodes v_i and v_j if there exists a bilateral agreement between these two countries in layer G t i. We use a moving window such that an edge is present if a bilateral agreement was initiated within the past ten years, and we assume that the edge dissipates outside of this window. This provides a sequence of yearly multilayer graphs S G t = { G 1 , … , G t }.

For weak signal communities, we employ UN votes and a recently released dataset of speeches delivered during the annual UN General Debate [44]. UN votes represent a key source of information about the expressed preferences of states [45–48]. Furthermore, previous network research examines UN voting communities in detail [49], including the relationship between community membership and conflict [30]. In contrast to previous community detection work, however, we employ country ideal points rather than raw UN votes. Noting methodological challenges associated with UN votes, Bailey et al [48] propose the use of unidimensional ideal points estimated from a dynamic ordinal spatial model. Thus, ideal points derive from a more theoretically-informed model of vote choice given a state’s preferences. For each year, we calculate the Euclidean distance between each country pair’s ideal points, converting each distance to a similarity score in order to construct a V × V similarity matrix.

We utilize speeches as the second graph layer in order to align with recent political science research that turns to text data in order to more accurately capture the expressed positions of political actors, e.g [50–53]. UN votes often display high cohesion, with states casting votes along regional bloc lines, for ceremonial purposes, or because specific agenda items arise beyond the state’s control [44, 47]. State speeches, on the other hand, provide delegations with greater flexibility to express positions. For example, in 1974 Greece and Turkey voted the most similarly amongst NATO members in the UN General Assembly (with ideal points of 0.68 and 0.42, respectively). Yet, that same year the two country’s air forces engaged in a dogfight which led to the death of a Turkish pilot during tensions that arose from Turkey’s invasion of Cyprus. In contrast to their votes, their UN General Debate speeches revealed these tensions, with each blaming the other for the crisis. The Supplementary Information (SI) describes this example and others, such as India and Pakistan who engaged in a border conflict in 1999, in greater depth. Thus, the addition of the speech layer helps to capture greater heterogeneity in state positions relative to previous community detection work that focuses on votes alone.

We first embed the speeches into vector space using the Global Vectors for Word Representation (GloVe) algorithm. Word embeddings encode more semantically interesting speech patterns compared to the typical bag-of-words representation of text data [54]. For each year, we utilize the Word Mover’s Distance (WMD) in order to locate distances between states’ speeches [55]. WMD conceptualizes the state-state speech distance problem as one of minimizing the required effort to move one state’s speech embeddings to the vector space location of another state, which we in turn convert to similarity scores [55]. This yields a V × V speech similarity matrix for each year. Because the resultant vote and speech matrices are densely populated, with each state seemingly connected to every other state, we follow previous work that employs mutual k-nearest neighbor graph clustering to yield candidates for multilayer community detection [30, 56]. The notation for the sequence of multilayer weak signal graphs is identical to the bilateral agreements outlined above.

With these strong and weak signal candidate layers in hand, we set about detecting multiplex communities. In international politics, different layers might exhibit heterogenous structure. As mentioned, states might initiate bilateral agreements for topic-dependent reasons, and the vote and speech matrices in Fig 1(A) exhibit heterogenous similarity structures. Most community detection methods, however, posit the same community structure across network layers. Therefore, we employ a newly developed method that can accommodate heterogenous structure, namely the Multilayer Extraction procedure [17]. The algorithm identifies densely connected vertex-layers in multilayer networks through a significance-based score that compares the connectivity of an observed vertex-layer set to a fixed degree random graph model. The introductory paper provides technical details [17].

**Fig. 1. Multilayer community detection procedure.**

Community detection on yearly instances of strong and weak multilayer networks yields separate sequences of detected community memberships. Single-mode projections of these memberships produce strong and weak multiplex communities for each year, formally M s t r o n g = { M s t r o n g 1970 , … , M s t r o n g t } and M w e a k = { M w e a k 1970 , … , M w e a k t }, t ∈ {1970, …, 1990}, with ties weighted by the number of common communities between two states. The year 1970 represents the beginning of the sequence, because this is the first available year in the corpus of speeches. The year 1990 serves as the final year in the sequence, because previous international conflict research finds evidence that the structural changes associated with the end of the Cold War led to changes in the causal processes that underlie conflict [57]. Thus, we avoid imposing a model that bridges into the post-Cold War era to avoid the conflation of data generating processes. Furthermore, World Treaty Index data availability declines from the 1990s onwards (see [40] page 774).

Fig 1(A) presents the pipeline for the Multilayer Extraction procedure. Fig 1(B) and 1(C) display the number of detected weak and strong signal communities over time, respectively. Point weights indicate the percentage of states that belong to at least one community. Point shading indicates the percentage of nodes that bridge at least two communities. These plots provide a novel glimpse into international polarity with respect to the number of clusters in the system and the ties within and across clusters [58]. Larger points and larger numbers of communities suggest a system in which states are more exhaustively divided into groups (i.e. poles). Lighter points indicate a more modular system with fewer bridging ties (i.e. a system that is more polarized given the constellation of poles). The communities detected from cooperation agreements suggest that states are less exhaustively divided into clusters towards the end of the Cold War, evidenced by a decline in the number of communities and a smaller percentage of states assigned to a community. The communities detected from signals of diplomatic affinity suggest a mean increase in the number of communities over time, with a relatively steady and large percentage of states assigned to a community. Further, greater heterogeneity exists in the weak signal graphs, evidenced by a consistently higher number of communities relative to cooperation agreements over time.

The emergence of interstate conflict

The detected multiplex communities represent the following. The communities based upon stronger signals represent tightly-knit groups of cooperators, taking into account the relational structure at each layer of the multilayer cooperation network. The communities based upon weaker signals represent clusters of states that exhibit similar expressed preferences in the UN, taking into account the similarity structure in the speech and voting layers. Thus, these multiplex communities provide a useful description of the cooperative mesostructure of the international system.

We now investigate the relationship between these communities and the onset of violent conflict in IR. We first consider the effect of community ties at the system level. Then, we restrict the node set to only the most active states in the system to investigate the ways in which different structural roles within these communities correlate with conflict onset. Fig 2 provides a stylized representation of the tie -⁠ and node-level effects under consideration.

As noted in the Introduction, the networked nature of IR often implies a nonindependence of observations that renders logistic regression unsuitable [59]. To circumvent these inferential challenges, we employ a temporal extension to the exponential random graph model [(T)ERGM] [60, 61]. ERGMs are generative models for network data [62], and their results can be interpreted similarly to coefficients from logistic regression: the coefficients provide an estimate for the change in the log-odds likelihood of observing a tie given a one unit change in the independent variable. The outcome network of interest is a yearly snapshot of the conflict onset network. An undirected tie between two states v_i and v_j exists if conflict was initiated in a given year. Model 1 follows a specification by Pauls & Cranmer [30] that contains a battery of covariates traditionally associated with conflict onset. This provides a baseline specification and brings our results into proximity with extant findings. The weak and strong multiplex communities then enter the model as an edge-level covariate in Models 2 and 3, respectively. Table 1 presents these system-level results.

**Tab. 1. TERGMs: Analysis of international conflict onset, 1970-1990.**

The coefficient sizes and directions are substantively reasonable. The edges term can be interpreted akin to the intercept term in a logit model. For example, the probability of observing conflict within a given dyad is approximately 0.0004 in Model 1. The significance and coefficient directions of the endogenous network statistics of alternating 2-stars and geometrically weighted edgewise shared partners (GWESP) indicate that conflict tends to cluster within the network. Further, traditional IR covariates display expected signs and effect sizes. For example, two contiguous states display a ceteris paribus 3.78 times higher log odds of conflict onset relative to two non-contiguous states, i.e. an odds increase of 43.82.

In Model 2, the coefficient on ties in weak signal communities is significant and negative. This indicates that conflict is less likely between countries that display strong cohesion in their votes and speeches. Specifically, a given dyad’s log-odds of experiencing conflict decreases by -0.60 for each additional weak signal community tie within that dyad, all else equal. In Model 3, the coefficient on ties in strong signal communities fails to reach significance. This implies that states with ties in the multiplex cooperation network are no more or less likely to engage in conflict than states without cooperative ties. Model 4 presents a more parsimonious specification (i.e. the omission of direct contiguity) in order to examine the effect of these strong community ties if one were to be observed. The omission of direct contiguity is also intuitive to the extent that cooperation agreements encode regional dynamics (e.g. telecommunication agreements often include neighboring countries), and thus the two variables might compete to explain variance. Under this specification, the cooperation community ties become significant and positive. This finding would indicate that a given dyad experiences an increase in the likelihood of conflict given the presence of a cooperation community tie (or ties) within the dyad. Although the absence of contiguity in this model leads us to caution against over-interpretation of this result, the finding is consistent with the absence of discernible conflict suppression effects given the presence of cooperation agreements.

With these system-level results in hand, we next investigate the different structural roles that members serve in these communities. This provides a more granular understanding of the mechanisms through which conflict might emerge and diffuse given the structure of the community. For this analysis, we use the UN as a pivot point and restrict the node set to only those states who voted and delivered a General Assembly speech in a given year. This criteria helps to identify relatively active states in international politics. We note that the results in Table 1 are substantively unchanged by this difference in node set.

Two potential mechanisms are of interest. First, the joint community member effect captures states that are in the same community and no other community. For strong signal communities, these states display the highest levels of cooperative dependency, because they lack ties to states in other communities. For weak signal communities, these states display high levels of intragroup diplomatic affinity and lack appreciable connections to other groups in the UN. Second, the community bridge effect captures states who bridge across more than one community. For strong signal communities, these states are less dependent on any single community but are potentially more vulnerable to conflict due to their exposure to multiple communities. For weak signal communities, these states exhibit relatively pragmatic positions that bridge multiple groups in the UN. Table 2 presents these results.

**Tab. 2. TERGMs: Analysis of node effects, 1970-1990.**

For weak signal communities, the results presented in Model 5 indicate a lack of effect for both joint community members and community bridges. This implies that weak community members are no more or less likely to engage in conflict with each other and that bridges are no more or less likely to experience conflict. For strong signal communities, the results of Model 6 suggest a lack of joint community member effect but a significant and positive relationship between conflict and strong community bridges. This implies that states who bridge multiple communities are more likely to experience conflict and perhaps provide a pathway through which conflict might diffuse across communities.

Discussion

The above results represent the first evidence on the relationship between multiplex communities and the onset of international conflict beyond previous attention to the Kantian triad (see [29]). For communities detected across layers of UN votes and speeches, the results confirm and extend the finding of a previous study based upon voting behavior alone: diplomatic cohesion appears to negatively associate with conflict in the international system [30]. Although the result is substantively similar, the addition of the speech layer provides useful information on the expressed preferences of states that is otherwise absent in roll call data alone.

The communities detected across layers of cooperation agreements present a more challenging picture. The most optimistic model specification yields a lack of association between community ties and conflict onset. A more pessimistic specification yields a positive association between cooperative ties and conflict onset. This result is surprising, because cooperation and conflict are often thought to display an inverse relationship, see e.g. [22, 63]. At least two mechanisms might explain this result. First, states at times employ bilateral agreements to manage contentious issues [64]. When agreements fail, this tie could provide an indicator for potential conflict onset. Second, those states who interact more often or are most active in agreement formation might face greater opportunities for disputes to arise. Similar arguments have been made in the case of alliance formation and geographically contiguous dyads [65–67]. For example, Traag & Bruggeman [13] uncover a similar result in the assessment of their detection algorithm on alliance data, namely that conflict tends to emerge within detected communities. As Waltz ([20] page 138) pointed out, “[i]t is impossible to get a war going unless the potential participants are somehow linked.” Either way, this finding calls into question the extent to which cooperators enjoy more peaceful outcomes than non-cooperators.

This communities and conflict puzzle is in part empirically explained by attention to structural roles within communities. Conflict diffusion via network ties is a well-established pattern in IR [68]. This study augments this finding: states that bridge cooperative communities are especially conflict prone, and this bridge points to a path through which conflict might diffuse to clusters of states. Those states with exclusive membership in a single community, however, are no more or less likely to engage in conflict with community members. This finding reiterates the open question surrounding interdependence and conflict. Further, this study finds scant evidence that community roles in the UN explain meaningful variance in conflict outcomes: states exclusively aligned with a single bloc and states who pragmatically bridge multiple communities enjoy no detectable change in conflict likelihood. This finding suggests that community membership is more important for conflict outcomes than the specific role that countries serve within communities in the UN.

Taken together, these results suggest at least two implications. First, for IR cooperation research, increases in tie density do not necessarily lead to decreased levels of conflict. Indeed, previous network science findings indicate that cooperative stability requires enough structure to support cooperation but not so much as to stifle it [32]. Second, for network cooperation research, future work could more rigorously explicate the theoretical mechanisms through which cooperation might suppress conflict. Cooperators tend to cluster on graphs [69]. The above analysis suggests that conflict might diffuse via nodes that bridge these clusters, which could paradoxically increase the likelihood that community members face conflict. Nonetheless, this study’s results reiterate the present paucity of observational findings on the relationship between communities and outcomes in social systems. Domain-specific empirical applications will help to narrow the scope of this problem whilst shedding light on the utility of new detection algorithms for questions of computational social science interest.

Materials and methods

Data

As described above, we utilize bilateral cooperation agreements and United Nations (UN) votes and speeches in order to construct the strong and weak signal multilayer graphs, respectively. We obtain the former from the World Treaty Index [38, 39], which provides the most complete record of bilateral agreements in international relations (IR). These data represent a rich source of information about international cooperation (see e.g. Kinne [40]) and have previously been used to operationalize peaceful relations between countries (see e.g. Kasten [70]). We specifically include the treaties under the categories of “Science and Technology” (7SCIEN), “Military Procedures” (9MILIT), “Raw Materials Trade” (3COMMO), “Fisheries” (8FISH), and “Telecommunications” (6TELCO). The dataset contains an edge list of dyads that are party to the treaty, as well as the year that the treaty was signed and a qualitative description of the treaty’s purpose.

For the weak signal data, we employ UN votes and UN General Debate speeches. For roll call data, we utilize yearly country ideal points estimated on a single dimension via a dynamic ordinal spatial model [48]. This model provides a unidimensional reduction of countries’ yea, nay, or abstain decisions on a variety of UN agenda voting items, often interpreted in political science to be a useful indication of a country’s expressed preferences or positions with respect to a given topic. The employment of these ideal points helps to avoid the issues posed by the high levels of voting similarity in the UN when attempting to detect communities, as identified in Macon et al [49]. Furthermore, in contrast to more common unipartite projections of bipartite graphs based on similarity measures (see e.g. Yildim & Coscia [71]), the ideal points are based on a more explicit theoretical model of vote choice given a state’s preferences (see Bailey et al [48]). These data are available online at Harvard Dataverse: hdl:1902.1/12379. In addition, we utilize the record of annual speeches delivered by country representatives—predominantly heads of state or government—during the annual UN General Debate [44]. These speeches are stored as plain text files with associated metadata and are available online at Harvard Dataverse: doi.org/10.7910/DVN/0TJX8Y.

The paper’s main text describes the vote and speech similarity measures that we employ. In order to move from similarity matrices to candidate adjacency matrix layers for multilayer community detection, we utilize a mutual k-nearest neighbor graph approach (see e.g. Ozaki et al [56]). We employ the mutual k-nearest neighbor graph approach in order to ensure that our replication procedure follows closely the original clustering procedure of Pauls & Cranmer [30], such that any differences in results can be attributed to the addition of the speeches layer in the multilayer setting. For useful discussions about backboning methods and graph sparsification, see e.g. Serrano et al [72], Slater [73], and Zhang et al [74].

After the performance of community detection on the strong and weak signal graphs, we model the detected communities against the onset of violent conflict in IR. We utilize data from a previous study by Pauls & Cranmer [30] that looked at a similar question as the current study, and we thank the authors for sharing these materials. The outcome network of interest is constructed from conflict onset data from the Correlates of War (COW) project’s Militarized Interstate Dispute (MID) dataset (v4.1) [75]. An undirected tie is considered to be present if a MID of level 4 or 5 was initiated between a dyad during the year of interest. These are the two levels of greatest hostility covered in the dataset, with the former corresponding to such actions as occupation of territory or declaration of war, and the latter corresponding to the initiation of war. More details on the conflict data are available online at the Inter-University Consortium for Political and Social Research: doi.org/10.3886/ICPSR24386.v1.

The inferential model also includes the following covariates. Democracy is a node attribute equal to 1 if the country’s Polity IV score is greater than or equal to 7. Direct contiguity enters the model as an indicator variable equal to 1 if two countries share a geographic border or share a sea border within 400 miles of each other. Capabilities ratios capture the ratio of two countries’ Composite Index of National Capabilities scores, which utilizes various measures of state capabilities, including population, military expenditures, and iron and steel production. Trade dependence is operationalized as the total yearly trade flow from v_i to v_j, divided by the GDP of v_i. Finally, security and economic IGO dependence are operationalized as the total number of third-party states to which v_i and v_j are jointly connected through security and economic-oriented intergovernmental organizations, respectively. Pauls & Cranmer [30] provide more details on these variables.

Models

To locate vector space representations of the corpus, we utilize the Stanford NLP group’s Global Vectors for Word Representation (GloVe) unsupervised learning algorithm [54]. GloVe is a popular log bilinear, weighted least squares model that trains on global word-word co-occurence counts to make efficient use of the corpus statistics. Because it factorizes a word-context co-occurrence matrix, it shares affinities with traditional count methods like latent semantic analysis or principle component analysis. First, the raw texts are stemmed and trimmed of any tokens that appear fewer than 5 times or in fewer than 5% of speeches across the corpus. This pre-processing was found to improve the quality of the located embeddings. We use a context window of 5 (i.e. 5 words before and 5 words after the target feature). To tune the model’s parameters, we fit the model to word vectors of size 50, 100, and 200 with maximum term co-occurrences of 15 and 25 for the weighting function. This yields “main” and “context” vectors which are subsequently averaged together per the suggestion of the original GloVe paper [54] to locate the final embedding space.

We then calculate the distances between each pair of states in each year using the relaxed variant of the Word Mover’s Distance (RWMD) [55]. This measure utilizes the embedding space and each country’s term-document matrix to measure the cumulative distance required to transform one state’s speech point cloud into that of another state. This procedure helps to ensure that distances are not simply a function of the use of different words, but rather differences in the semantic structure of two countries’ speeches. The SI presents more details on this procedure. We use the quanteda package [76] for corpus ingestion, and the text2vec package [77] for fitting the GloVe models and calculating the RWMDs. All analysis is conducted in the R statistical programming environment [78].

To model the evolution of the conflict onset network, we employ a temporal extension to the exponential random graph model [(T)ERGM] [60, 61]. Originally proposed by Wasserman & Pattison [62] (and also known as p* models), ERGMs are generative models for the performance of inference on network data that have found widespread employment across the network and social sciences [79–81]. The model used here assesses uncertainty using a bootstrap approach proposed by Desmarais & Cranmer [82, 83], and the models were fitted using the btergm package [84] in the R statistical programming environment [78]. Regarding interpretation, our results speak to the likelihood of conflict between two states v_i and v_j given the intensity of cooperation between v_i and v_j. We do not extrapolate these results further, such as the likelihood of conflict between v_i and some third party state v_k given the cooperative activity of v_i and v_j. At the same time, the results do permit the conclusion that highly active states—i.e. states with several community ties—would experience changes in the likelihood of conflict onset commensurate with the number of community ties. See Desmarais & Cranmer [85] for more information about the interpretation of ERGMs with respect to various levels of the network.

In addition to the variables outlined above, we specify the following variables in the model. The edges term represents the total number of ties in the graph, akin to the intercept term in regression models. Alternating 2-stars adds alternating sequences of two-paths (i.e. unclosed triangles) to the model, and 4-cycles captures the existence of four nodes connected in a box-like structure, namely e_iv = e_iu = e_jv = e_uj = 1 [86]. Finally, geometrically weighted edgewise shared partners (GWESP) adds a statistic equal to the geometrically down-weighted shared partner distribution, here with a fixed decay parameter of 0. The latter three of these statistics capture potential clustering in the conflict onset network. The community detection results depend on a number of choices surrounding data representation and parameter selection, such as the hyperparameters for the embedding model and the proportion of vertices used to initialize the Multilayer Extraction algorithm. To enhance robustness, we conduct the analysis using the different GloVe hyperparameters described above, as well as vertex initialization proportions of .20, .25, and .30 during the Multilayer Extraction procedure for the strong signal graphs. The results presented in the paper’s Emergence of Interstate Conflict section represent the mean results of these analyses.

Supporting information

S1 Text [pdf]

S1 Table [pdf]
Nearest features based on cosine similarity.

S1 Fig [pdf]
t-SNE projection.

S2 Fig [pdf]
WMD abstract example.

S3 Fig [pdf]
Models 1 and 2 in-sample goodness-of-fit.

S4 Fig [pdf]
Models 3 and 4 in-sample goodness-of-fit.

S5 Fig [pdf]
Models 5 and 6 in-sample goodness-of-fit.

S6 Fig [pdf]
Test set predictive accuracy.

Zdroje

1. Girvan M, Newman ME. Community structure in social and biological networks. Proceedings of the National Academy of Sciences. 2002;99(12):7821–7826. doi: 10.1073/pnas.122653799

2. Salathé M, Jones JH. Dynamics and control of diseases in networks with community structure. PLOS Computational Biology. 2010;6(4):e1000736. doi: 10.1371/journal.pcbi.1000736 20386735

3. Calcagno V, Demoinet E, Gollner K, Guidi L, Ruths D, de Mazancourt C. Flows of research manuscripts among scientific journals reveal hidden submission patterns. Science. 2012; p. 1227833. doi: 10.1126/science.1227833 23065906

4. Menche J, Sharma A, Kitsak M, Ghiassian SD, Vidal M, Loscalzo J, et al. Uncovering disease-disease relationships through the incomplete interactome. Science. 2015;347(6224):1257601. doi: 10.1126/science.1257601 25700523

5. Lima-Mendez G, Faust K, Henry N, Decelle J, Colin S, Carcillo F, et al. Determinants of community structure in the global plankton interactome. Science. 2015;348(6237):1262073. doi: 10.1126/science.1262073 25999517

6. Huttlin EL, Bruckner RJ, Paulo JA, Cannon JR, Ting L, Baltier K, et al. Architecture of the human interactome defines protein communities and disease networks. Nature. 2017;545(7655):505. doi: 10.1038/nature22366 28514442

7. Strano E, Viana MP, Sorichetta A, Tatem AJ. Mapping road network communities for guiding disease surveillance and control strategies. Scientific Reports. 2018;8(1):4744. doi: 10.1038/s41598-018-22969-4 29549364

8. Waniek M, Michalak TP, Wooldridge MJ, Rahwan T. Hiding individuals and communities in a social network. Nature Human Behaviour. 2018;2(2):139. doi: 10.1038/s41562-017-0290-3

9. Trujillo CM, Long TM. Document co-citation analysis to enhance transdisciplinary research. Science Advances. 2018;4(1):e1701130. doi: 10.1126/sciadv.1701130 29308433

10. Newman ME, Girvan M. Finding and evaluating community structure in networks. Physical Review E. 2004;69(2):026113. doi: 10.1103/PhysRevE.69.026113

11. Duch J, Arenas A. Community detection in complex networks using extremal optimization. Physical Review E. 2005;72(2):027104. doi: 10.1103/PhysRevE.72.027104

12. Newman ME. Modularity and community structure in networks. Proceedings of the National Academy of Sciences. 2006;103(23):8577–8582. doi: 10.1073/pnas.0601602103

13. Traag VA, Bruggeman J. Community detection in networks with positive and negative links. Physical Review E. 2009;80(3):036115. doi: 10.1103/PhysRevE.80.036115

14. Mucha PJ, Richardson T, Macon K, Porter MA, Onnela JP. Community structure in time-dependent, multiscale, and multiplex networks. Science. 2010;328(5980):876–878. doi: 10.1126/science.1184819 20466926

15. Benson AR, Gleich DF, Leskovec J. Higher-order organization of complex networks. Science. 2016;353(6295):163–166. doi: 10.1126/science.aad9029 27387949

16. Su Y, Wang B, Cheng F, Zhang L, Zhang X, Pan L. An algorithm based on positive and negative links for community detection in signed networks. Scientific Reports. 2017;7(1):10874. doi: 10.1038/s41598-017-11463-y 28883663

17. Wilson JD, Palowitch J, Bhamidi S, Nobel AB. Community Extraction in Multilayer Networks with Heterogeneous Community Structure. Journal of Machine Learning Research. 2017;18(149):1–49.

18. Zhai X, Zhou W, Fei G, Liu W, Xu Z, Jiao C, et al. Null Model and Community Structure in Multiplex Networks. Scientific Reports. 2018;8(1):3245. doi: 10.1038/s41598-018-21286-0 29459696

19. Hoffmann S. Rousseau on war and peace. American Political Science Review. 1963;57(2):317–333. doi: 10.2307/1952825

20. Kenneth W. Theory of international politics. Addison-Wesley; 1979.

21. Linklater A. Global civilizing processes and the ambiguities of human interconnectedness. European Journal of International Relations. 2010;16(2):155–178. doi: 10.1177/1354066109350796

22. Doyle MW. Liberalism and world politics. American Political Science Review. 1986;80(4):1151–1169.

23. Oneal JR, Russett B. The Kantian peace: The pacific benefits of democracy, interdependence, and international organizations, 1885–1992. World Politics. 1999;52(1):1–37. doi: 10.1017/S0043887100020013

24. Barbieri K. Economic interdependence: A path to peace or a source of interstate conflict? Journal of Peace Research. 1996;33(1):29–49. doi: 10.1177/0022343396033001003

25. Oneal JR, Oneal FH, Maoz Z, Russett B. The liberal peace: Interdependence, democracy, and international conflict, 1950-85. Journal of Peace Research. 1996;33(1):11–28. doi: 10.1177/0022343396033001002

26. Poast P. (Mis)Using dyadic data to analyze multilateral events. Political Analysis. 2010;18(4):403–425. doi: 10.1093/pan/mpq024

27. Lupu Y, Traag VA. Trading communities, the networked structure of international relations, and the Kantian peace. Journal of Conflict Resolution. 2013;57(6):1011–1042. doi: 10.1177/0022002712453708

28. McMillan SM. Interdependence and conflict. Mershon International Studies Review. 1997;41(Supplement_1):33–58. doi: 10.2307/222802

29. Cranmer SJ, Menninga EJ, Mucha PJ. Kantian fractionalization predicts the conflict propensity of the international system. Proceedings of the National Academy of Sciences. 2015;112(38):11812–11816. doi: 10.1073/pnas.1509423112

30. Pauls SD, Cranmer SJ. Affinity communities in United Nations voting: Implications for democracy, cooperation, and conflict. Physica A: Statistical Mechanics and its Applications. 2017;484 : 428–439. doi: 10.1016/j.physa.2017.04.177

31. Lozano S, Arenas A, Sanchez A. Mesoscopic structure conditions the emergence of cooperation on social networks. PLoS ONE. 2008;3(4):e1892. doi: 10.1371/journal.pone.0001892 18382673

32. Gianetto DA, Heydari B. Network modularity is essential for evolution of cooperation under uncertainty. Scientific Reports. 2015;5 : 9340. doi: 10.1038/srep09340 25849737

33. Gómez-Gardenes J, Reinares I, Arenas A, Floría LM. Evolution of cooperation in multiplex networks. Scientific Reports. 2012;2 : 620. doi: 10.1038/srep00620 22943006

34. Wang Z, Szolnoki A, Perc M. Evolution of public cooperation on interdependent networks: The impact of biased utility functions. EPL (Europhysics Letters). 2012;97(4):48001.

35. Kivelä M, Arenas A, Barthelemy M, Gleeson JP, Moreno Y, Porter MA. Multilayer networks. Journal of Complex Networks. 2014;2(3):203–271. doi: 10.1093/comnet/cnu016

36. Aleta A, Moreno Y. Multilayer networks in a nutshell. Annual Review of Condensed Matter Physics. 2018.

37. Porter MA. What is… a Multilayer Network? Notices of the AMS. 2018;65(11).

38. Pearson GJ. Rohn’s World Treaty Index: Its Past and Future. International Journal of Legal Information. 2001;29 : 543. doi: 10.1017/S0731126500001025

39. Poast P, Bommarito MJ, Katz DM. The Electronic World Treaty Index: Collecting the Population of International Agreements in the 20th Century; 2010.

40. Kinne BJ. Network dynamics and the evolution of international cooperation. American Political Science Review. 2013;107(4):766–785. doi: 10.1017/S0003055413000440

41. Krasner SD. Global communications and national power: Life on the Pareto frontier. World Politics. 1991;43(3):336–366. doi: 10.2307/2010398

42. Morrow JD. Modeling the forms of international cooperation: distribution versus information. International Organization. 1994;48(3):387–423. doi: 10.1017/S0020818300028241

43. Haas EB. Why collaborate? Issue-linkage and international regimes. World Politics. 1980;32(3):357–405. doi: 10.2307/2010109

44. Baturo A, Dasandi N, Jankin Mikhaylov S. Understanding State Preferences with Text As Data: Introducing the UN General Debate Corpus. Research and Politics. 2017;4(2):1–9. doi: 10.1177/2053168017712821

45. Voeten E. Clashes in the Assembly. International Organization. 2000;54(2):185–215. doi: 10.1162/002081800551154

46. Voeten E. Resisting the lonely superpower: Responses of states in the United Nations to US dominance. Journal of Politics. 2004;66(3):729–754. doi: 10.1111/j.1468-2508.2004.00274.x

47. Voeten E. Data and Analyses of Voting in the United Nations General Assembly. In: Reinalda B, editor. Routledge Handbook of International Organization. Routledge; 2013. p. 54.

48. Bailey MA, Strezhnev A, Voeten E. Estimating dynamic state preferences from United Nations voting data. Journal of Conflict Resolution. 2017;61(2):430–456. doi: 10.1177/0022002715595700

49. Macon KT, Mucha PJ, Porter MA. Community structure in the united nations general assembly. Physica A: Statistical Mechanics and its Applications. 2012;391(1-2):343–361. doi: 10.1016/j.physa.2011.06.030

50. Lauderdale BE, Clark TS. Scaling politically meaningful dimensions using texts and votes. American Journal of Political Science. 2014;58(3):754–771. doi: 10.1111/ajps.12085

51. Kim IS, Londregan J, Ratkovic M. Estimating Spatial Preferences from Votes and Text. Political Analysis. 2018;26(2):210–229. doi: 10.1017/pan.2018.7

52. Peterson A, Spirling A. Classification Accuracy as a Substantive Quantity of Interest: Measuring Polarization in Westminster Systems. Political Analysis. 2018;26(1):120–128. doi: 10.1017/pan.2017.39

53. Lauretig AM. Identification, Interpretability, and Bayesian Word Embeddings. arXiv preprint arXiv:190401628. 2019.

54. Pennington J, Socher R, Manning CD. GloVe: Global Vectors for Word Representation. In: Empirical Methods in Natural Language Processing (EMNLP). vol. 14; 2014. p. 1532–1543.

55. Kusner M, Sun Y, Kolkin N, Weinberger K. From word embeddings to document distances. In: International Conference on Machine Learning; 2015. p. 957–966.

56. Ozaki K, Shimbo M, Komachi M, Matsumoto Y. Using the mutual k-nearest neighbor graphs for semi-supervised classification of natural language data. In: Proceedings of the fifteenth conference on computational natural language learning. Association for Computational Linguistics; 2011. p. 154–162.

57. Jenke L, Gelpi C. Theme and variations: Historical contingencies in the causal model of interstate conflict. Journal of Conflict Resolution. 2017;61(10):2262–2284. doi: 10.1177/0022002715615190

58. Bueno de Mesquita B. Systemic polarization and the occurrence and duration of war. Journal of Conflict Resolution. 1978;22(2):241–267. doi: 10.1177/002200277802200203

59. Cranmer SJ, Desmarais BA, Menninga EJ. Complex dependencies in the alliance network. Conflict Management and Peace Science. 2012;29(3):279–313. doi: 10.1177/0738894212443446

60. Robins G, Pattison P. Random graph models for temporal processes in social networks. Journal of Mathematical Sociology. 2001;25(1):5–41. doi: 10.1080/0022250X.2001.9990243

61. Hanneke S, Fu W, Xing EP, et al. Discrete temporal models of social networks. Electronic Journal of Statistics. 2010;4 : 585–605. doi: 10.1214/09-EJS548

62. Wasserman S, Pattison P. Logit models and logistic regressions for social networks: I. An introduction to Markov graphs andp. Psychometrika. 1996;61(3):401–425. doi: 10.1007/BF02294547

63. Pinker S. The better angels of our nature: Why violence has declined. Penguin Group USA; 2012.

64. Bueno de Mesquita B. The war trap. New Haven, CT: Yale University Press; 1981.

65. Gibler DM, Vasquez JA. Uncovering the dangerous alliances, 1495–1980. International Studies Quarterly. 1998;42(4):785–807. doi: 10.1111/0020-8833.00106

66. Bremer SA. Dangerous dyads: Conditions affecting the likelihood of interstate war, 1816-1965. Journal of Conflict Resolution. 1992;36(2):309–341. doi: 10.1177/0022002792036002005

67. Braithwaite A. Location, location, location… identifying hot spots of international conflict. International Interactions. 2005;31(3):251–273. doi: 10.1080/03050620500294234

68. Zhukov YM, Stewart BM. Choosing your neighbors: Networks of diffusion in international relations. International Studies Quarterly. 2013;57(2):271–287. doi: 10.1111/isqu.12008

69. Nowak MA. Five rules for the evolution of cooperation. Science. 2006;314(5805):1560–1563. doi: 10.1126/science.1133755 17158317

70. Kasten L. When less is more: Constructing a parsimonious concept of interstate peace for quantitative analysis. International Studies Review. 2017;19(1):28–52. doi: 10.1093/isr/vix002

71. Yildirim MA, Coscia M. Using random walks to generate associations between objects. PLoS ONE. 2014;9(8):e104813. doi: 10.1371/journal.pone.0104813 25153830

72. Serrano MÁ, Boguná M, Vespignani A. Extracting the multiscale backbone of complex weighted networks. Proceedings of the National Academy of Sciences. 2009;106(16):6483–6488. doi: 10.1073/pnas.0808904106

73. Slater PB. A two-stage algorithm for extracting the multiscale backbone of complex weighted networks. Proceedings of the National Academy of Sciences. 2009;106(26):E66–E66. doi: 10.1073/pnas.0904725106

74. Zhang X, Zhang Z, Zhao H, Wang Q, Zhu J. Extracting the globally and locally adaptive backbone of complex networks. PLoS ONE. 2014;9(6):e100428. doi: 10.1371/journal.pone.0100428 24936975

75. Palmer G, d’Orazio V, Kenwick M, Lane M. The MID4 dataset, 2002–2010: Procedures, coding rules and description. Conflict Management and Peace Science. 2015;32(2):222–242. doi: 10.1177/0738894214559680

76. Benoit K, Nulty P. quanteda: Quantitative Analysis of Textual Data; 2013.

77. Selivanov D. text2vec: Modern Text Mining Framework for R; 2016.

78. R Core Team. R: A Language and Environment for Statistical Computing; 2017. Available from: https://www.R-project.org.

79. Cranmer SJ, Desmarais BA. Inferential network analysis with exponential random graph models. Political Analysis. 2010;19(1):66–86. doi: 10.1093/pan/mpq037

80. Leifeld P, Schneider V. Information exchange in policy networks. American Journal of Political Science. 2012;56(3):731–744. doi: 10.1111/j.1540-5907.2011.00580.x

81. Almquist ZW, Butts CT. Dynamic network logistic regression: A logistic choice analysis of inter-and intra-group blog citation dynamics in the 2004 US presidential election. Political Analysis. 2013;21(4):430–448. doi: 10.1093/pan/mpt016 24143060

82. Desmarais BA, Cranmer SJ. Consistent confidence intervals for maximum pseudolikelihood estimators. In: Proceedings of the Neural Information Processing Systems 2010 Workshop on Computational Social Science and the Wisdom of Crowds. Citeseer; 2010.

83. Desmarais BA, Cranmer SJ. Statistical mechanics of networks: Estimation and uncertainty. Physica A: Statistical Mechanics and its Applications. 2012;391(4):1865–1876. doi: 10.1016/j.physa.2011.10.018

84. Leifeld P, Cranmer S, Desmarais B. Temporal Exponential Random Graph Models with btergm: Estimation and Bootstrap Confidence Intervals. Journal of Statistical Software. 2018;83(6):1–36. doi: 10.18637/jss.v083.i06

85. Desmarais BA, Cranmer SJ. Micro-level interpretation of exponential random graph models with application to estuary networks. Policy Studies Journal. 2012;40(3):402–434. doi: 10.1111/j.1541-0072.2012.00459.x

86. Snijders TA, Pattison PE, Robins GL, Handcock MS. New specifications for exponential random graph models. Sociological Methodology. 2006;36(1):99–153. doi: 10.1111/j.1467-9531.2006.00176.x