Week 5 Interpretaive Assignment

A short description of the post.

Noah Milstein true
2022-02-21

Background and Research Question:

Wikipedia is self-described as a “free content, multilingual online encyclopedia written and maintained by a community of volunteers through a model of open collaboration,” information on the website i shared and maintained “using a wiki-based editing system. [and] Individual contributors,” being the 5th most visited website in the world it is also the largest and most-read reference work in history.” (“https://en.wikipedia.org/wiki/Wikipedia”) Because of Wikipedia’s position, as an arbiter of information and reference for a number of subjects, it is essential to understand the nature of the required citations and how they inform the websites portrayal of history and information. (Chase 2021)

Conflict is a defining feature of history, the results of war and the groups involved are essential to understanding dynamics of power globally. A war can represent the transfer of material, territorial, and strategic power between groups. As a result the networks of wars between nations can give some notion of power centrality among warring nations.

Since wikipedia has become a widely accepted (if often critiqued) source of information, its citations and the information resulting from them can give us a sense of how where the center of global conflict, and thus the most central nations, according to popular and accessible academic literature.

https://www.visualcapitalist.com/the-50-most-visited-websites-in-the-world/

Chase, Matt. “Wikipedia is 20, and its reputation has never been higher”. The Economist. January 9, 2021. Retrieved February 25, 2021.

Part 1:

Describe the Dataset You Are Using:

The Dataset Being Used: The dataset that I am using is wikipedia list of wars throughout history, this article is the “List of wars: 1000–1499” which acts as a subset of the “2nd-millennium conflicts” I chose this dataset as an exemplar of popular history’s depiction of the centralization of worldwide conflict. Wikipedia, being an accessible source generally created from relevant citations makes it a good case study to see where historical writers and academics center their world are relevant conflicts.

Identify initial network format:

Answer: The initial network format is as an edge list, the first, in column contains the winners of each war while the second, out column contains the losers of each. These sets of belligerents are directed

Network Structure: Wars Startings in the 1000s

 Network attributes:
  vertices = 117 
  directed = TRUE 
  hyper = FALSE 
  loops = FALSE 
  multiple = TRUE 
  bipartite = FALSE 
  total edges= 153 
    missing edges= 0 
    non-missing edges= 153 

 Vertex attribute names: 
    vertex.names 

No edge attributes

Network Structure: Wars Startings in the 1100s

 Network attributes:
  vertices = 78 
  directed = TRUE 
  hyper = FALSE 
  loops = FALSE 
  multiple = TRUE 
  bipartite = FALSE 
  total edges= 225 
    missing edges= 0 
    non-missing edges= 225 

 Vertex attribute names: 
    vertex.names 

No edge attributes

Network Structure: Wars Startings in the 1200s

 Network attributes:
  vertices = 161 
  directed = TRUE 
  hyper = FALSE 
  loops = FALSE 
  multiple = TRUE 
  bipartite = FALSE 
  total edges= 313 
    missing edges= 0 
    non-missing edges= 313 

 Vertex attribute names: 
    vertex.names 

No edge attributes

Identify Nodes: Describe and identify the nodes (including how many nodes are in the dataset)

Answer: Nodes or vertices in these datasets represent belligerents in wars throughout history, the involved parties in each conflict can be a nation, province, individual, or group so long as they are listed as involved in the conflict. In the 1000s there are 117, in the 1100s there are 78 and in the 1200s there are 161.

What Constitutes a Tie: What constitutes a tie or edge (including how many ties, whether ties are directed/undirected and weighted/binary, and how to interpret the value of the tie if any)

Answer: A tie or edge in this dataset represents a war, this war can be between two nations or groups within a nation. These edges can represent a war that involved many more nations but are always tied to each and every party involved on both sides. These edges are directed and the direction indicates which side “won” the conflict (if an edge has an arrow pointing to another the node that originated that arrow won the war against them. There are 153 edges in the 1000s, 225 edges in 1100s and 313 edges in the 1200s.

Edge Attributes and Subset: Whether or not there are edge attributes that might be used to subset data or stack multiple networks (e.g., tie type, year, etc).

Answer: There are a number of attributes that could be used to subset the data, year that the conflict began or the length of time it lasted are available. Aspects like each side’s religion and the area where the conflict took place could be used to subset the data itself.

Part 2:

Brokerage and Betweeness centrality

What are betweeness and brokerage cenrrality Calculate brokerage and betweenneess centrality measures for one or more subsets of your network data, and write up the results and your interpretation of them.

Answer: I will be calculating these measures for wars in 1000-1099, 1100-1199, and 1200-1399.

Brokerage scores in the 1000s

(wars_in_1000s.nodes.stat_2%>%
  arrange(desc(broker.tot))%>%
  slice(1:10))[,c(1,11:15)] %>%kable()
name broker.tot broker.coord broker.itin broker.rep broker.gate
Byzantine Empire 25.2728731 NaN 3.9930250 NaN NaN
Sultanate of Rum 10.4364880 NaN -0.5014965 NaN NaN
Holy Roman Empire 10.0243662 NaN 4.8919293 NaN NaN
England 7.5516353 NaN 6.6906430 -0.0800286 -0.0800286
Kingdom of Sicily 5.4910263 NaN 3.9930250 NaN NaN
Kingdom of France 2.1940518 NaN -0.5014965 NaN NaN
Seljuk Empire 1.7819300 -0.0271615 -0.5004595 6.3140099 -0.1374029
Kingdom of Georgia 0.5455645 -0.0158536 -0.5009747 -0.1126843 -0.1126843
Papal States 0.5455645 -0.0158536 -0.5009747 -0.1126843 11.6872209
Normandy 0.1334427 -0.1256118 0.4229345 -0.2730778 -0.2730778

Brokerage scores in the 1100s

(wars_in_1100s.nodes.stat_2%>%
  arrange(desc(broker.tot))%>%
  slice(1:10))[,c(1,10:14)] %>%kable()
name broker.tot broker.coord broker.itin broker.rep broker.gate
Kingdom of Jerusalem 13.5778650 NaN 2.9379204 17.9204822 -0.1688321
Fatimid Caliphate 8.0093472 NaN -0.6987889 NaN NaN
Ayyubid Dynasty 7.2954347 NaN -0.6973228 -0.1688321 -0.1688321
Zengid Dynasty 5.7248271 NaN 0.7542726 NaN NaN
Byzantine Empire 5.2964795 NaN 0.7567745 -0.1688321 -0.1688321
England 4.4397845 NaN -0.6973228 -0.1688321 -0.1688321
Holy Roman Empire 2.1552644 NaN -0.6973228 -0.1688321 -0.1688321
Kingdom of France 1.0130043 NaN -0.6973228 -0.1688321 -0.1688321
Kingdom of Sicily 0.1563093 -0.2053296 -0.6628278 -0.4286929 -0.4286929
Papal States -0.2720383 -0.1781918 -0.6712334 -0.3997174 1.6840665

Brokerage scores in the 1200s

name broker.tot broker.coord broker.itin broker.rep broker.gate
Mongol Empire 47.964825 NaN -0.5953457 NaN NaN
Kingdom of France 28.663539 NaN -0.5953457 NaN NaN
Ayyubid Dynasty 26.995527 NaN 1.6524957 NaN NaN
Kingdom of England 21.991489 NaN 8.3960200 NaN NaN
Republic of Genoa 11.983415 NaN -0.5953457 NaN NaN
Knights Templar 10.077115 NaN 1.6524957 NaN NaN
Holy Roman Empire 4.834790 -0.0170801 -0.5948396 10.86552 10.8655226
Principality of Antioch 4.834790 -0.0170801 2.4030361 13.61357 -0.1266480
Kingdom of Cyprus 4.596503 58.5391124 0.1546293 13.61357 10.8655226
County of Tripoli 3.643353 NaN 0.9035078 19.28056 -0.0898366
name broker.gate
Papal States 11.6872209
County of Sicily -0.0800286
England -0.0800286
County of Aversa -0.1126843
Kingdom of Georgia -0.1126843
Great Seljuq Empire -0.1126843
Chola Empire -0.1126843
Taifa of Lérida -0.1126843
County of Apulia -0.1374029
Seljuk Empire -0.1374029
name broker.tot
Byzantine Empire 25.2728731
Sultanate of Rum 10.4364880
Holy Roman Empire 10.0243662
England 7.5516353
Kingdom of Sicily 5.4910263
Kingdom of France 2.1940518
Seljuk Empire 1.7819300
Kingdom of Georgia 0.5455645
Papal States 0.5455645
Normandy 0.1334427

Option 2.A

For a Specific Research Question: If you have a specific research question, please feel free to use that to guide your analysis. Otherwise, you may want to orient your analysis as follows in order to identify a compelling question or noteworthy pattern in the data that can be interpreted.

Answer: Since I am interested in the relative power of nations by their relative position ad centrality in the worldwide conflict, network brokerage can be used to illustrate significant positions in global conflict. Below I wanted to look at 4 kinds of brokerage, these are broker.gate or gatekeeper, coordinator, liason, and itinerant. I am interested to see if these specific coordination types are primarily done by specific nations.

Total Brokerage

Explanation: Looking at total brokerage in this dataset gives a sense of which factions were responsible for highest connection of unconnected actors through conflict. Given the crusades igniting conflict between Europe and the middle east it is sensible that the Byzantine Empire in the center of both connects the most unconnected actors through conflict closely followed by the Sultanate of Rum, a major Muslim faction that fought against the crusades and third being the Holy Roman Empire who participated in many conflicts including the crusades. These are followed by England who centered the wars in the British isles and the Kingdom of Sicily who were also in a position of conflict.

name broker.tot
Byzantine Empire 25.272873
Sultanate of Rum 10.436488
Holy Roman Empire 10.024366
England 7.551635
Kingdom of Sicily 5.491026

Coordinator Brokerage

Explanation: In this case no particular country is very high above any other in terms of their coordinator brokerage, meaning that within groups no particular nations appear to be brokering more within the groups.

name broker.coord
County of Aversa -0.0158536
Kingdom of Georgia -0.0158536
Great Seljuq Empire -0.0158536
Papal States -0.0158536
Chola Empire -0.0158536

Itinerant Brokerage

Explanation: Itinerant brokerage represents when a non-group actor connects 2 actors in a group it is no in to each other, in this case England has the highest score. Looking at the network graph they do appear to connect 2 actors in a group together.

name broker.itin
England 6.6906430
Holy Roman Empire 4.8919293
Kingdom of Sicily 3.9930250
Byzantine Empire 3.9930250
Principality of Kiev 0.4413428

Representative Brokerage

Explanation: Representative brokerage indicates that the broker, or nation in question loses a war to another in their group, but wins another against a faction outside of their group. This can be though of as their directed connections to them. In this case the Seljuk Empire and Kingdom of Aragon have instances in which they lose to factions within their group before beating those outside of it.

name broker.rep
Seljuk Empire 6.3140099
Kingdom of Aragon 1.1415607
County of Sicily -0.0800286
England -0.0800286
County of Aversa -0.1126843

Gatekeeper Brokerage

Explanation: The Papal states being ranked highest in gatekeeper brokerage is an interesting observation as no other nation in the dataset appears to be close to their level as most are negative in this category. In this cae being a gatekeeper means that they are in at conflict in a group with another while the nation in a different group of conflicts is only at war with them from the group. This is an interesting observation given the Papal states role as a coordinator of the war, but not a participant in the conflcit as directly as other belligerents. (This being the crusade given the period)

name broker.gate
Papal States 11.6872209
County of Sicily -0.0800286
England -0.0800286
County of Aversa -0.1126843
Kingdom of Georgia -0.1126843

Liaison Brokerage

Explanation: A liaison broker, in this case, is a faction that loses a war to a group they do not belong to and wins a war against a different group than the first that they also do not belong to. The Byzantine Empire, Sultanate of Rum, and Holy Roman Empire are highest in this category likely owing to their frequent states of conflict beyond the crusades against a variety of groups.

name broker.lia
Byzantine Empire 31.767840
Sultanate of Rum 14.454659
Holy Roman Empire 10.545231
England 6.202342
Kingdom of Sicily 4.960334

wars_in_1000s_edgelist <- as.matrix(wars_in_1000s)

wars_in_1000s_edgelist_network_edgelist <- graph.edgelist(wars_in_1000s_edgelist, directed=TRUE)

wars_in_1000s.ig<-graph_from_data_frame(wars_in_1000s)

wars_in_1000s_network <- asNetwork(wars_in_1000s.ig)

pls_work<-as.network(wars_in_1000s_edgelist, matrix.type = "edgelist", directed = FALSE,  hyper = FALSE, loops = FALSE, multiple = FALSE, bipartite = FALSE, vertex.attrnames=wars_in_1000s_network%v%"vertex.names")

flomarr.se<-equiv.clust(pls_work, equiv.fun="sedist", method="hamming",mode="graph")
plot(flomarr.se, labels=flomarr.se$glabels,  cex=0.3)
rect.hclust(flomarr.se$cluster,h=5)
?plot()
Help on topic 'plot' was found in the following packages:

  Package               Library
  base                  /Library/Frameworks/R.framework/Resources/library
  graphics              /Library/Frameworks/R.framework/Versions/4.1/Resources/library


Using the first match ...
#blockmodel and select partitions
blk_mod<-blockmodel(pls_work,flomarr.se,k=5)
#assign block membership to vertex attribute
V(wars_in_1000s.ig)$role<-blk_mod$block.membership[match(V(wars_in_1000s.ig)$name,blk_mod$plabels)]
wars_in_1000s_network%v%"role"<-blk_mod$block.membership[match(wars_in_1000s_network%v%"vertex.names", blk_mod$glabels)]
?blockmodel()
set.seed(2)
#blockmodel and select partitions
blk_mod<-blockmodel(pls_work, flomarr.se, k=5)
#assign block membership to vertex attribute
pls_work%v%"role"<-blk_mod$block.membership[match(pls_work%v%"vertex.names", blk_mod$glabels)]
#plot network using "role" to color nodes: statnet
GGally::ggnet2(pls_work,
               node.color="role", 
               node.size=degree(pls_work, gmode="graph"),
               node.label = "vertex.names",
               label.size= 1,
               node.alpha = .5)
?ggnet2()
plot.block<-function(x=blk_mod, main=NULL, cex.lab=1){
  plot.sociomatrix(x$blocked.data, labels=list(x$plabels,x$plabels),
                   main=main, drawlines = FALSE, cex.lab=cex.lab)
  for (j in 2:length(x$plabels)) if (x$block.membership[j] !=
                                     x$block.membership[j-1]) 
    abline(v = j - 0.5, h = j - 0.5, lty = 3, xpd=FALSE)
}
plot.block(blk_mod, cex.lab=.35)

(information regarding the meaning of each type of brokerage was acquired from https://edis.ifas.ufl.edu/publication/WC197)

Reuse

Text and figures are licensed under Creative Commons Attribution CC BY-NC 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".

Citation

For attribution, please cite this work as

Milstein (2022, March 3). Data Analytics and Computational Social Science: Week 5 Interpretaive Assignment. Retrieved from https://nmilsteinuma.github.io/posts/2022-02-21-week-5-interpretaive-assignment/

BibTeX citation

@misc{milstein2022week,
  author = {Milstein, Noah},
  title = {Data Analytics and Computational Social Science: Week 5 Interpretaive Assignment},
  url = {https://nmilsteinuma.github.io/posts/2022-02-21-week-5-interpretaive-assignment/},
  year = {2022}
}