DACSS 601: Data Science Fundamentals - FALL 2022
  • Fall 2022 Posts
  • Contributors
  • DACSS

Challenge1 Erika Nagai

  • Course information
    • Overview
    • Instructional Team
    • Course Schedule
  • Weekly materials
    • Fall 2022 posts
    • final posts

On this page

  • Challenge Overview
  • Describe the data

Challenge1 Erika Nagai

  • Show All Code
  • Hide All Code

  • View Source
challenge_1
railroads
faostat
wildbirds
Author

Erika Nagai

Published

September 13, 2022

Code
# Packages 
library(tidyverse)
library(dplyr)
library(ggplot2)
library(summarytools)

knitr::opts_chunk$set(echo = TRUE, warning=FALSE, message=FALSE)

Challenge Overview

Today’s challenge is to read in and explore the data ‘birds.csv’

Code
# 1. Read in the data

bird_data <- read_csv('_data/birds.csv')
bird_data
# A tibble: 30,977 × 14
   Domain Cod…¹ Domain Area …² Area  Eleme…³ Element Item …⁴ Item  Year …⁵  Year
   <chr>        <chr>    <dbl> <chr>   <dbl> <chr>     <dbl> <chr>   <dbl> <dbl>
 1 QA           Live …       2 Afgh…    5112 Stocks     1057 Chic…    1961  1961
 2 QA           Live …       2 Afgh…    5112 Stocks     1057 Chic…    1962  1962
 3 QA           Live …       2 Afgh…    5112 Stocks     1057 Chic…    1963  1963
 4 QA           Live …       2 Afgh…    5112 Stocks     1057 Chic…    1964  1964
 5 QA           Live …       2 Afgh…    5112 Stocks     1057 Chic…    1965  1965
 6 QA           Live …       2 Afgh…    5112 Stocks     1057 Chic…    1966  1966
 7 QA           Live …       2 Afgh…    5112 Stocks     1057 Chic…    1967  1967
 8 QA           Live …       2 Afgh…    5112 Stocks     1057 Chic…    1968  1968
 9 QA           Live …       2 Afgh…    5112 Stocks     1057 Chic…    1969  1969
10 QA           Live …       2 Afgh…    5112 Stocks     1057 Chic…    1970  1970
# … with 30,967 more rows, 4 more variables: Unit <chr>, Value <dbl>,
#   Flag <chr>, `Flag Description` <chr>, and abbreviated variable names
#   ¹​`Domain Code`, ²​`Area Code`, ³​`Element Code`, ⁴​`Item Code`, ⁵​`Year Code`

Describe the data

It consists of 3077 rows and 14 columns. This “bird” data documents the number of five livestock birds (chickens, ducks, geese and guinea fowls, pigeons, turkeys) of each year from 1960 to 2018 of 248 countries/regions.

Code
print(summarytools::dfSummary(bird_data),
      varnumbers = FALSE,
      plain.ascii  = FALSE,
      style        = "grid",
      graph.magnif = 0.80,
      valid.col    = FALSE,
      method = 'render',
      table.classes = 'table-condensed')

Data Frame Summary

bird_data

Dimensions: 30977 x 14
Duplicates: 0
Variable Stats / Values Freqs (% of Valid) Graph Missing
Domain Code [character] 1. QA
30977(100.0%)
0 (0.0%)
Domain [character] 1. Live Animals
30977(100.0%)
0 (0.0%)
Area Code [numeric]
Mean (sd) : 1201.7 (2099.4)
min ≤ med ≤ max:
1 ≤ 156 ≤ 5504
IQR (CV) : 152 (1.7)
248 distinct values 0 (0.0%)
Area [character]
1. Africa
2. Asia
3. Eastern Asia
4. Egypt
5. Europe
6. France
7. Greece
8. Myanmar
9. Northern Africa
10. South-eastern Asia
[ 238 others ]
290(0.9%)
290(0.9%)
290(0.9%)
290(0.9%)
290(0.9%)
290(0.9%)
290(0.9%)
290(0.9%)
290(0.9%)
290(0.9%)
28077(90.6%)
0 (0.0%)
Element Code [numeric] 1 distinct value
5112:30977(100.0%)
0 (0.0%)
Element [character] 1. Stocks
30977(100.0%)
0 (0.0%)
Item Code [numeric]
Mean (sd) : 1066.5 (9)
min ≤ med ≤ max:
1057 ≤ 1068 ≤ 1083
IQR (CV) : 15 (0)
1057:13074(42.2%)
1068:6909(22.3%)
1072:4136(13.4%)
1079:5693(18.4%)
1083:1165(3.8%)
0 (0.0%)
Item [character]
1. Chickens
2. Ducks
3. Geese and guinea fowls
4. Pigeons, other birds
5. Turkeys
13074(42.2%)
6909(22.3%)
4136(13.4%)
1165(3.8%)
5693(18.4%)
0 (0.0%)
Year Code [numeric]
Mean (sd) : 1990.6 (16.7)
min ≤ med ≤ max:
1961 ≤ 1992 ≤ 2018
IQR (CV) : 29 (0)
58 distinct values 0 (0.0%)
Year [numeric]
Mean (sd) : 1990.6 (16.7)
min ≤ med ≤ max:
1961 ≤ 1992 ≤ 2018
IQR (CV) : 29 (0)
58 distinct values 0 (0.0%)
Unit [character] 1. 1000 Head
30977(100.0%)
0 (0.0%)
Value [numeric]
Mean (sd) : 99410.6 (720611.4)
min ≤ med ≤ max:
0 ≤ 1800 ≤ 23707134
IQR (CV) : 15233 (7.2)
11495 distinct values 1036 (3.3%)
Flag [character]
1. *
2. A
3. F
4. Im
5. M
1494(7.4%)
6488(32.1%)
10007(49.5%)
1213(6.0%)
1002(5.0%)
10773 (34.8%)
Flag Description [character]
1. Aggregate, may include of
2. Data not available
3. FAO data based on imputat
4. FAO estimate
5. Official data
6. Unofficial figure
6488(20.9%)
1002(3.2%)
1213(3.9%)
10007(32.3%)
10773(34.8%)
1494(4.8%)
0 (0.0%)

Generated by summarytools 1.0.1 (R version 4.2.1)
2022-12-20

This data looks tidy, however, the column “Area” is not well organized as it has the name of the countries and the regions. For example, there are “Africa” and “Algeria” for the Area column, but “Algeria” is a part of “Africa”.

Code
bird_data %>% 
  select(Area) %>%
  arrange(Area) %>%
  unique()
# A tibble: 248 × 1
   Area               
   <chr>              
 1 Afghanistan        
 2 Africa             
 3 Albania            
 4 Algeria            
 5 American Samoa     
 6 Americas           
 7 Angola             
 8 Antigua and Barbuda
 9 Argentina          
10 Armenia            
# … with 238 more rows

It seems like that the Area consists of - [1-220] 220 countris (Afghanistan - Zimbabwe) - [221] World - [222-248] 27 regions

Code
unique(bird_data$Area)
  [1] "Afghanistan"                                         
  [2] "Albania"                                             
  [3] "Algeria"                                             
  [4] "American Samoa"                                      
  [5] "Angola"                                              
  [6] "Antigua and Barbuda"                                 
  [7] "Argentina"                                           
  [8] "Armenia"                                             
  [9] "Aruba"                                               
 [10] "Australia"                                           
 [11] "Austria"                                             
 [12] "Azerbaijan"                                          
 [13] "Bahamas"                                             
 [14] "Bahrain"                                             
 [15] "Bangladesh"                                          
 [16] "Barbados"                                            
 [17] "Belarus"                                             
 [18] "Belgium"                                             
 [19] "Belgium-Luxembourg"                                  
 [20] "Belize"                                              
 [21] "Benin"                                               
 [22] "Bermuda"                                             
 [23] "Bhutan"                                              
 [24] "Bolivia (Plurinational State of)"                    
 [25] "Bosnia and Herzegovina"                              
 [26] "Botswana"                                            
 [27] "Brazil"                                              
 [28] "Brunei Darussalam"                                   
 [29] "Bulgaria"                                            
 [30] "Burkina Faso"                                        
 [31] "Burundi"                                             
 [32] "Cabo Verde"                                          
 [33] "Cambodia"                                            
 [34] "Cameroon"                                            
 [35] "Canada"                                              
 [36] "Cayman Islands"                                      
 [37] "Central African Republic"                            
 [38] "Chad"                                                
 [39] "Chile"                                               
 [40] "China, Hong Kong SAR"                                
 [41] "China, Macao SAR"                                    
 [42] "China, mainland"                                     
 [43] "China, Taiwan Province of"                           
 [44] "Colombia"                                            
 [45] "Comoros"                                             
 [46] "Congo"                                               
 [47] "Cook Islands"                                        
 [48] "Costa Rica"                                          
 [49] "Côte d'Ivoire"                                       
 [50] "Croatia"                                             
 [51] "Cuba"                                                
 [52] "Cyprus"                                              
 [53] "Czechia"                                             
 [54] "Czechoslovakia"                                      
 [55] "Democratic People's Republic of Korea"               
 [56] "Democratic Republic of the Congo"                    
 [57] "Denmark"                                             
 [58] "Dominica"                                            
 [59] "Dominican Republic"                                  
 [60] "Ecuador"                                             
 [61] "Egypt"                                               
 [62] "El Salvador"                                         
 [63] "Equatorial Guinea"                                   
 [64] "Eritrea"                                             
 [65] "Estonia"                                             
 [66] "Eswatini"                                            
 [67] "Ethiopia"                                            
 [68] "Ethiopia PDR"                                        
 [69] "Falkland Islands (Malvinas)"                         
 [70] "Fiji"                                                
 [71] "Finland"                                             
 [72] "France"                                              
 [73] "French Guyana"                                       
 [74] "French Polynesia"                                    
 [75] "Gabon"                                               
 [76] "Gambia"                                              
 [77] "Georgia"                                             
 [78] "Germany"                                             
 [79] "Ghana"                                               
 [80] "Greece"                                              
 [81] "Grenada"                                             
 [82] "Guadeloupe"                                          
 [83] "Guam"                                                
 [84] "Guatemala"                                           
 [85] "Guinea"                                              
 [86] "Guinea-Bissau"                                       
 [87] "Guyana"                                              
 [88] "Haiti"                                               
 [89] "Honduras"                                            
 [90] "Hungary"                                             
 [91] "Iceland"                                             
 [92] "India"                                               
 [93] "Indonesia"                                           
 [94] "Iran (Islamic Republic of)"                          
 [95] "Iraq"                                                
 [96] "Ireland"                                             
 [97] "Israel"                                              
 [98] "Italy"                                               
 [99] "Jamaica"                                             
[100] "Japan"                                               
[101] "Jordan"                                              
[102] "Kazakhstan"                                          
[103] "Kenya"                                               
[104] "Kiribati"                                            
[105] "Kuwait"                                              
[106] "Kyrgyzstan"                                          
[107] "Lao People's Democratic Republic"                    
[108] "Latvia"                                              
[109] "Lebanon"                                             
[110] "Lesotho"                                             
[111] "Liberia"                                             
[112] "Libya"                                               
[113] "Liechtenstein"                                       
[114] "Lithuania"                                           
[115] "Luxembourg"                                          
[116] "Madagascar"                                          
[117] "Malawi"                                              
[118] "Malaysia"                                            
[119] "Mali"                                                
[120] "Malta"                                               
[121] "Martinique"                                          
[122] "Mauritania"                                          
[123] "Mauritius"                                           
[124] "Mexico"                                              
[125] "Micronesia (Federated States of)"                    
[126] "Mongolia"                                            
[127] "Montenegro"                                          
[128] "Montserrat"                                          
[129] "Morocco"                                             
[130] "Mozambique"                                          
[131] "Myanmar"                                             
[132] "Namibia"                                             
[133] "Nauru"                                               
[134] "Nepal"                                               
[135] "Netherlands"                                         
[136] "Netherlands Antilles (former)"                       
[137] "New Caledonia"                                       
[138] "New Zealand"                                         
[139] "Nicaragua"                                           
[140] "Niger"                                               
[141] "Nigeria"                                             
[142] "Niue"                                                
[143] "North Macedonia"                                     
[144] "Norway"                                              
[145] "Oman"                                                
[146] "Pacific Islands Trust Territory"                     
[147] "Pakistan"                                            
[148] "Palestine"                                           
[149] "Panama"                                              
[150] "Papua New Guinea"                                    
[151] "Paraguay"                                            
[152] "Peru"                                                
[153] "Philippines"                                         
[154] "Poland"                                              
[155] "Portugal"                                            
[156] "Puerto Rico"                                         
[157] "Qatar"                                               
[158] "Republic of Korea"                                   
[159] "Republic of Moldova"                                 
[160] "Réunion"                                             
[161] "Romania"                                             
[162] "Russian Federation"                                  
[163] "Rwanda"                                              
[164] "Saint Helena, Ascension and Tristan da Cunha"        
[165] "Saint Kitts and Nevis"                               
[166] "Saint Lucia"                                         
[167] "Saint Pierre and Miquelon"                           
[168] "Saint Vincent and the Grenadines"                    
[169] "Samoa"                                               
[170] "Sao Tome and Principe"                               
[171] "Saudi Arabia"                                        
[172] "Senegal"                                             
[173] "Serbia"                                              
[174] "Serbia and Montenegro"                               
[175] "Seychelles"                                          
[176] "Sierra Leone"                                        
[177] "Singapore"                                           
[178] "Slovakia"                                            
[179] "Slovenia"                                            
[180] "Solomon Islands"                                     
[181] "Somalia"                                             
[182] "South Africa"                                        
[183] "South Sudan"                                         
[184] "Spain"                                               
[185] "Sri Lanka"                                           
[186] "Sudan"                                               
[187] "Sudan (former)"                                      
[188] "Suriname"                                            
[189] "Sweden"                                              
[190] "Switzerland"                                         
[191] "Syrian Arab Republic"                                
[192] "Tajikistan"                                          
[193] "Thailand"                                            
[194] "Timor-Leste"                                         
[195] "Togo"                                                
[196] "Tokelau"                                             
[197] "Tonga"                                               
[198] "Trinidad and Tobago"                                 
[199] "Tunisia"                                             
[200] "Turkey"                                              
[201] "Turkmenistan"                                        
[202] "Tuvalu"                                              
[203] "Uganda"                                              
[204] "Ukraine"                                             
[205] "United Arab Emirates"                                
[206] "United Kingdom of Great Britain and Northern Ireland"
[207] "United Republic of Tanzania"                         
[208] "United States of America"                            
[209] "United States Virgin Islands"                        
[210] "Uruguay"                                             
[211] "USSR"                                                
[212] "Uzbekistan"                                          
[213] "Vanuatu"                                             
[214] "Venezuela (Bolivarian Republic of)"                  
[215] "Viet Nam"                                            
[216] "Wallis and Futuna Islands"                           
[217] "Yemen"                                               
[218] "Yugoslav SFR"                                        
[219] "Zambia"                                              
[220] "Zimbabwe"                                            
[221] "World"                                               
[222] "Africa"                                              
[223] "Eastern Africa"                                      
[224] "Middle Africa"                                       
[225] "Northern Africa"                                     
[226] "Southern Africa"                                     
[227] "Western Africa"                                      
[228] "Americas"                                            
[229] "Northern America"                                    
[230] "Central America"                                     
[231] "Caribbean"                                           
[232] "South America"                                       
[233] "Asia"                                                
[234] "Central Asia"                                        
[235] "Eastern Asia"                                        
[236] "Southern Asia"                                       
[237] "South-eastern Asia"                                  
[238] "Western Asia"                                        
[239] "Europe"                                              
[240] "Eastern Europe"                                      
[241] "Northern Europe"                                     
[242] "Southern Europe"                                     
[243] "Western Europe"                                      
[244] "Oceania"                                             
[245] "Australia and New Zealand"                           
[246] "Melanesia"                                           
[247] "Micronesia"                                          
[248] "Polynesia"                                           
Source Code
---
title: "Challenge1 Erika Nagai"
author: "Erika Nagai"
desription: "Reading in data and creating a post"
date: "09/13/2022"
format:
  html:
    toc: true
    code-fold: true
    code-copy: true
    code-tools: true
categories:
  - challenge_1
  - railroads
  - faostat
  - wildbirds
---

```{r}
#| label: setup
#| warning: false
#| message: false

# Packages 
library(tidyverse)
library(dplyr)
library(ggplot2)
library(summarytools)

knitr::opts_chunk$set(echo = TRUE, warning=FALSE, message=FALSE)
```

## Challenge Overview

Today's challenge is to read in and explore the data 'birds.csv'

```{r}
# 1. Read in the data

bird_data <- read_csv('_data/birds.csv')
bird_data

```

## Describe the data

It consists of 3077 rows and 14 columns.
This "bird" data documents the number of five livestock birds (chickens, ducks, geese and guinea fowls, pigeons, turkeys) of each year from 1960 to 2018 of 248 countries/regions. 

```{r}
#| label: summary

print(summarytools::dfSummary(bird_data),
      varnumbers = FALSE,
      plain.ascii  = FALSE,
      style        = "grid",
      graph.magnif = 0.80,
      valid.col    = FALSE,
      method = 'render',
      table.classes = 'table-condensed')
```


This data looks tidy, however, the column "Area" is not well organized as it has the name of the countries and the regions. For example, there are "Africa" and "Algeria" for the Area column, but "Algeria" is a part of "Africa".

```{r}
bird_data %>% 
  select(Area) %>%
  arrange(Area) %>%
  unique()
```
It seems like that the Area consists of 
- [1-220] 220 countris (Afghanistan - Zimbabwe)
- [221] World
- [222-248] 27 regions

```{r}
unique(bird_data$Area)
```