Code
library(tidyverse)
::opts_chunk$set(echo = TRUE, warning=FALSE, message=FALSE) knitr
Dane Shelton
September 16, 2022
read in a dataset, and
describe the dataset using both words and any supporting information (e.g., tables, etc)
Used function read_csv()
to read in FAO Country Groups dataset.
Data was clean for Tibbles format already, did not need to alter any arguments other than filename.
# Block output from showing
#| include: false
# Using Readr package function read_csv() to Import FAO Stat Country
# Assigning variable name "country_code" to tibble created from read_csv
country_code <- read_csv("_data/FAOSTAT_country_groups.csv")
# Remember to add desired data into working directory before calling read_csv
# A tibble: 6 × 7
`Country Group Code` `Country Group` Country…¹ Country M49 C…² ISO2 …³ ISO3 …⁴
<dbl> <chr> <dbl> <chr> <chr> <chr> <chr>
1 5100 Africa 4 Algeria 012 DZ DZA
2 5100 Africa 7 Angola 024 AO AGO
3 5100 Africa 53 Benin 204 BJ BEN
4 5100 Africa 20 Botswa… 072 BW BWA
5 5100 Africa 233 Burkin… 854 BF BFA
6 5100 Africa 29 Burundi 108 BI BDI
# … with abbreviated variable names ¹`Country Code`, ²`M49 Code`, ³`ISO2 Code`,
# ⁴`ISO3 Code`
Country Groups contains 1943 observations (rows) and 7 variables (columns).
The variables and their types are as follows: “Country Group Code”
This data meticulously categorizes all the countries recognized by the United Nations, first by region (Africa, Americas, Asia…), then further geographically (South America, Central Asia, East Africa…). Each observation has standardized codes in multiple formats representing the associated nation.
Within the variable “Country Group” , which already categorized countries geographically in detail, the countries are further classified by socioeconomic indicators like “Low Income Economy” , “High Income Economy” , “Net Food Importing Country”. This data was likely collected from an organization like the United Nations as it uses common language in the manner it categorizes countries.
which((is.na(...)))
: directly return index of vals missing
complete.cases()
: return all complete obs
na.omit()
: omits all obs with missing values
na.rm()
: Logical argument option within some functions to remove NA for a calculation
summary()
: more in depth, shows what variables have missing vals etc
glimpse()
: less in depth
min()
, max()
IQR()
, range()
, diff()
dim(
), colnames()
, starts/ends_with()
, contains()
length()
, str()
, typeof()
---
title: "Challenge 1 Solution"
author: "Dane Shelton"
desription: "Reading in data and creating a post"
date: "09/16/2022"
format:
html:
toc: true
code-fold: true
code-copy: true
code-tools: true
categories:
- challenge_1
- shelton
- faostat
editor:
markdown:
wrap: 72
---
```{r}
#| label: setup
#| warning: false
#| message: false
library(tidyverse)
knitr::opts_chunk$set(echo = TRUE, warning=FALSE, message=FALSE)
```
## Challenge 1 Tasks
1) read in a dataset, and
2) describe the dataset using both words and any supporting information
(e.g., tables, etc)
### Task 1: Reading in the Data
Used function `read_csv()` to read in FAO Country Groups dataset.
Data was clean for Tibbles format already, did not need to alter any
arguments other than filename.
```{r}
#| label: Reading In Country Groups Data
# Block output from showing
#| include: false
# Using Readr package function read_csv() to Import FAO Stat Country
# Assigning variable name "country_code" to tibble created from read_csv
country_code <- read_csv("_data/FAOSTAT_country_groups.csv")
# Remember to add desired data into working directory before calling read_csv
```
### Task 2: Describing the Country Group Data
```{r}
#| label: Summary of Country Code Data
#| include: true
# Checking to type of country_code
# str(country_code)
# r-default dataframe
# Coercing to a Tibble and viewing
# country_code <- as_tibble(country_code)
# country_code
# 1947 x 7
# Viewed Tibble in Another Window
# Viewed Head to determine variable types
head(country_code)
# which(is.na(country_code[,'M49 Code']==TRUE))
# Used to determine whether any countries in the dataset were not in UN
```
#### Description of Country Groups Data
Country Groups contains 1943 observations (rows) and 7 variables
(columns).
The variables and their types are as follows: "Country Group Code" <dbl>
(double-precision), "Country Group" <chr> (character), "Country Code"
<dbl>, "Country" <chr>, "M49 Code" <chr>, "ISO2 Code" <chr>, "ISO3 Code"
<chr>.
This data meticulously categorizes all the countries recognized by the
United Nations, first by region (Africa, Americas, Asia...), then
further geographically (South America, Central Asia, East Africa...).
Each observation has standardized codes in multiple formats representing
the associated nation.
Within the variable "Country Group" , which already categorized
countries geographically in detail, the countries are *further*
classified by socioeconomic indicators like "Low Income Economy" , "High
Income Economy" , "Net Food Importing Country". This data was likely
collected from an organization like the United Nations as it uses common
language in the manner it categorizes countries.
##### Other Useful Functions and Notes
`which((is.na(...)))`: directly return index of vals missing
`complete.cases()`: return all complete obs
`na.omit()` : omits all obs with missing values
`na.rm()` : Logical argument option within some functions to remove NA
for a calculation
`summary()`: more in depth, shows what variables have missing vals etc
`glimpse()`: less in depth
`min()`, `max()`
`IQR()`, `range()`, `diff()`
`dim(`), `colnames()`, `starts/ends_with()`, `contains()`
`length()`, `str()`, `typeof()`