sathvik_iris_data_hw2

the Iris dataset

sathvik_thogaru
08-18-2021

loading the iris dataset from the datasets package and running summary statistics on the iris data

  Sepal.Length    Sepal.Width     Petal.Length    Petal.Width   
 Min.   :4.300   Min.   :2.000   Min.   :1.000   Min.   :0.100  
 1st Qu.:5.100   1st Qu.:2.800   1st Qu.:1.600   1st Qu.:0.300  
 Median :5.800   Median :3.000   Median :4.350   Median :1.300  
 Mean   :5.843   Mean   :3.057   Mean   :3.758   Mean   :1.199  
 3rd Qu.:6.400   3rd Qu.:3.300   3rd Qu.:5.100   3rd Qu.:1.800  
 Max.   :7.900   Max.   :4.400   Max.   :6.900   Max.   :2.500  
       Species  
 setosa    :50  
 versicolor:50  
 virginica :50  
                
                
                

finding the column nammes of the data

[1] "Sepal.Length" "Sepal.Width"  "Petal.Length" "Petal.Width" 
[5] "Species"     

dim()is used to find the dimensions of the data

[1] 150   5

skim() is an alternative to summary(), quickly providing a broad overview of a data frame. It handles data of all types, dispatching a different set of summary functions based on the types of columns in the data frame.

Table 1: Data summary
Name iris
Number of rows 150
Number of columns 5
_______________________
Column type frequency:
factor 1
numeric 4
________________________
Group variables None

Variable type: factor

skim_variable n_missing complete_rate ordered n_unique top_counts
Species 0 1 FALSE 3 set: 50, ver: 50, vir: 50

Variable type: numeric

skim_variable n_missing complete_rate mean sd p0 p25 p50 p75 p100 hist
Sepal.Length 0 1 5.84 0.83 4.3 5.1 5.80 6.4 7.9 ▆▇▇▅▂
Sepal.Width 0 1 3.06 0.44 2.0 2.8 3.00 3.3 4.4 ▁▆▇▂▁
Petal.Length 0 1 3.76 1.77 1.0 1.6 4.35 5.1 6.9 ▇▁▆▇▂
Petal.Width 0 1 1.20 0.76 0.1 0.3 1.30 1.8 2.5 ▇▁▇▅▃

there are 4 numeric variables and 1 factor variable which is the species. there are a total of 3 unique species in the iris

[1] setosa     versicolor virginica 
Levels: setosa versicolor virginica

the three species are setosa, versicolor and virginica

ploting iris

Reuse

Text and figures are licensed under Creative Commons Attribution CC BY-NC 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".

Citation

For attribution, please cite this work as

sathvik_thogaru (2021, Aug. 18). DACSS 601 August 2021: sathvik_iris_data_hw2. Retrieved from https://mrolfe.github.io/DACSS601August2021/posts/2021-08-18-sathvikirisdatahw2/

BibTeX citation

@misc{sathvik_thogaru2021sathvik_iris_data_hw2,
  author = {sathvik_thogaru, },
  title = {DACSS 601 August 2021: sathvik_iris_data_hw2},
  url = {https://mrolfe.github.io/DACSS601August2021/posts/2021-08-18-sathvikirisdatahw2/},
  year = {2021}
}