Please use D2L to turn in both the HTML output and your R Markdown file in.

Q1. Data Wrangling (4 pts)

Define data wrangling and discuss why it is an important element in the data analysis or data visualization process.

Q2. dplyr (5 pts)

Implementone function from the dplyr package on a dataset we have used in class (such as http://math.montana.edu/ahoegh/teaching/stat408/datasets/SeattleHousing.csv or http://math.montana.edu/ahoegh/teaching/stat408/datasets/HousingSales.csv )

Q3.

a (4 pts)

Describe the difference between substr() and strsplit().

b (6 pts)

Use one of these function to create a new variable for the hour an Uber ride began using http://math.montana.edu/ahoegh/teaching/stat408/datasets/UberMay2014.csv.

Then apply the count() function from dplyr to compute the number of rides starting at each hour