# AP Statistics R Programming Assignment 2

[This article was first published on

Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

**R – Saturn Science**, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Our second assignment involves students downloading an Excel file from the internet and importing it into the RStudio environment. They will then write code to give summary statistics and the five-number summary.

The Sharp Sight website has a nice explanation of using ggplot for creating scatter plots. Go here to read the entire article.

This post will be part two for the students to check their code against mine and to make any final adjustments before submitting their assignment.

# Assignment 2 # Follow the usual procedures for code submission. library(readxl) # Download this package if you dont have it already # Go to import dataset, then "From Excel" and find you file "backpack" on you computer. backpack <- read_excel("//ais-main/users/kevin-smith/Desktop/backpack.xls") View(backpack) # you need to make it a data.frame first with this command. as.data.frame(backpack) # don't forget to load ggplot, dplyr and/or tidyverse. summary(backpack) # gives summary statistics fivenum(backpack$boyweight) # because this is a data.frame, I need to use the "$" to indicate which column to analyse. fivenum(backpack$packweight) library(ggplot2) scatter145=ggplot(data=backpack, aes(boyweight,packweight)) + geom_point() scatter145 # this just give a basic scatterplot # now we add some color scatter145b=ggplot(data=backpack, aes(boyweight,packweight,colour=body.wt)) + geom_point() scatter145b # Next we add a small color box. scatter145c = scatter145b+ geom_point(size=2) + xlab("Body Weight (lb)") + ylab("Pack weight (lb)") + ggtitle("Backpack Weight") scatter145c # Here we will add the confidence intervals and regression line with the "lm" command # lm means linear model scatter145d=scatter145c+ geom_point(size=3) + xlab("Body Weight (lb)") + ylab("Pack weight (lb)")+ ggtitle("Backpack Weight")+ geom_smooth(method = "lm") scatter145d

Go here to read the article on creating another scatter plot.

To

**leave a comment**for the author, please follow the link and comment on their blog:**R – Saturn Science**.R-bloggers.com offers

**daily e-mail updates**about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.

Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.