-
Notifications
You must be signed in to change notification settings - Fork 56.7k
Commit
- Loading branch information
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,114 @@ | ||
ExData_Plotting1 | ||
================ | ||
## Introduction | ||
|
||
This assignment uses data from | ||
the <a href="http://archive.ics.uci.edu/ml/">UC Irvine Machine | ||
Learning Repository</a>, a popular repository for machine learning | ||
datasets. In particular, we will be using the "Individual household | ||
electric power consumption Data Set" which I have made available on | ||
the course web site: | ||
|
||
|
||
* <b>Dataset</b>: <a href="https://d396qusza40orc.cloudfront.net/exdata%2Fdata%2Fhousehold_power_consumption.zip">Electric power consumption</a> [20Mb] | ||
|
||
* <b>Description</b>: Measurements of electric power consumption in | ||
one household with a one-minute sampling rate over a period of almost | ||
4 years. Different electrical quantities and some sub-metering values | ||
are available. | ||
|
||
|
||
The following descriptions of the 9 variables in the dataset are taken | ||
from | ||
the <a href="https://archive.ics.uci.edu/ml/datasets/Individual+household+electric+power+consumption">UCI | ||
web site</a>: | ||
|
||
<ol> | ||
<li><b>Date</b>: Date in format dd/mm/yyyy </li> | ||
<li><b>Time</b>: time in format hh:mm:ss </li> | ||
<li><b>Global_active_power</b>: household global minute-averaged active power (in kilowatt) </li> | ||
<li><b>Global_reactive_power</b>: household global minute-averaged reactive power (in kilowatt) </li> | ||
<li><b>Voltage</b>: minute-averaged voltage (in volt) </li> | ||
<li><b>Global_intensity</b>: household global minute-averaged current intensity (in ampere) </li> | ||
<li><b>Sub_metering_1</b>: energy sub-metering No. 1 (in watt-hour of active energy). It corresponds to the kitchen, containing mainly a dishwasher, an oven and a microwave (hot plates are not electric but gas powered). </li> | ||
<li><b>Sub_metering_2</b>: energy sub-metering No. 2 (in watt-hour of active energy). It corresponds to the laundry room, containing a washing-machine, a tumble-drier, a refrigerator and a light. </li> | ||
<li><b>Sub_metering_3</b>: energy sub-metering No. 3 (in watt-hour of active energy). It corresponds to an electric water-heater and an air-conditioner.</li> | ||
</ol> | ||
|
||
## Loading the data | ||
|
||
This comment has been minimized.
Sorry, something went wrong.
This comment has been minimized.
Sorry, something went wrong.
Zahramin
|
||
|
||
|
||
|
||
|
||
When loading the dataset into R, please consider the following: | ||
|
||
* The dataset has 2,075,259 rows and 9 columns. First | ||
This comment has been minimized.
Sorry, something went wrong.
Jameaba
|
||
calculate a rough estimate of how much memory the dataset will require | ||
in memory before reading into R. Make sure your computer has enough | ||
memory (most modern computers should be fine). | ||
|
||
* We will only be using data from the dates 2007-02-01 and | ||
2007-02-02. One alternative is to read the data from just those dates | ||
rather than reading in the entire dataset and subsetting to those | ||
dates. | ||
|
||
* You may find it useful to convert the Date and Time variables to | ||
Date/Time classes in R using the `strptime()` and `as.Date()` | ||
functions. | ||
|
||
* Note that in this dataset missing values are coded as `?`. | ||
|
||
|
||
## Making Plots | ||
|
||
Our overall goal here is simply to examine how household energy usage | ||
varies over a 2-day period in February, 2007. Your task is to | ||
reconstruct the following plots below, all of which were constructed | ||
using the base plotting system. | ||
|
||
First you will need to fork and clone the following GitHub repository: | ||
[https://github.com/rdpeng/ExData_Plotting1](https://github.com/rdpeng/ExData_Plotting1) | ||
|
||
|
||
For each plot you should | ||
|
||
* Construct the plot and save it to a PNG file with a width of 480 | ||
pixels and a height of 480 pixels. | ||
|
||
* Name each of the plot files as `plot1.png`, `plot2.png`, etc. | ||
|
||
* Create a separate R code file (`plot1.R`, `plot2.R`, etc.) that | ||
constructs the corresponding plot, i.e. code in `plot1.R` constructs | ||
the `plot1.png` plot. Your code file **should include code for reading | ||
the data** so that the plot can be fully reproduced. You should also | ||
include the code that creates the PNG file. | ||
|
||
* Add the PNG file and R code file to your git repository | ||
|
||
When you are finished with the assignment, push your git repository to | ||
GitHub so that the GitHub version of your repository is up to | ||
date. There should be four PNG files and four R code files. | ||
|
||
|
||
The four plots that you will need to construct are shown below. | ||
|
||
This comment has been minimized.
Sorry, something went wrong.
Jameaba
|
||
|
||
### Plot 1 | ||
|
||
This comment has been minimized.
Sorry, something went wrong.
Jameaba
|
||
|
||
![plot of chunk unnamed-chunk-2](figure/unnamed-chunk-2.png) | ||
|
||
|
||
### Plot 2 | ||
|
||
This comment has been minimized.
Sorry, something went wrong.
This comment has been minimized.
Sorry, something went wrong.
Zahramin
|
||
![plot of chunk unnamed-chunk-3](figure/unnamed-chunk-3.png) | ||
|
||
|
||
### Plot 3 | ||
|
||
![plot of chunk unnamed-chunk-4](figure/unnamed-chunk-4.png) | ||
|
||
|
||
### Plot 4 | ||
|
||
![plot of chunk unnamed-chunk-5](figure/unnamed-chunk-5.png) | ||
|
||
Plotting Assignment 1 for Exploratory Data Analysis |
3 comments
on commit 3edfc82
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
setwd(C:/Users/Jill/Desktop/Coursera")
data <- read.csv("C:/Users/Jill/Desktop/Coursera/February.txt", header=TRUE, sep=";")
power <- read.table(file, header=T, sep=";")
power$Date <- as.Date(power$Date, format="%d/%m/%Y")
df <- power[(power$Date=="2007-02-01") | (power$Date=="2007-02-02"),]
df$Global_active_power <- as.numeric(as.character(df$Global_active_power))
df$Global_reactive_power <- as.numeric(as.character(df$Global_reactive_power))
df$Voltage <- as.numeric(as.character(df$Voltage))
df <- transform(df, timestamp=as.POSIXct(paste(Date, Time)), "%d/%m/%Y %H:%M:%S")
df$Sub_metering_1 <- as.numeric(as.character(df$Sub_metering_1))
df$Sub_metering_2 <- as.numeric(as.character(df$Sub_metering_2))
df$Sub_metering_3 <- as.numeric(as.character(df$Sub_metering_3))
plot1 <- function() {
hist(df$Global_active_power, main = paste("Global Active Power"), col="red", xlab="Global Active Power (kilowatts)")
dev.copy(png, file="plot1.png", width=480, height=480)
dev.off()
cat("Plot1.png has been saved in", getwd())
}
plot1()
Plot1.png has been saved in C:/Users/Jill/Desktop/Coursera
plot2 <- function() {
plot(df$timestamp,df$Global_active_power, type="l", xlab="", ylab="Global Active Power (kilowatts)")
dev.copy(png, file="plot2.png", width=480, height=480)
dev.off()
cat("plot2.png has been saved in", getwd())
}
plot2()
Plot2.png has been saved in C:/Users/JIll/Desktop/Coursera
plot3 <- function() {
plot(df$timestamp,df$Sub_metering_1, type="l", xlab="", ylab="Energy sub metering")
lines(df$timestamp,df$Sub_metering_2,col="red")
lines(df$timestamp,df$Sub_metering_3,col="blue")
legend("topright", col=c("black","red","blue"), c("Sub_metering_1 ","Sub_metering_2 ", "Sub_metering_3 "),lty=c(1,1), lwd=c(1,1))
dev.copy(png, file="plot3.png", width=480, height=480)
dev.off()
cat("plot3.png has been saved in", getwd())
}
plot3()
plot3.png has been saved in C:/Users/Jill/Desktop/Coursera
plot4 <- function() {
par(mfrow=c(2,2))
##PLOT 1
plot(df$timestamp,df$Global_active_power, type="l", xlab="", ylab="Global Active Power")
##PLOT 2
plot(df$timestamp,df$Voltage, type="l", xlab="datetime", ylab="Voltage")
##PLOT 3
plot(df$timestamp,df$Sub_metering_1, type="l", xlab="", ylab="Energy sub metering")
lines(df$timestamp,df$Sub_metering_2,col="red")
lines(df$timestamp,df$Sub_metering_3,col="blue")
legend("topright", col=c("black","red","blue"), c("Sub_metering_1 ","Sub_metering_2 ", "Sub_metering_3 "),lty=c(1,1), bty="n", cex=.5) #bty removes the box, cex shrinks the text, spacing added after labels so it renders correctly
#PLOT 4
plot(df$timestamp,df$Global_reactive_power, type="l", xlab="datetime", ylab="Global_reactive_power")
#OUTPUT
dev.copy(png, file="plot4.png", width=480, height=480)
dev.off()
cat("plot4.png has been saved in", getwd())
}
plot4()
plot4.png has been saved in C:/Users/Jill/Desktop/Coursera
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I took the data set in the import data set option, data name=hpc
dff<-subset(hpc,hpc$Date=="1/2/2007"|hpc$Date=="2/2/2007")
#PLOT 1
plot1<-hist(dff$Global_active_power,col="red",xlab="Global Active Power (kilowatts)",
ylab="Frequency",main="Global Active Power")
dev.copy(png, file="plot1.png", width=480, height=480)
dev.off()
PLOT 2
datetime <- strptime(paste(dff$Date, dff$Time, sep=" "), "%d/%m/%Y %H:%M:%S")
plot2<- plot(datetime,dff$Global_active_power,ylab="Global Active Power(kilowatts)",
xlab="",type="l")
dev.copy(png, file="plot2.png", width=480, height=480)
dev.off()
PLOT 3
plot3<-plot(datetime,dff$Sub_metering_1,type="n",xlab = "",ylab="Energy sub metering")
lines(datetime,dff$Sub_metering_1,col="black")
lines(datetime,dff$Sub_metering_2,col="red")
lines(datetime,dff$Sub_metering_3,col="blue")
legend("topright",lty=c(1,1,1),col=c("black","red","blue"),legend=c("Sub_meeting_1","Sub_meeting_2","Sub_meeting_3"),cex=0.2)
dev.copy(png, file="plot3.png", width=480, height=480)
dev.off()
PLOT 4
library(ggplot2)
par(mfrow=c(2,2))
plot(datetime,dff$Global_active_power,ylab="Global Active Power",
xlab="",type="l")
plott4<-plot(datetime,dff$Voltage,ylab="Voltage",type="l")
plot(datetime,dff$Sub_metering_1,type="n",xlab = "",ylab="Energy sub metering")
lines(datetime,dff$Sub_metering_1,col="black")
lines(datetime,dff$Sub_metering_2,col="red")
lines(datetime,dff$Sub_metering_3,col="blue")
plot(datetime,dff$Global_reactive_power,type="l")
legend("topright",lty=c(1,1,1),col=c("black","red","blue"),legend=c("Sub_meeting_1","Sub_meeting_2","Sub_metering_3"))
dev.copy(png, file="plott4.png", width=480, height=480)
dev.off()
df<-read.table("household_power_consumption.txt", header = T, sep=";", comment.char="%", stringsAsFactors=FALSE, na.strings="?") ## reads data