Source data

One of the most exciting areas in all of data science right now is wearable computing. Companies like Fitbit, Nike, and Jawbone Up are racing to develop the most advanced algorithms to attract new users. The source data come from the Human Activity Recognition Using Smartphones Data Set, collected from the accelerometers of the Samsung Galaxy S smartphone. A full description is available at the site where the data were obtained:

http://archive.ics.uci.edu/ml/datasets/Human+Activity+Recognition+Using+Smartphones

The data for the project are available here:

https://d396qusza40orc.cloudfront.net/getdata%2Fprojectfiles%2FUCI%20HAR%20Dataset.zip
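
As a rough illustration, the archive above could be fetched and unpacked from R along these lines. This is only a sketch: the helper name `fetch_har_data`, the local file name `UCI_HAR_Dataset.zip`, and the assumption that the archive unpacks into a folder called `UCI HAR Dataset` are illustrative and not taken from run_analysis.R.

```r
# URL of the project data, as given above
har_url <- "https://d396qusza40orc.cloudfront.net/getdata%2Fprojectfiles%2FUCI%20HAR%20Dataset.zip"

# Hypothetical helper: download the zip into dest_dir and unpack it there.
# Returns the path of the folder the archive is assumed to unpack into.
fetch_har_data <- function(url = har_url, dest_dir = ".") {
  zip_path <- file.path(dest_dir, "UCI_HAR_Dataset.zip")  # assumed local name
  if (!file.exists(zip_path)) {
    # mode = "wb" keeps the binary zip intact on Windows
    download.file(url, destfile = zip_path, mode = "wb")
  }
  unzip(zip_path, exdir = dest_dir)
  file.path(dest_dir, "UCI HAR Dataset")  # assumed top-level folder name
}

# Usage (needs network access, so it is left commented out here):
# data_dir <- fetch_har_data()
```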

The data

run_analysis.R creates several intermediate datasets along with the final dataset. These are:

activities: contains activity labels with names
elements: contains the file paths inside the unzipped folder
features: contains feature (variable) names
FINALtidydataset: contains final tidy dataset
MeanSDdataset: contains the mean and standard deviation variables along with subjectID, activityID and activityNames
mergedDataset: contains the merged testing and training datasets
testx and trainx: contains testing and training datasets
testy and trainy: contains testing and training labels
testsubject and trainsubject: contains testing and training subjectID(s)
MeanSDcolnames: contains a logical vector flagging which variable names refer to means and standard deviations
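
To make the lookup objects more concrete, here is a minimal sketch of how `features` and `activities` might be read with `read.table`. It writes two tiny stand-in files to a temporary folder so that it runs without the real data. The column name `featureID` and the toy file contents are assumptions made for illustration; the actual script may name or load things differently.

```r
# Create small stand-ins for features.txt and activity_labels.txt
toy_dir <- tempfile("har_toy_")
dir.create(toy_dir)
writeLines(c("1 tBodyAcc-mean()-X", "2 tBodyAcc-std()-X", "3 tBodyAcc-max()-X"),
           file.path(toy_dir, "features.txt"))
writeLines(c("1 WALKING", "2 SITTING"),
           file.path(toy_dir, "activity_labels.txt"))

# features: one row per variable, with its index and name
features <- read.table(file.path(toy_dir, "features.txt"),
                       col.names = c("featureID", "featureName"),
                       stringsAsFactors = FALSE)

# activities: maps each activityID to a readable activity name
activities <- read.table(file.path(toy_dir, "activity_labels.txt"),
                         col.names = c("activityID", "activityNames"),
                         stringsAsFactors = FALSE)
```

The measurement, label, and subject files (testx, trainx, testy, and so on) could be read the same way, one `read.table` call per file.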

The transformation

The R script run_analysis.R performs the following five steps:

1. Reading in the files and merging the training and the test sets to create one dataset.
2. Extracting only the measurements on the mean and standard deviation for each measurement.
3. Using descriptive activity names to name the activities in the data set.
4. Appropriately labeling the data set with descriptive variable names.
5. Creating a second, independent tidy data set with the average of each variable for each activity and each subject:
5.1 Making the second tidy data set (FINALtidyset)
5.2 Writing the second tidy data set to a .txt file (tidy.txt)
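
The steps above can be sketched on a toy dataset as follows. This is not the code from run_analysis.R: the data values are made up, the regular expression used to pick mean and standard deviation columns is one possible choice, and the renaming rules in step 4 are hypothetical. The object names follow the ones listed in this code book.

```r
# Toy stand-ins for the real files (values are invented for illustration)
features <- c("tBodyAcc-mean()-X", "tBodyAcc-std()-X", "tBodyAcc-max()-X")
activities <- data.frame(activityID = 1:2,
                         activityNames = c("WALKING", "SITTING"),
                         stringsAsFactors = FALSE)
trainx <- as.data.frame(matrix(c(0.1, 0.3, 0.5, 0.7, 0.9, 1.1), nrow = 2))
testx  <- as.data.frame(matrix(c(0.2, 0.4, 0.6, 0.8, 1.0, 1.2), nrow = 2))
trainy <- data.frame(activityID = c(1L, 1L))
testy  <- data.frame(activityID = c(2L, 2L))
trainsubject <- data.frame(subjectID = c(1L, 1L))
testsubject  <- data.frame(subjectID = c(2L, 2L))

# Step 1: merge training and test sets into one dataset
mergedDataset <- cbind(rbind(trainsubject, testsubject),
                       rbind(trainy, testy),
                       rbind(trainx, testx))
names(mergedDataset) <- c("subjectID", "activityID", features)

# Step 2: keep the ID columns plus mean() and std() measurements
# (assumed pattern; it deliberately skips names such as meanFreq())
MeanSDcolnames <- grepl("mean\\(\\)|std\\(\\)", names(mergedDataset))
idCols <- names(mergedDataset) %in% c("subjectID", "activityID")
MeanSDdataset <- mergedDataset[, idCols | MeanSDcolnames]

# Step 3: attach descriptive activity names by looking up each activityID
MeanSDdataset$activityNames <- activities$activityNames[
  match(MeanSDdataset$activityID, activities$activityID)]

# Step 4: tidy the variable names (hypothetical renaming rules)
newNames <- gsub("\\(\\)", "", names(MeanSDdataset))
newNames <- gsub("-", "_", newNames)
newNames <- sub("^t", "time", newNames)
names(MeanSDdataset) <- newNames

# Step 5.1: average each variable for each subject and activity
measureCols <- setdiff(names(MeanSDdataset),
                       c("subjectID", "activityID", "activityNames"))
FINALtidyset <- aggregate(MeanSDdataset[measureCols],
                          by = list(subjectID = MeanSDdataset$subjectID,
                                    activityNames = MeanSDdataset$activityNames),
                          FUN = mean)

# Step 5.2: write the result (to a temporary folder in this sketch)
write.table(FINALtidyset, file = file.path(tempdir(), "tidy.txt"),
            row.names = FALSE)
```

With the toy input, `FINALtidyset` has one row per subject and activity pair and one column per retained measurement, which is the shape the final tidy data set is meant to have.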

The code assumes all the data is present in the same folder, uncompressed and with file names unaltered.
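
One way to check this assumption before running the script is sketched below. The helper `missing_har_files` is hypothetical, and the sketch assumes the data sits in the `UCI HAR Dataset` folder layout produced by unzipping the archive; adjust `data_dir` if the files live elsewhere.

```r
# Relative paths of the files as they appear in the unzipped archive
expected_files <- c(
  "activity_labels.txt", "features.txt",
  file.path("train", c("X_train.txt", "y_train.txt", "subject_train.txt")),
  file.path("test", c("X_test.txt", "y_test.txt", "subject_test.txt"))
)

# Hypothetical helper: returns the expected files that are missing under data_dir
missing_har_files <- function(data_dir = "UCI HAR Dataset") {
  present <- file.exists(file.path(data_dir, expected_files))
  expected_files[!present]
}

# Report any missing files without stopping the session
missing_files <- missing_har_files()
if (length(missing_files) > 0) {
  message("Missing data files: ", paste(missing_files, collapse = ", "))
}
```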
