From ef8b5ffe7e0b07f8882f365d3677a6955e3df944 Mon Sep 17 00:00:00 2001
From: Ruturaj
Date: Thu, 24 Oct 2019 12:41:35 -0500
Subject: [PATCH] Update README.md

---
 README.md | 53 +++++++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 53 insertions(+)

diff --git a/README.md b/README.md
index 8519bda..ec4b09b 100644
--- a/README.md
+++ b/README.md
@@ -24,3 +24,56 @@ Project Organization
 --------
 
 Project based on the cookiecutter data science project template. #cookiecutterdatascience
+
+### Before Getting Started
+
+If you only want to view the notebook, open it in nbviewer (GitHub has trouble rendering notebooks, particularly those that use plotly):
+
+https://nbviewer.jupyter.org/github/Ruturaj4/jimmy_wrangler/blob/master/notebooks/Jimmy_Wrangler.ipynb
+
+All values are estimates taken from 2018 United States census and homelessness data. All graphs are plotted using `plotly`.
+
+### Aim
+
+Jimmy Wrangler - Data Explorer is an introductory data science project. Its objectives are:
+
+* Find public data sets
+* Do exploratory data analysis by combining them
+* Plot results
+
+### Project Idea
+
+Homelessness count vs. total resident population - 2018, United States
+
+### Gathering datasets
+
+I believe that the datasets used for analysis must come from credible sources.
+Hence, for this project I used data from the United States government: two
+datasets, a homelessness dataset and a US census dataset.
+
+Dataset sources:
+
+* Homelessness data: https://www.hudexchange.info/resources/documents/2007-2018-PIT-Counts-by-CoC.xlsx
+* United States census estimate data: https://www2.census.gov/programs-surveys/popest/tables/2010-2018/state/totals/nst-est2018-01.xlsx
+
+### Plots
+
+1) The following plot shows the estimated percentage of the population that was homeless in 2018, by state.
+
+100 * Homeless Count / Total population
+
+![alt text](/reports/figures/homelessness_percentage.png)
+
+2) The following plot shows the estimated homeless population in 2018 by state, per 10,000 residents. It shows the same data as the plot above, scaled by a factor of 100; I thought it would be better to plot more readable numbers (greater than 1).
+
+10000 * Homeless Count / Total population
+
+![alt text](/reports/figures/homelessness_per_10000.png)
+
+3) The following geo-plot shows the estimated percentage of the population that was homeless in 2018, by state.
+
+![alt text](/reports/figures/homlessness_geoplot.png)
+
+### License
+
+MIT
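
As a rough illustration of the two formulas in the Plots section of the README above, the following sketch computes both rates for a single state. The function name `homelessness_rates` and the sample figures are hypothetical placeholders, not values from the HUD or census datasets, and this is not necessarily how the notebook computes them.

```python
def homelessness_rates(homeless_count, total_population):
    """Return (percentage, per_10000) for one state.

    percentage = 100   * homeless_count / total_population
    per_10000  = 10000 * homeless_count / total_population
    """
    if total_population <= 0:
        raise ValueError("total_population must be positive")
    percentage = 100 * homeless_count / total_population
    per_10000 = 10000 * homeless_count / total_population
    return percentage, per_10000


# Placeholder inputs (homeless count, total population); NOT real data.
sample = {
    "State A": (500, 1_000_000),
    "State B": (1_200, 800_000),
}

rates = {name: homelessness_rates(h, p) for name, (h, p) in sample.items()}

for name, (pct, per_10k) in rates.items():
    print(f"{name}: {pct:.3f}% ({per_10k:.1f} per 10,000)")
```

Because the second rate is always exactly 100 times the first, both plots rank the states identically; only the axis scale differs.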