From 8465fb8edc2ae558886ff4232ead341c14aa5fb1 Mon Sep 17 00:00:00 2001 From: Sean Kross Date: Mon, 26 Oct 2015 21:08:23 -0400 Subject: [PATCH 01/45] Update ddp.md --- ddp.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/ddp.md b/ddp.md index bf9419f0..951724af 100644 --- a/ddp.md +++ b/ddp.md @@ -11,7 +11,7 @@ permalink: /ddp/ - [Shiny app to simulate 401K growth with interactive plots](http://www.mephistosoftware.com/shiny/401k_simulator/) - [Shiny Video Tutorials Playlist on Youtube](http://www.youtube.com/playlist?list=PL6wLL_RojB5xNOhe2OTSd-DPkMLVY9DfB) - [Tutorial on writing Shiny simulation apps](https://github.com/homerhanumat/shinyTutorials) -- [Dockerize a Shiny App](http://www.flaviobarros.net/2015/04/30/dockerizing-a-shiny-app/) +- [Dockerize a Shiny App](http://www.rmining.net/2015/04/30/dockerizing-a-shiny-app/) - [Git pushing Shiny Apps with Docker/Dokku](http://www.flaviobarros.net/2015/05/11/git-pushing-shiny-apps-with-docker-dokku/) - [Share your Shiny Apps with Docker and Kitematic](http://www.flaviobarros.net/2015/08/10/share-your-shiny-apps-with-docker-and-kitematic/) From 816edbe647bee4b3d0346591574cc1ba900afeff Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Carsten=20J=C3=B8rgensen?= Date: Thu, 29 Oct 2015 12:06:02 +0100 Subject: [PATCH 02/45] Added link to http://reproducibleresearch.net/ This website contains lots of useful material about reproducible research. --- curated.md | 1 + 1 file changed, 1 insertion(+) diff --git a/curated.md b/curated.md index 33ff6ca7..613c5f4e 100644 --- a/curated.md +++ b/curated.md @@ -61,6 +61,7 @@ permalink: /curated/ ### Reproducible Research - [Markdown live demo](http://markdown-here.com/livedemo.html) - [Boosting Slides by Ron Meir](https://github.com/Aratinga/Misc/blob/master/BoostingTutorial.pdf) +- [Reproducible Research website](http://reproducibleresearch.net/) ### Machine Learning - [UC Irvine Machine Learning Data Repository](http://archive.ics.uci.edu/ml/) From cf21437e3c8541dd7363ca5f02ff3ac6a435049d Mon Sep 17 00:00:00 2001 From: Len Greski Date: Mon, 30 Nov 2015 18:30:23 -0800 Subject: [PATCH 03/45] Added an article containing step by step instructions for using Github Pages with RStudio for the PML project. --- pml.md | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/pml.md b/pml.md index be4defc3..f2cac02d 100644 --- a/pml.md +++ b/pml.md @@ -25,3 +25,7 @@ permalink: /pml/ ## Comprehensive Notes - Complete notes for [Practical Machine Learning](http://sux13.github.io/DataScienceSpCourseNotes/) + +## Configuring Github Pages with RStudio for PML Project + +- Step by step instructions to [Configure Github Pages with RStudio](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/pml-ghPagesSetup.md) to support the PML course project. From 21b4c2dd22c15474a49392dfae3d555cf086fb29 Mon Sep 17 00:00:00 2001 From: Len Greski Date: Tue, 1 Dec 2015 16:48:27 -0800 Subject: [PATCH 04/45] Added links for Configuring RStudio to work with Git / Github, Mac and Windows versions, plus Using Editor Modes in Coursera Discussion Forums. --- toolbox.md | 5 +++++ 1 file changed, 5 insertions(+) diff --git a/toolbox.md b/toolbox.md index ed0755ac..3c2dfc68 100644 --- a/toolbox.md +++ b/toolbox.md @@ -15,7 +15,12 @@ permalink: /toolbox/ - [Understanding the Relationship Between Git and GitHub](http://www.dataschool.io/github-is-just-dropbox-for-git/) - [Simple Guide to GitHub Forks](http://www.dataschool.io/simple-guide-to-forks-in-github-and-git/) - [Github Repo Tutorial How to fork a repo, download it to your local drive and commit changes ](https://www.youtube.com/watch?v=MY94AIplcaU) +- [Configuring RStudio to work with Git / Github - Mac OSX](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/configureRStudioGitOSXVersion.md) +- [Configuring RStudio to work with Git / Github - Windows](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/configureRStudioGitWindowsVersion.md) ## Comprehensive Notes - Complete notes for [The Data Scientist's Toolbox](http://sux13.github.io/DataScienceSpCourseNotes/) + +## Miscellaneous +- [Using Editor Modes in Coursera Discussion Forum Posts](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/usingMarkdownInForumPosts.md) From 37ae85a7b3092bcd9d417d1ce82dd69accbcf9ce Mon Sep 17 00:00:00 2001 From: Len Greski Date: Wed, 2 Dec 2015 09:51:37 -0800 Subject: [PATCH 05/45] Added articles written to support students in R Programming -- strategy for coding the assignments, grading the SHA-1 has code, Data frame as a list, and 3 articles discussing R and commercial stats packages. --- rprog.md | 9 +++++++++ 1 file changed, 9 insertions(+) diff --git a/rprog.md b/rprog.md index 42e19df4..85162862 100644 --- a/rprog.md +++ b/rprog.md @@ -6,22 +6,31 @@ permalink: /rprog/ ## Programming Assignments +- [Strategy for Coding the Programming Assignments](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/makeItRun.md) - [Tutorial for those struggling with Programming Assignment 1](https://github.com/derekfranks/practice_assignment) - [Tutorial for those struggling with Programming Assignment 2](https://github.com/DanieleP/PA2-clarifying_instructions) - [Tutorial for those struggling with Programming Assignment 3](https://github.com/DanieleP/PA3-tutorial) - [PA1-test: `testthat`, Unit Tests for Programming Assignment 1](https://github.com/cbryant1000/pa1test) - [PA3-test: `testthat`, Unit Tests for Programming Assignment 3](https://github.com/cbryant1000/pa3test) - [Alternative submit script for Programming Assignment 1 that makes submitting more convenient by allowing selection of multiple parts plus prompting if user wants to submit another part before exiting](https://github.com/rchampoux/coursera/blob/master/rprog-scripts-submitscript1.R) +- [Grading the SHA-1 Hash Code](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-gradeSHA1hash.md) ## R Language - [Some notes on the R Language](http://lopezrj.github.io) +- [A Data Frame is Also a List](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/dataFrameAsList.md) ## R language cheatsheet - [R cheatsheet covering all lectures](https://github.com/startupjing/Tech_Notes/blob/master/R/R_language.md) +## R and Commercial Statistics Packages + +- [Commercial Statistics Packages: An Historical Perspective](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/statsPackagesHistory.md) +- [Why is R More Difficult than SAS?](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/whyIsRHarderThanSAS.md) +- [SAS Experience: impediment to learning R?](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/exampleSortRvsSAS.md) + ## Comprehensive Notes - Complete notes for [R Programming](http://sux13.github.io/DataScienceSpCourseNotes/) From 5d1cf2b335f14cfda25cd40429d18f9af8d4f127 Mon Sep 17 00:00:00 2001 From: Sean Kross Date: Sat, 12 Dec 2015 02:11:50 -0500 Subject: [PATCH 06/45] Update index.md --- index.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/index.md b/index.md index 6c035e70..5cc7be2f 100644 --- a/index.md +++ b/index.md @@ -4,7 +4,7 @@ layout: page ## Table of Contents -This is site is meant to serve as a directory for the amazing content the +This site is meant to serve as a directory for the amazing content the community has created around the Data Science Specialization. If you are interested in contributing [click here](https://github.com/DataScienceSpecialization/DataScienceSpecialization.github.io#contributing). From be0a48ab6b0aadb403eaff3ee52308868d170280 Mon Sep 17 00:00:00 2001 From: Leonard Greski Date: Thu, 24 Dec 2015 21:02:10 -0800 Subject: [PATCH 07/45] Added two links: makeCacheMatrix as an Object, and S Objects, R Objects, and Lexical Scoping. --- rprog.md | 3 +++ 1 file changed, 3 insertions(+) diff --git a/rprog.md b/rprog.md index 85162862..f94ac309 100644 --- a/rprog.md +++ b/rprog.md @@ -14,12 +14,15 @@ permalink: /rprog/ - [PA3-test: `testthat`, Unit Tests for Programming Assignment 3](https://github.com/cbryant1000/pa3test) - [Alternative submit script for Programming Assignment 1 that makes submitting more convenient by allowing selection of multiple parts plus prompting if user wants to submit another part before exiting](https://github.com/rchampoux/coursera/blob/master/rprog-scripts-submitscript1.R) - [Grading the SHA-1 Hash Code](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-gradeSHA1hash.md) +- [Assignment 2: makeCacheMatrix as an Object](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprogAssignment2Prototype.md) ## R Language - [Some notes on the R Language](http://lopezrj.github.io) - [A Data Frame is Also a List](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/dataFrameAsList.md) +- [S Objects, R Objects, and Lexical Scoping](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-lexicalScoping.md) + ## R language cheatsheet From 0f621d08873493244d46654e19a2dc4f4566bdc6 Mon Sep 17 00:00:00 2001 From: Flavio Barros Date: Sun, 27 Dec 2015 20:26:25 -0700 Subject: [PATCH 08/45] Update ddp.md As I moved some post from www.flaviobarros.net to www.rmining.net I'm updating the file. --- ddp.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/ddp.md b/ddp.md index 951724af..6a467ab5 100644 --- a/ddp.md +++ b/ddp.md @@ -12,8 +12,8 @@ permalink: /ddp/ - [Shiny Video Tutorials Playlist on Youtube](http://www.youtube.com/playlist?list=PL6wLL_RojB5xNOhe2OTSd-DPkMLVY9DfB) - [Tutorial on writing Shiny simulation apps](https://github.com/homerhanumat/shinyTutorials) - [Dockerize a Shiny App](http://www.rmining.net/2015/04/30/dockerizing-a-shiny-app/) -- [Git pushing Shiny Apps with Docker/Dokku](http://www.flaviobarros.net/2015/05/11/git-pushing-shiny-apps-with-docker-dokku/) -- [Share your Shiny Apps with Docker and Kitematic](http://www.flaviobarros.net/2015/08/10/share-your-shiny-apps-with-docker-and-kitematic/) +- [Git pushing Shiny Apps with Docker/Dokku](http://www.rmining.net/2015/05/11/git-pushing-shiny-apps-with-docker-dokku/) +- [Share your Shiny Apps with Docker and Kitematic](http://www.rmining.net/2015/08/10/share-your-shiny-apps-with-docker-and-kitematic/) ## Comprehensive Notes From 608e561d3934fde6645c746701c8b4e72b247552 Mon Sep 17 00:00:00 2001 From: Flavio Barros Date: Sun, 27 Dec 2015 20:29:01 -0700 Subject: [PATCH 09/45] Update other.md As I moved my domain from www.flaviobarros.net to www.rmining.net I'm updating the links. --- other.md | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/other.md b/other.md index 2bdd10ee..d01d490f 100644 --- a/other.md +++ b/other.md @@ -27,6 +27,6 @@ permalink: /other/ - [Virtual machine with RStudio server and github setup](https://github.com/tboloo/vagrant-rstudio) - A VirtualBox, Vagrant & chef-solo managed virtual machine which provides RStudio server with git & github setup ## Deploying and sharing Shiny Apps with Docker -- [Dockerize a Shiny App](http://www.flaviobarros.net/2015/04/30/dockerizing-a-shiny-app/) -- [Git pushing Shiny Apps with Docker/Dokku](http://www.flaviobarros.net/2015/05/11/git-pushing-shiny-apps-with-docker-dokku/) -- [Share your Shiny Apps with Docker and Kitematic](http://www.flaviobarros.net/2015/08/10/share-your-shiny-apps-with-docker-and-kitematic/) +- [Dockerize a Shiny App](http://www.rmining.net/2015/04/30/dockerizing-a-shiny-app/) +- [Git pushing Shiny Apps with Docker/Dokku](http://www.rmining.net/2015/05/11/git-pushing-shiny-apps-with-docker-dokku/) +- [Share your Shiny Apps with Docker and Kitematic](http://www.rmining.net/2015/08/10/share-your-shiny-apps-with-docker-and-kitematic/) From 2035298f25186b4f0f5285a453f0cc016c0f3e37 Mon Sep 17 00:00:00 2001 From: Leonard Greski Date: Sun, 3 Jan 2016 15:49:59 -0600 Subject: [PATCH 10/45] Add link for Improving Runtime Performance of caret::train() article. --- pml.md | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/pml.md b/pml.md index f2cac02d..0628ebb8 100644 --- a/pml.md +++ b/pml.md @@ -29,3 +29,7 @@ permalink: /pml/ ## Configuring Github Pages with RStudio for PML Project - Step by step instructions to [Configure Github Pages with RStudio](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/pml-ghPagesSetup.md) to support the PML course project. + +## Improving Runtime Performance of Caret + +- Step by step instructions to [implement parallel processing in caret::train()](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/pml-randomForestPerformance.md) on a random forest model, along with runtime performance analysis for a variety of laptops, ranging from an Intel Atom-based tablet to a quad-core i7 processor. From 55d85379df353ece4855ca669caecab6582cbea8 Mon Sep 17 00:00:00 2001 From: Len Greski Date: Sat, 9 Jan 2016 06:51:04 -0800 Subject: [PATCH 11/45] Add two articles: Common Mistakes / overwriting R functions with data objects, and Thinking in R versus Thinking in SAS. --- rprog.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/rprog.md b/rprog.md index f94ac309..0f3554f8 100644 --- a/rprog.md +++ b/rprog.md @@ -22,6 +22,7 @@ permalink: /rprog/ - [Some notes on the R Language](http://lopezrj.github.io) - [A Data Frame is Also a List](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/dataFrameAsList.md) - [S Objects, R Objects, and Lexical Scoping](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-lexicalScoping.md) +- [Common R Mistakes: Overwriting Functions with Data Objects](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-overwritingRFunctions.md) ## R language cheatsheet @@ -33,6 +34,7 @@ permalink: /rprog/ - [Commercial Statistics Packages: An Historical Perspective](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/statsPackagesHistory.md) - [Why is R More Difficult than SAS?](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/whyIsRHarderThanSAS.md) - [SAS Experience: impediment to learning R?](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/exampleSortRvsSAS.md) +- [Thinking in R versus Thinking in SAS](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/exampleSortRvsSAS.md) ## Comprehensive Notes From 814f6ad0d8d98d6b024b3ecf7779d28ef4a90296 Mon Sep 17 00:00:00 2001 From: Len Greski Date: Sat, 9 Jan 2016 07:02:59 -0800 Subject: [PATCH 12/45] Add five articles related to statinf class. --- statinf.md | 5 +++++ 1 file changed, 5 insertions(+) diff --git a/statinf.md b/statinf.md index c5a85435..ca1cb346 100644 --- a/statinf.md +++ b/statinf.md @@ -6,6 +6,11 @@ permalink: /statinf/ - [Why degrees of freedom decrease for sample variance](https://github.com/Manu58/bias/blob/master/bias.pdf) - [Analysis of exponential distribution of births data set from the CDC](https://gist.github.com/ProgramErgoSum/5316008387746fcd84de) +- [Exponential Distribution / Central Limit Theorem - Assignment Checklist](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/statinf-expDistChecklist.md) +- [ToothGrowth Analysis - Assignment Checklist](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/ToothGrowthChecklist.md) +- [Exploratory Data Analysis in ToothGrowth Assignment](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/edaInToothGrowthAnalysis.md), explaining the exploratory data analysis requirement for students who have not taken the *Exploratory Data Analysis* course prior to taking *Statistical Inference*. +- [Using MathJax with Discussion Forums, R Markdown, and Github Pages](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/mathjaxWithGithubMarkdown.md) +- [Kable Tables with Data Frames](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/kableDataFrameTable.md) illustrates how to display a custom table in a `knitr()` document by creating a data frame to contain the information to be rendered with `kable()`. ## Comprehensive Notes From 3417573bec53a9b872573ec63bb2463677a877ad Mon Sep 17 00:00:00 2001 From: Len Greski Date: Sat, 9 Jan 2016 07:05:21 -0800 Subject: [PATCH 13/45] Add article on configuring shinyapps.io application timeout. --- ddp.md | 1 + 1 file changed, 1 insertion(+) diff --git a/ddp.md b/ddp.md index 6a467ab5..3240b3b3 100644 --- a/ddp.md +++ b/ddp.md @@ -14,6 +14,7 @@ permalink: /ddp/ - [Dockerize a Shiny App](http://www.rmining.net/2015/04/30/dockerizing-a-shiny-app/) - [Git pushing Shiny Apps with Docker/Dokku](http://www.rmining.net/2015/05/11/git-pushing-shiny-apps-with-docker-dokku/) - [Share your Shiny Apps with Docker and Kitematic](http://www.rmining.net/2015/08/10/share-your-shiny-apps-with-docker-and-kitematic/) +- [Shinyapps.io: Configuring Application Timeout](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/dataProd-shinyTimeoutConfig.md) ## Comprehensive Notes From 1a9c230c7346005f4e8e8d21ed95355d3c5b14a8 Mon Sep 17 00:00:00 2001 From: Paul Adamson Date: Mon, 18 Jan 2016 00:03:22 -0500 Subject: [PATCH 14/45] add link to ProjectTemplate blog post --- repres.md | 1 + 1 file changed, 1 insertion(+) diff --git a/repres.md b/repres.md index 5fc1ac8e..cba776f9 100644 --- a/repres.md +++ b/repres.md @@ -9,6 +9,7 @@ permalink: /repres/ - [Trends and severity of Data Breaches](http://rpubs.com/ww44ss/29389) - [Benefit-cost analysis of a park user fee](https://rstudio-pubs-static.s3.amazonaws.com/72135_dc45211d976842c2a9a8c8b5f2472ff0.html) - [Data Lake Integrity](http://rpubs.com/rshane/81297) +- [ProjectTemplate in RStudio with Git](http://padamson.github.io/r/rstudio/projecttemplate/git/2016/01/17/projecttemplate-in-rstudio-with-git.html) ## Comprehensive Notes From f5c7e090c97645c681071b950997bddc7aa89f17 Mon Sep 17 00:00:00 2001 From: Aaron McAdie Date: Fri, 26 Feb 2016 13:11:10 -0800 Subject: [PATCH 15/45] added link to interactive CI repo --- statinf.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/statinf.md b/statinf.md index c5a85435..1cc82800 100644 --- a/statinf.md +++ b/statinf.md @@ -6,7 +6,7 @@ permalink: /statinf/ - [Why degrees of freedom decrease for sample variance](https://github.com/Manu58/bias/blob/master/bias.pdf) - [Analysis of exponential distribution of births data set from the CDC](https://gist.github.com/ProgramErgoSum/5316008387746fcd84de) - +-[Interactive Confidence Interval Visualization](https://github.com/amcadie/interactive_CI) ## Comprehensive Notes - Complete notes for [Statistical Inference](http://sux13.github.io/DataScienceSpCourseNotes/) From 1eb1be9adece106f162b3816161a2b83158090af Mon Sep 17 00:00:00 2001 From: Aaron McAdie Date: Fri, 26 Feb 2016 13:15:04 -0800 Subject: [PATCH 16/45] fixed line break --- statinf.md | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/statinf.md b/statinf.md index 1cc82800..bee6381e 100644 --- a/statinf.md +++ b/statinf.md @@ -6,7 +6,8 @@ permalink: /statinf/ - [Why degrees of freedom decrease for sample variance](https://github.com/Manu58/bias/blob/master/bias.pdf) - [Analysis of exponential distribution of births data set from the CDC](https://gist.github.com/ProgramErgoSum/5316008387746fcd84de) --[Interactive Confidence Interval Visualization](https://github.com/amcadie/interactive_CI) +- [Interactive Confidence Interval Visualization](https://github.com/amcadie/interactive_CI) + ## Comprehensive Notes - Complete notes for [Statistical Inference](http://sux13.github.io/DataScienceSpCourseNotes/) From 6f08d7b5a2b2535dd7831e03fd3d0de2b841e952 Mon Sep 17 00:00:00 2001 From: Leonard Greski Date: Sun, 24 Apr 2016 17:43:20 -0400 Subject: [PATCH 17/45] Added Len Greski to list of community contributors. --- about.md | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/about.md b/about.md index 27bf2391..aa0af257 100644 --- a/about.md +++ b/about.md @@ -19,8 +19,9 @@ The [Data Science Specialization](https://www.coursera.org/specialization/jhudat - [Kevin Markham](http://www.dataschool.io/) - Derek Franks - David Hood +- [Leonard Greski](https://github.com/lgreski) - Michael Sachs - Allan InocĂȘncio de Souza Costa - [stepds](https://github.com/stepds) - Bastiaan Quast -- [Xing Su](http://sux13.github.io/DataScienceSpCourseNotes/) \ No newline at end of file +- [Xing Su](http://sux13.github.io/DataScienceSpCourseNotes/) From e1da253e0a82b8a507129deadfb23e28eb007e36 Mon Sep 17 00:00:00 2001 From: Leonard Greski Date: Sun, 24 Apr 2016 22:37:01 -0400 Subject: [PATCH 18/45] Added link to MiKTeX install walkthrough on Windows 10. --- statinf.md | 1 + 1 file changed, 1 insertion(+) diff --git a/statinf.md b/statinf.md index ca1cb346..051fb7ed 100644 --- a/statinf.md +++ b/statinf.md @@ -11,6 +11,7 @@ permalink: /statinf/ - [Exploratory Data Analysis in ToothGrowth Assignment](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/edaInToothGrowthAnalysis.md), explaining the exploratory data analysis requirement for students who have not taken the *Exploratory Data Analysis* course prior to taking *Statistical Inference*. - [Using MathJax with Discussion Forums, R Markdown, and Github Pages](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/mathjaxWithGithubMarkdown.md) - [Kable Tables with Data Frames](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/kableDataFrameTable.md) illustrates how to display a custom table in a `knitr()` document by creating a data frame to contain the information to be rendered with `kable()`. +- [Installing MiKTeK on Windows 10 / Generate a PDF from knitr](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/statinf-generatePDF.md) ## Comprehensive Notes From d6132aba9f57d175e29a8c032cab97775410566e Mon Sep 17 00:00:00 2001 From: Leonard Greski Date: Sat, 7 May 2016 11:58:12 -0400 Subject: [PATCH 19/45] Add article describing "optimal" sample size relative to power calculations. --- statinf.md | 1 + 1 file changed, 1 insertion(+) diff --git a/statinf.md b/statinf.md index ac5c9e9f..1157c59f 100644 --- a/statinf.md +++ b/statinf.md @@ -13,6 +13,7 @@ permalink: /statinf/ - [Kable Tables with Data Frames](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/kableDataFrameTable.md) illustrates how to display a custom table in a `knitr()` document by creating a data frame to contain the information to be rendered with `kable()`. - [Interactive Confidence Interval Visualization](https://github.com/amcadie/interactive_CI) - [Installing MiKTeK on Windows 10 / Generate a PDF from knitr](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/statinf-generatePDF.md) +- [Power calculations: optimal szmple size](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/statinf-optimalSampleSize.md) ## Comprehensive Notes From 925de9276c17799264d54cd3a97b975fe0e4f068 Mon Sep 17 00:00:00 2001 From: Leonard Greski Date: Sun, 15 May 2016 19:02:27 -0400 Subject: [PATCH 20/45] Add R Onboarding for SAS Users --- rprog.md | 1 + 1 file changed, 1 insertion(+) diff --git a/rprog.md b/rprog.md index 0f3554f8..524517cd 100644 --- a/rprog.md +++ b/rprog.md @@ -31,6 +31,7 @@ permalink: /rprog/ ## R and Commercial Statistics Packages +- [R Onboarding for SAS Users](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-onboardingForSASUsers.md) Provides an overview and links to a variety of resources to help people with SAS experience make the transition to R - [Commercial Statistics Packages: An Historical Perspective](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/statsPackagesHistory.md) - [Why is R More Difficult than SAS?](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/whyIsRHarderThanSAS.md) - [SAS Experience: impediment to learning R?](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/exampleSortRvsSAS.md) From bf50b365586f7a5ffd02527fb97fe0096f0ded76 Mon Sep 17 00:00:00 2001 From: lgreski Date: Sun, 29 May 2016 22:47:21 -0400 Subject: [PATCH 21/45] Add article on forms of the Extract Operator --- rprog.md | 1 + 1 file changed, 1 insertion(+) diff --git a/rprog.md b/rprog.md index 524517cd..a5ee9afb 100644 --- a/rprog.md +++ b/rprog.md @@ -23,6 +23,7 @@ permalink: /rprog/ - [A Data Frame is Also a List](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/dataFrameAsList.md) - [S Objects, R Objects, and Lexical Scoping](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-lexicalScoping.md) - [Common R Mistakes: Overwriting Functions with Data Objects](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-overwritingRFunctions.md) +- [Forms of the Extract Operator](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-extractOperator.md) ## R language cheatsheet From 0223ab1b46adac26e2ddb1a75171efd9dfb17a4a Mon Sep 17 00:00:00 2001 From: lgreski Date: Fri, 17 Jun 2016 20:21:14 -0400 Subject: [PATCH 22/45] Added article explaining use of binomial theorem in Combining Predictors lecture. --- pml.md | 6 ++++-- 1 file changed, 4 insertions(+), 2 deletions(-) diff --git a/pml.md b/pml.md index 0628ebb8..1054002d 100644 --- a/pml.md +++ b/pml.md @@ -22,9 +22,10 @@ permalink: /pml/ - [Comparing Supervised Learning Algorithms](http://www.dataschool.io/comparing-supervised-learning-algorithms/): Comparing 8 common supervised learning algorithms (for regression and classification) on 13 different dimensions. -## Comprehensive Notes +## Content Related to the Lectures - Complete notes for [Practical Machine Learning](http://sux13.github.io/DataScienceSpCourseNotes/) +- [Week 4: Combining Predictors -- Math Explained](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/pml-combiningPredictorsBinomial.md) ## Configuring Github Pages with RStudio for PML Project @@ -32,4 +33,5 @@ permalink: /pml/ ## Improving Runtime Performance of Caret -- Step by step instructions to [implement parallel processing in caret::train()](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/pml-randomForestPerformance.md) on a random forest model, along with runtime performance analysis for a variety of laptops, ranging from an Intel Atom-based tablet to a quad-core i7 processor. +- Step by step instructions to [implement parallel processing in caret::train()](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/pml-randomForestPerformance.md) on a random forest model, along with runtime performance analysis for a variety of laptops, ranging from an Intel Atom-based tablet to a quad-core i7 processor. + From 318b3c1217755fc2ecfde5aeec8fb78f24486432 Mon Sep 17 00:00:00 2001 From: lgreski Date: Fri, 17 Jun 2016 21:14:16 -0400 Subject: [PATCH 23/45] add 2 articles: breaking down pollutantmean, and a SAS version of pollutantmean? --- rprog.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/rprog.md b/rprog.md index a5ee9afb..80e56ac4 100644 --- a/rprog.md +++ b/rprog.md @@ -8,6 +8,8 @@ permalink: /rprog/ - [Strategy for Coding the Programming Assignments](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/makeItRun.md) - [Tutorial for those struggling with Programming Assignment 1](https://github.com/derekfranks/practice_assignment) +- [Breaking Down pollutantmean](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-discussPollutantmean.md) +- [A SAS Version of pollutantmean?](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-pollutantmeanSASVersion.md) - [Tutorial for those struggling with Programming Assignment 2](https://github.com/DanieleP/PA2-clarifying_instructions) - [Tutorial for those struggling with Programming Assignment 3](https://github.com/DanieleP/PA3-tutorial) - [PA1-test: `testthat`, Unit Tests for Programming Assignment 1](https://github.com/cbryant1000/pa1test) From ea32fb68c6c13fb67e43fbf17f5ade4ec7d30d26 Mon Sep 17 00:00:00 2001 From: Andrew Voshchevoz Date: Mon, 20 Jun 2016 18:02:45 +0300 Subject: [PATCH 24/45] Updated other.md Updated other.md to include link on HTTP/HTTPS proxy configuration guide --- other.md | 1 + 1 file changed, 1 insertion(+) diff --git a/other.md b/other.md index d01d490f..ddb49135 100644 --- a/other.md +++ b/other.md @@ -11,6 +11,7 @@ permalink: /other/ - [Installing Some Basic R Packages in Ubuntu; Ibrahim El Merehbi](http://elmerehbi.wordpress.com/2014/09/09/installing-some-basic-r-packages-in-ubuntu) - [Using Projects in RStudio](https://support.rstudio.com/hc/en-us/articles/200526207-Using-Projects) - [Using Version Control with RStudio](https://support.rstudio.com/hc/en-us/articles/200532077-Version-Control-with-Git-and-SVN) +- [Using R behind HTTP/HTTPS Proxy](https://support.rstudio.com/hc/en-us/articles/200488488-Configuring-R-to-Use-an-HTTP-or-HTTPS-Proxy) ### Ignoring R & RStudio files - [gitignore template for R](https://github.com/github/gitignore/blob/master/R.gitignore) (source:[gitignore](https://github.com/github/gitignore)) From 4a18de5fe20a16fefe10bbd0eeb8bb11d051de85 Mon Sep 17 00:00:00 2001 From: lgreski Date: Mon, 4 Jul 2016 08:22:49 -0400 Subject: [PATCH 25/45] Add article on permutation tests. --- statinf.md | 1 + 1 file changed, 1 insertion(+) diff --git a/statinf.md b/statinf.md index 1157c59f..5ea0baa6 100644 --- a/statinf.md +++ b/statinf.md @@ -14,6 +14,7 @@ permalink: /statinf/ - [Interactive Confidence Interval Visualization](https://github.com/amcadie/interactive_CI) - [Installing MiKTeK on Windows 10 / Generate a PDF from knitr](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/statinf-generatePDF.md) - [Power calculations: optimal szmple size](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/statinf-optimalSampleSize.md) +- [Permutation Tests Explained](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/statinf-permutationTests.md) ## Comprehensive Notes From 53c1f9e8d26f96da4e4013cfb41fe314a538457a Mon Sep 17 00:00:00 2001 From: lgreski Date: Tue, 9 Aug 2016 22:53:58 -0400 Subject: [PATCH 26/45] Add url for Demystifying makeVector() article. --- rprog.md | 1 + 1 file changed, 1 insertion(+) diff --git a/rprog.md b/rprog.md index 80e56ac4..35e8e0d8 100644 --- a/rprog.md +++ b/rprog.md @@ -16,6 +16,7 @@ permalink: /rprog/ - [PA3-test: `testthat`, Unit Tests for Programming Assignment 3](https://github.com/cbryant1000/pa3test) - [Alternative submit script for Programming Assignment 1 that makes submitting more convenient by allowing selection of multiple parts plus prompting if user wants to submit another part before exiting](https://github.com/rchampoux/coursera/blob/master/rprog-scripts-submitscript1.R) - [Grading the SHA-1 Hash Code](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-gradeSHA1hash.md) +- [Assignment 2: Demystifying makeVector](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-breakingDownMakeVector.md) - [Assignment 2: makeCacheMatrix as an Object](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprogAssignment2Prototype.md) From 3c73cb4bc7762bf84e99826c7ebc333b425e9ca1 Mon Sep 17 00:00:00 2001 From: lgreski Date: Sun, 21 Aug 2016 08:54:23 -0400 Subject: [PATCH 27/45] Add article illustrating how to use R to download lecture videos. --- rprog.md | 1 + 1 file changed, 1 insertion(+) diff --git a/rprog.md b/rprog.md index 35e8e0d8..2f6e17fe 100644 --- a/rprog.md +++ b/rprog.md @@ -27,6 +27,7 @@ permalink: /rprog/ - [S Objects, R Objects, and Lexical Scoping](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-lexicalScoping.md) - [Common R Mistakes: Overwriting Functions with Data Objects](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-overwritingRFunctions.md) - [Forms of the Extract Operator](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-extractOperator.md) +- [Creative Use of R: Downloading Course Lectures](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-downloadingLectures.md) Article illustrating how to use R to automate the download of lectures from *Data Science Specialization* courses, such as *R Programming*. Techniques used in this article are helpful to make research reproducible, as required for courses like *Getting and Cleaning Data* and *Reproducible Research*. ## R language cheatsheet From 9308f293a2707f726afc86b288513ba3f5b5f95f Mon Sep 17 00:00:00 2001 From: Devinsuit Date: Tue, 29 Nov 2016 14:13:58 +0300 Subject: [PATCH 28/45] Update broken link #126 --- eda.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/eda.md b/eda.md index 8e179acb..a6c52b13 100644 --- a/eda.md +++ b/eda.md @@ -7,7 +7,7 @@ permalink: /eda/ - [Creating a Kite Graph](http://rpubs.com/thoughtfulbloke/kitegraph) - [Analyzing Top/Green500 Supercomputer Technology Trends](http://github.com/ww44ss/Exascalar-Analysis-) - [Emissions Choropleth Maps](https://github.com/BillSeliger/ExData_Plotting2) -- [Data Analysis using Twitter API and Python](http://blog.impiyush.me/2015/03/data-analysis-using-twitter-api-and.html) +- [Data Analysis using Twitter API and Python](http://blog.impiyush.com/2015/03/data-analysis-using-twitter-api-and.html) ## Comprehensive Notes From 9cbaa239486ac3ca856a6019da246278d0515284 Mon Sep 17 00:00:00 2001 From: Len Greski Date: Sat, 7 Jan 2017 20:53:29 -0500 Subject: [PATCH 29/45] Added a "getting started" section, added DSS value proposition article, and converted URLs to bit.ly versions. --- rprog.md | 41 +++++++++++++++++++++++------------------ 1 file changed, 23 insertions(+), 18 deletions(-) diff --git a/rprog.md b/rprog.md index 2f6e17fe..219011cb 100644 --- a/rprog.md +++ b/rprog.md @@ -1,33 +1,39 @@ --- -layout: page -title: R Programming +title: "R Programming" permalink: /rprog/ +layout: page --- +## Getting Started +- [Resources for R Programming](http://bit.ly/2dhZ8Dy) +- [References for R Programming](http://bit.ly/2b8AxhF) +- [Data Science Specialization Value Proposition](http://bit.ly/2j3EcCn) +- [R Onboarding for SAS Users](http://bit.ly/2dr7yum) + ## Programming Assignments -- [Strategy for Coding the Programming Assignments](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/makeItRun.md) +- [Strategy for Coding the Programming Assignments](http://bit.ly/2ddFh9A) - [Tutorial for those struggling with Programming Assignment 1](https://github.com/derekfranks/practice_assignment) -- [Breaking Down pollutantmean](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-discussPollutantmean.md) -- [A SAS Version of pollutantmean?](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-pollutantmeanSASVersion.md) +- [Breaking Down pollutantmean](http://bit.ly/2cHyiCl) +- [A SAS Version of pollutantmean?](http://bit.ly/2d3DR4e) - [Tutorial for those struggling with Programming Assignment 2](https://github.com/DanieleP/PA2-clarifying_instructions) - [Tutorial for those struggling with Programming Assignment 3](https://github.com/DanieleP/PA3-tutorial) - [PA1-test: `testthat`, Unit Tests for Programming Assignment 1](https://github.com/cbryant1000/pa1test) - [PA3-test: `testthat`, Unit Tests for Programming Assignment 3](https://github.com/cbryant1000/pa3test) - [Alternative submit script for Programming Assignment 1 that makes submitting more convenient by allowing selection of multiple parts plus prompting if user wants to submit another part before exiting](https://github.com/rchampoux/coursera/blob/master/rprog-scripts-submitscript1.R) -- [Grading the SHA-1 Hash Code](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-gradeSHA1hash.md) -- [Assignment 2: Demystifying makeVector](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-breakingDownMakeVector.md) -- [Assignment 2: makeCacheMatrix as an Object](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprogAssignment2Prototype.md) +- [Grading the SHA-1 Hash Code](http://bit.ly/2iUWoB6) +- [Assignment 2: Demystifying makeVector](http://bit.ly/2bTXXfq) +- [Assignment 2: makeCacheMatrix as an Object](http://bit.ly/2byUe4e) ## R Language - [Some notes on the R Language](http://lopezrj.github.io) -- [A Data Frame is Also a List](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/dataFrameAsList.md) -- [S Objects, R Objects, and Lexical Scoping](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-lexicalScoping.md) -- [Common R Mistakes: Overwriting Functions with Data Objects](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-overwritingRFunctions.md) -- [Forms of the Extract Operator](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-extractOperator.md) -- [Creative Use of R: Downloading Course Lectures](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-downloadingLectures.md) Article illustrating how to use R to automate the download of lectures from *Data Science Specialization* courses, such as *R Programming*. Techniques used in this article are helpful to make research reproducible, as required for courses like *Getting and Cleaning Data* and *Reproducible Research*. +- [A Data Frame is Also a List](http://bit.ly/2fmMRAp) +- [S Objects, R Objects, and Lexical Scoping](http://bit.ly/2dtOSXi) +- [Common R Mistakes: Overwriting Functions with Data Objects](http://bit.ly/2i3gmoA) +- [Forms of the Extract Operator](http://bit.ly/2bzLYTL) +- [Creative Use of R: Downloading Course Lectures](http://bit.ly/2bGlI7R) Article illustrating how to use R to automate the download of lectures from *Data Science Specialization* courses, such as *R Programming*. Techniques used in this article are helpful to make research reproducible, as required for courses like *Getting and Cleaning Data* and *Reproducible Research*. ## R language cheatsheet @@ -36,11 +42,10 @@ permalink: /rprog/ ## R and Commercial Statistics Packages -- [R Onboarding for SAS Users](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/rprog-onboardingForSASUsers.md) Provides an overview and links to a variety of resources to help people with SAS experience make the transition to R -- [Commercial Statistics Packages: An Historical Perspective](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/statsPackagesHistory.md) -- [Why is R More Difficult than SAS?](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/whyIsRHarderThanSAS.md) -- [SAS Experience: impediment to learning R?](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/exampleSortRvsSAS.md) -- [Thinking in R versus Thinking in SAS](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/exampleSortRvsSAS.md) +- [R Onboarding for SAS Users](http://bit.ly/2dr7yum) Provides an overview and links to a variety of resources to help people with SAS experience make the transition to R +- [Commercial Statistics Packages: An Historical Perspective](http://bit.ly/2fPj2qN) +- [Why is R More Difficult than SAS?](http://bit.ly/2erxk3A) +- [Thinking in R versus Thinking in SAS](http://bit.ly/2cH3u8x) ## Comprehensive Notes From 39a027e6006717b7574b749b7e14c606f34c3e8f Mon Sep 17 00:00:00 2001 From: MMohey Date: Tue, 18 Apr 2017 16:27:10 +0200 Subject: [PATCH 30/45] Fixed broken link --- ddp.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/ddp.md b/ddp.md index 3240b3b3..895309ef 100644 --- a/ddp.md +++ b/ddp.md @@ -5,7 +5,7 @@ permalink: /ddp/ --- - [Slidify to Github walkthrough](http://rpubs.com/thoughtfulbloke/25103) -- [ggvis and rmarkdown slides with interactive plots](http://qua.st/ggvis-shiny-html5-slides/) +- [ggvis and rmarkdown slides with interactive plots](http://qua.st/ggvis-shiny-html5-slides) ## Shiny - [Shiny app to simulate 401K growth with interactive plots](http://www.mephistosoftware.com/shiny/401k_simulator/) From 5db3cfda2d08498b482817dc3ec1dcbffbb9f5db Mon Sep 17 00:00:00 2001 From: Len Greski Date: Sat, 20 May 2017 04:52:49 -0400 Subject: [PATCH 31/45] Add articles related to R programming course --- rprog.md | 7 ++++++- 1 file changed, 6 insertions(+), 1 deletion(-) diff --git a/rprog.md b/rprog.md index 219011cb..47df54d1 100644 --- a/rprog.md +++ b/rprog.md @@ -15,6 +15,7 @@ layout: page - [Strategy for Coding the Programming Assignments](http://bit.ly/2ddFh9A) - [Tutorial for those struggling with Programming Assignment 1](https://github.com/derekfranks/practice_assignment) - [Breaking Down pollutantmean](http://bit.ly/2cHyiCl) +- [Assignment 1: A More Elegant Solution](http://bit.ly/2kwBBlK) - [A SAS Version of pollutantmean?](http://bit.ly/2d3DR4e) - [Tutorial for those struggling with Programming Assignment 2](https://github.com/DanieleP/PA2-clarifying_instructions) - [Tutorial for those struggling with Programming Assignment 3](https://github.com/DanieleP/PA3-tutorial) @@ -23,7 +24,7 @@ layout: page - [Alternative submit script for Programming Assignment 1 that makes submitting more convenient by allowing selection of multiple parts plus prompting if user wants to submit another part before exiting](https://github.com/rchampoux/coursera/blob/master/rprog-scripts-submitscript1.R) - [Grading the SHA-1 Hash Code](http://bit.ly/2iUWoB6) - [Assignment 2: Demystifying makeVector](http://bit.ly/2bTXXfq) -- [Assignment 2: makeCacheMatrix as an Object](http://bit.ly/2byUe4e) +- [Assignment 2: makeCacheMatrix as an Object](http://bit.ly/2byUe4e) ## R Language @@ -33,7 +34,11 @@ layout: page - [S Objects, R Objects, and Lexical Scoping](http://bit.ly/2dtOSXi) - [Common R Mistakes: Overwriting Functions with Data Objects](http://bit.ly/2i3gmoA) - [Forms of the Extract Operator](http://bit.ly/2bzLYTL) +- [Functions to Sort Data Frames](http://bit.ly/2dxItzw) - [Creative Use of R: Downloading Course Lectures](http://bit.ly/2bGlI7R) Article illustrating how to use R to automate the download of lectures from *Data Science Specialization* courses, such as *R Programming*. Techniques used in this article are helpful to make research reproducible, as required for courses like *Getting and Cleaning Data* and *Reproducible Research*. +- [Lexical Scoping and Statistical Computing](http://bit.ly/2cmqAPy) Article by Robert Gentleman and Ross Ihaka at the University of Auckland describing how lexical scoping works, and why it is valuable in statistical computing. +- [Data Science Job Report 2017: R Passes SAS, But Python Leaves Them Both Behind](http://bit.ly/2oCHulX) Bob Muenchen's take on the job market for various data science langauges. + ## R language cheatsheet From 497330b024ebfe0bb89c9df644bd5f64a6ff7505 Mon Sep 17 00:00:00 2001 From: Len Greski Date: Sat, 20 May 2017 04:58:05 -0400 Subject: [PATCH 32/45] add articles --- getclean.md | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/getclean.md b/getclean.md index e2b8ce28..6076632d 100644 --- a/getclean.md +++ b/getclean.md @@ -16,6 +16,10 @@ permalink: /getclean/ - [Codebook template that can be used in the Getting and Cleaning Data project](https://gist.github.com/JorisSchut/dbc1fc0402f28cad9b41) - ["Real world" example - reading American Community Survey 2000 PUMS Data:](https://github.com/lgreski/acsexample) Demonstrates how to extract records of a given type from a data file containing multiple record types, and how to use an Excel-based code book to specify arguments for reading a fixed-width file. - [18 Months of CTA advice](https://thoughtfulbloke.wordpress.com/2015/08/31/hello-world) +- [Common Problems: Quiz 1 - Missing Java Runtime](http://bit.ly/2jjtyXM) Explains how to solve the problem of a missing Java Runtime for the question that requires students to process a Microsoft Excel spreadsheet. +- [Strategy for Reading Files & APIs / Quiz 2](http://bit.ly/2e4L5oF) +- [Common Problems: Quiz 2 - sqldf() driver fails to connect](http://bit.ly/2kD2KTY) +- [Tutorial: Downloading Files](http://bit.ly/2iP2suj) Illustrates various ways of downloading files, including binary and text files. ## Comprehensive Notes From 51c9e7672e32aa689e9ca29412699bc8ca857e0f Mon Sep 17 00:00:00 2001 From: Len Greski Date: Sat, 20 May 2017 05:06:20 -0400 Subject: [PATCH 33/45] Add capstone page to index, and content for capstone page --- .gitignore | 1 + index.md | 1 + 2 files changed, 2 insertions(+) diff --git a/.gitignore b/.gitignore index 058dd6c2..d17e5544 100644 --- a/.gitignore +++ b/.gitignore @@ -1,3 +1,4 @@ _site .DS_Store .Rhistory +.Rproj.user diff --git a/index.md b/index.md index 5cc7be2f..761f3e41 100644 --- a/index.md +++ b/index.md @@ -17,6 +17,7 @@ interested in contributing [click here](https://github.com/DataScienceSpecializa 7. [Regression Models](/regmod/) 8. [Practical Machine Learning](/pml/) 9. [Developing Data Products](/ddp/) +10. [Capstone](/capstone/) - [Other Resources](/other/) - [Curated Pages](/curated/) From 4dc95e6f4d52284a801b37f2e2eda1fddb73c163 Mon Sep 17 00:00:00 2001 From: Aaron Date: Tue, 23 May 2017 10:38:39 -0400 Subject: [PATCH 34/45] Added shiny choropleth app code available at https://github.com/amsilvr/shiny_choropleth --- ddp.md | 1 + 1 file changed, 1 insertion(+) diff --git a/ddp.md b/ddp.md index 895309ef..0d39e861 100644 --- a/ddp.md +++ b/ddp.md @@ -8,6 +8,7 @@ permalink: /ddp/ - [ggvis and rmarkdown slides with interactive plots](http://qua.st/ggvis-shiny-html5-slides) ## Shiny +- [Shiny app using leaflet to create a choropleth of all Wireless Emergency Alerts sent through PBS WARN](https://silverman.shinyapps.io/warn_wea/) - [Shiny app to simulate 401K growth with interactive plots](http://www.mephistosoftware.com/shiny/401k_simulator/) - [Shiny Video Tutorials Playlist on Youtube](http://www.youtube.com/playlist?list=PL6wLL_RojB5xNOhe2OTSd-DPkMLVY9DfB) - [Tutorial on writing Shiny simulation apps](https://github.com/homerhanumat/shinyTutorials) From 452a58760df645fc2898a47e75107c67d79ad2c5 Mon Sep 17 00:00:00 2001 From: Aaron Date: Tue, 23 May 2017 13:25:26 -0400 Subject: [PATCH 35/45] Update ddp.md --- ddp.md | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/ddp.md b/ddp.md index 0d39e861..c1a411fc 100644 --- a/ddp.md +++ b/ddp.md @@ -8,7 +8,9 @@ permalink: /ddp/ - [ggvis and rmarkdown slides with interactive plots](http://qua.st/ggvis-shiny-html5-slides) ## Shiny -- [Shiny app using leaflet to create a choropleth of all Wireless Emergency Alerts sent through PBS WARN](https://silverman.shinyapps.io/warn_wea/) +- Choropleth of PBS WARN Distribution of Wireless Emergency Alerts + - [Code for Shiny App](https://github.com/amsilvr/shiny_choropleth) + - [App running on shinyapps.ip](https://silverman.shinyapps.io/warn_wea/) - [Shiny app to simulate 401K growth with interactive plots](http://www.mephistosoftware.com/shiny/401k_simulator/) - [Shiny Video Tutorials Playlist on Youtube](http://www.youtube.com/playlist?list=PL6wLL_RojB5xNOhe2OTSd-DPkMLVY9DfB) - [Tutorial on writing Shiny simulation apps](https://github.com/homerhanumat/shinyTutorials) From 4cd7fc39364e92437e6833b0c97ffe3885ac283f Mon Sep 17 00:00:00 2001 From: Len Greski Date: Tue, 23 May 2017 19:18:31 -0400 Subject: [PATCH 36/45] Add capstone page. --- capstone.md | 12 ++++++++++++ 1 file changed, 12 insertions(+) create mode 100644 capstone.md diff --git a/capstone.md b/capstone.md new file mode 100644 index 00000000..57792306 --- /dev/null +++ b/capstone.md @@ -0,0 +1,12 @@ +--- +title: "Capstone" +permalink: /capstone/ +layout: page +--- +## Reference Material + +- [Speech and Language Processing, 3rd Edition](https://web.stanford.edu/~jurafsky/slp3/) Working version of Jurafsky, et. al. book on natural language processing whose content on n-grams is helpful for the capstone. + +## Course Project + +- [n-gram Computations and Computer Capacity](http://bit.ly/2couvxh) Explains the amount of memory required to convert the text files for the course project into n-grams, using the quanteda package. From cf0e2fe24800c5e32e3944b891b4ee1449aca4dd Mon Sep 17 00:00:00 2001 From: Len Greski Date: Sat, 27 May 2017 07:44:44 -0400 Subject: [PATCH 37/45] Add articles to capstone page --- capstone.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/capstone.md b/capstone.md index 57792306..6285e422 100644 --- a/capstone.md +++ b/capstone.md @@ -10,3 +10,5 @@ layout: page ## Course Project - [n-gram Computations and Computer Capacity](http://bit.ly/2couvxh) Explains the amount of memory required to convert the text files for the course project into n-grams, using the quanteda package. +- [Capstone Strategy](http://bit.ly/2rGcgc6) Describes a general strategy to get through the Capstone: use the simplest approaches possible. +- [Choosing a Text Analysis Package](http://bit.ly/2qagsPa) Reviews pros and cons of various R packages used for natural language processing, in the context of requirements for the Capstone project. From ff0aaf40ed3fc52b005ad5ed8379038eff6df746 Mon Sep 17 00:00:00 2001 From: Len Greski Date: Sat, 5 Aug 2017 08:53:26 -0400 Subject: [PATCH 38/45] Add article explaining why one cannot calculate the area under a specific point on the normal curve. --- statinf.md | 1 + 1 file changed, 1 insertion(+) diff --git a/statinf.md b/statinf.md index 5ea0baa6..a96df6cb 100644 --- a/statinf.md +++ b/statinf.md @@ -5,6 +5,7 @@ permalink: /statinf/ --- - [Why degrees of freedom decrease for sample variance](https://github.com/Manu58/bias/blob/master/bias.pdf) +[CONCEPTS: Calculating Area for a Point on the Normal Curve](http://bit.ly/2hw5AMF) Reviews the mathematics that explain why one cannot calculate the exact proability for a specific value within a distribution for a continuous variable, and illustrates how to calculate a quantile for a point on the curve. - [Analysis of exponential distribution of births data set from the CDC](https://gist.github.com/ProgramErgoSum/5316008387746fcd84de) - [Exponential Distribution / Central Limit Theorem - Assignment Checklist](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/statinf-expDistChecklist.md) - [ToothGrowth Analysis - Assignment Checklist](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/ToothGrowthChecklist.md) From eb0e0249711dad0980532d357524f5759da26da7 Mon Sep 17 00:00:00 2001 From: Len Greski Date: Sat, 19 Aug 2017 16:20:29 -0400 Subject: [PATCH 39/45] Add missing dash in bullet list --- statinf.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/statinf.md b/statinf.md index a96df6cb..19592a27 100644 --- a/statinf.md +++ b/statinf.md @@ -5,7 +5,7 @@ permalink: /statinf/ --- - [Why degrees of freedom decrease for sample variance](https://github.com/Manu58/bias/blob/master/bias.pdf) -[CONCEPTS: Calculating Area for a Point on the Normal Curve](http://bit.ly/2hw5AMF) Reviews the mathematics that explain why one cannot calculate the exact proability for a specific value within a distribution for a continuous variable, and illustrates how to calculate a quantile for a point on the curve. +- [CONCEPTS: Calculating Area for a Point on the Normal Curve](http://bit.ly/2hw5AMF) Reviews the mathematics that explain why one cannot calculate the exact proability for a specific value within a distribution for a continuous variable, and illustrates how to calculate a quantile for a point on the curve. - [Analysis of exponential distribution of births data set from the CDC](https://gist.github.com/ProgramErgoSum/5316008387746fcd84de) - [Exponential Distribution / Central Limit Theorem - Assignment Checklist](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/statinf-expDistChecklist.md) - [ToothGrowth Analysis - Assignment Checklist](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/ToothGrowthChecklist.md) From 253d6157bcc02966a0c533cb442c3acff33486e3 Mon Sep 17 00:00:00 2001 From: DocOfi Date: Sun, 21 Jan 2018 13:18:50 +0800 Subject: [PATCH 40/45] added link to my pdf file --- getclean.md | 1 + 1 file changed, 1 insertion(+) diff --git a/getclean.md b/getclean.md index 6076632d..deeccc56 100644 --- a/getclean.md +++ b/getclean.md @@ -20,6 +20,7 @@ permalink: /getclean/ - [Strategy for Reading Files & APIs / Quiz 2](http://bit.ly/2e4L5oF) - [Common Problems: Quiz 2 - sqldf() driver fails to connect](http://bit.ly/2kD2KTY) - [Tutorial: Downloading Files](http://bit.ly/2iP2suj) Illustrates various ways of downloading files, including binary and text files. +- [Creating dataframes from xml data](https://www.dropbox.com/s/7bbzzp4bwsmfl5y/CreatingDataframesfrom%20XmlFiles.odt?dl=0) ## Comprehensive Notes From 5549c1459adcd6bad1d7d81456e09956c0ca2095 Mon Sep 17 00:00:00 2001 From: DocOfi Date: Sun, 21 Jan 2018 13:31:09 +0800 Subject: [PATCH 41/45] Added a link to my presentation in Rpubs --- eda.md | 1 + 1 file changed, 1 insertion(+) diff --git a/eda.md b/eda.md index a6c52b13..8c14e435 100644 --- a/eda.md +++ b/eda.md @@ -8,6 +8,7 @@ permalink: /eda/ - [Analyzing Top/Green500 Supercomputer Technology Trends](http://github.com/ww44ss/Exascalar-Analysis-) - [Emissions Choropleth Maps](https://github.com/BillSeliger/ExData_Plotting2) - [Data Analysis using Twitter API and Python](http://blog.impiyush.com/2015/03/data-analysis-using-twitter-api-and.html) +- [Exploratory Data Analysis using Flexdashboard](http://rpubs.com/DocOfi/350830) ## Comprehensive Notes From 5a351e34a3068ca320b5759217ff45a9adacb5bf Mon Sep 17 00:00:00 2001 From: DocOfi Date: Sat, 27 Jan 2018 12:47:56 +0800 Subject: [PATCH 42/45] added name and link in about.md --- about.md | 1 + 1 file changed, 1 insertion(+) diff --git a/about.md b/about.md index aa0af257..37ecc9da 100644 --- a/about.md +++ b/about.md @@ -25,3 +25,4 @@ The [Data Science Specialization](https://www.coursera.org/specialization/jhudat - [stepds](https://github.com/stepds) - Bastiaan Quast - [Xing Su](http://sux13.github.io/DataScienceSpCourseNotes/) +- [Edmund julian Ofilada](https://github.com/DocOfi) From 64aba82b1ebef0cd07d241c83d8cede63f41f617 Mon Sep 17 00:00:00 2001 From: DocOfi Date: Sat, 27 Jan 2018 12:49:15 +0800 Subject: [PATCH 43/45] added link to metricsgraphics tutorial --- eda.md | 1 + 1 file changed, 1 insertion(+) diff --git a/eda.md b/eda.md index 8c14e435..1f56ac70 100644 --- a/eda.md +++ b/eda.md @@ -9,6 +9,7 @@ permalink: /eda/ - [Emissions Choropleth Maps](https://github.com/BillSeliger/ExData_Plotting2) - [Data Analysis using Twitter API and Python](http://blog.impiyush.com/2015/03/data-analysis-using-twitter-api-and.html) - [Exploratory Data Analysis using Flexdashboard](http://rpubs.com/DocOfi/350830) +- [Plotting using Metricsgraphics](http://www.rpubs.com/DocOfi/352947) ## Comprehensive Notes From 3a05c60dd2a70f1466b48ec6f0ee07655416fcb9 Mon Sep 17 00:00:00 2001 From: DocOfi Date: Fri, 23 Mar 2018 15:03:40 +0800 Subject: [PATCH 44/45] adding a leaflet plot example --- ddp.md | 1 + 1 file changed, 1 insertion(+) diff --git a/ddp.md b/ddp.md index c1a411fc..0af67104 100644 --- a/ddp.md +++ b/ddp.md @@ -18,6 +18,7 @@ permalink: /ddp/ - [Git pushing Shiny Apps with Docker/Dokku](http://www.rmining.net/2015/05/11/git-pushing-shiny-apps-with-docker-dokku/) - [Share your Shiny Apps with Docker and Kitematic](http://www.rmining.net/2015/08/10/share-your-shiny-apps-with-docker-and-kitematic/) - [Shinyapps.io: Configuring Application Timeout](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/dataProd-shinyTimeoutConfig.md) +- [Plotting Natural Disasters](http://www.rpubs.com/DocOfi/367052) ## Comprehensive Notes From c5413d29c8932f488923632477dc76ec40c400d2 Mon Sep 17 00:00:00 2001 From: Adhira <37569680+Adhira-Deogade@users.noreply.github.com> Date: Mon, 25 Mar 2019 01:56:01 -0400 Subject: [PATCH 45/45] Update curated.md Added medium website for ipython notebook --- curated.md | 1 + 1 file changed, 1 insertion(+) diff --git a/curated.md b/curated.md index 613c5f4e..8c806fd8 100644 --- a/curated.md +++ b/curated.md @@ -16,6 +16,7 @@ permalink: /curated/ - [Matrix rotation for image and contour plots in R](http://blog.snap.uaf.edu/2012/06/08/matrix-rotation-for-image-and-contour-plots-in-r/) - [Fig Data: 11 Tips on How to Handle Big Data in R (and 1 Bad Pun)](http://theodi.org/blog/fig-data-11-tips-how-handle-big-data-r-and-1-bad-pun) - [Data from 538](https://github.com/fivethirtyeight/data) +- [Getting started with python notebook](https://medium.com/@adhira_deo/the-environment-for-building-machine-learning-models-a1552116b355) ### Command Line