shinybrightstar
diff --git a/‎README.md‎
Lines changed: 22 additions & 42 deletions b/‎README.md‎
Lines changed: 22 additions & 42 deletions
diff --git a/‎tutorials/dc-cleaning_strings/dc-cleaning_strings-dataset_generation_script.R‎
Lines changed: 80 additions & 0 deletions b/‎tutorials/dc-cleaning_strings/dc-cleaning_strings-dataset_generation_script.R‎
Lines changed: 80 additions & 0 deletions
@@ -1,55 +1,26 @@
 # R tips  
-
-![](https://img.shields.io/badge/Language-R-blue) ![](https://img.shields.io/badge/Theory-Statistics-orange) 
-
-This repository contains R programming tips covering topics across data cleaning, data visualisation, machine learning, statistical theory and data productionisation.  
-
-<p align="center">  
-<img src="https://github.com/erikaduan/r_tips/blob/master/figures/r_milestones.jpg"
-width="600"></center>  
-</p>  
-
-Many kudos to [Dr Chuanxin Liu](https://github.com/codetrainee), my former PhD student and code editor, for teaching me how to code in R in my past life as an immunologist.  
-
-
-# Content summary
-
-| Legend | Category |  
-|--------|----------|  
-| 📚 | Data cleaning |  
-| 🎨 | Data visualisation |  
-| 🔮 | Machine learning |  
-| 🔨 | Productionisation |  
-| 🔢 | Statistical theory |  
-
-
-# Tutorials  
+ 
 ## 🎨 Data visualisation  
-
 + [An introduction to `ggplot2` using volcano plots](https://github.com/erikaduan/r_tips/blob/master/tutorials/dv-volcano_plots_with_ggplot/dv-volcano_plots_with_ggplot.md) (Updated)  
 + [Using `DiagrammeR` to draw flow charts](https://github.com/erikaduan/r_tips/blob/master/tutorials/dv-using_diagrammer/dv-using_diagrammer.md) (Updated)  
 
 ## 📚 Data cleaning
-
 + [Data cleaning using `data.table` or `tidyverse` (or Python `Pandas`)](https://github.com/erikaduan/r_tips/blob/master/tutorials/dc-data_table_vs_dplyr/dc-data_table_vs_dplyr.md) (Updated)    
-+ [Cleaning strings with regular expressions using `stringr`](https://github.com/erikaduan/r_tips/blob/master/tutorials/dc-cleaning_strings/dc-cleaning_strings.md) (Updated)          
++ [Cleaning strings using regular expressions with base R or `stringr`](https://github.com/erikaduan/r_tips/blob/master/tutorials/dc-cleaning_strings/dc-cleaning_strings.md) (Updated)          
 
 ## 🔨 Productionisation  
 + [Creating SQL <> R workflows - Part 1](https://github.com/erikaduan/r_tips/blob/master/tutorials/p-sql_to_r_workflows/p-sql_to_r_workflows_part_1.md) (Updated)  
 + [Creating SQL <> R workflows - Part 2](https://github.com/erikaduan/r_tips/blob/master/tutorials/p-sql_to_r_workflows/p-sql_to_r_workflows_part_2.md) (Updated)  
 + [Automating R Markdown report generation - Part 1](https://github.com/erikaduan/r_tips/blob/master/tutorials/p-automating_rmd_reports/p-automating_rmd_reports_part_1.md) (Updated)  
-+ [Automating R Markdown report generation - Part 2](https://github.com/erikaduan/r_tips/blob/master/tutorials/p-automating_rmd_reports/p-automating_rmd_reports_part_2.md) (updated)  
-
-## 🔮 Machine learning   
-+ [Working with dummy variables and factors](https://github.com/erikaduan/r_tips/blob/master/tutorials/2020-04-23_dummy-variables-and-factors/2020-04-23_dummy-variables-and-factors.md)  
++ [Automating R Markdown report generation - Part 2](https://github.com/erikaduan/r_tips/blob/master/tutorials/p-automating_rmd_reports/p-automating_rmd_reports_part_2.md) (updated)   
 
-## 🔢 Statistical theory   
+## 🔢 Statistical modelling   
 + [Introduction to expectation and variance](https://github.com/erikaduan/r_tips/blob/master/tutorials/st-expectations_and_variance/st-expectation_and_variance.md)  
 + [Beyond expectations: centrality measures in statistics](https://github.com/erikaduan/r_tips/blob/master/tutorials/2020-07-26_many-roads-to-the-middle/2020-07-26_many-roads-to-the-middle.md)  
-+ [Introduction to the normal distribution](https://github.com/erikaduan/r_tips/blob/master/tutorials/st-normal_distribution/st-normal_distribution.md)  
-+ [Introduction to the Chi-squared and F distribution](https://github.com/erikaduan/r_tips/blob/master/tutorials/st-chi_squared_and_f_distributions/st-chi_squared_and_f_distributions.md)  
-+ [Introduction to binomial distributions](https://github.com/erikaduan/R_tips/blob/master/tutorials/2020-09-12_binomial_distribution/2020-09-12_binomial-distribution.md)  
-+ [Introduction to hypergeometric, geometric, negative binomial and multinomial distributions](https://github.com/erikaduan/R_tips/blob/master/tutorials/2020-09-22_hypergeometric-and-other-discrete-distributions/2020-09-22_hypergeometric-and-other-discrete-distributions.md)  
+
+
+## 🔮 Machine learning   
++ [Working with dummy variables and factors](https://github.com/erikaduan/r_tips/blob/master/tutorials/2020-04-23_dummy-variables-and-factors/2020-04-23_dummy-variables-and-factors.md) 
 
 
 # Other resources 
@@ -61,9 +32,9 @@ The resources below also cover a comprehensive range of practical R tutorials.
 
 # Tutorial style guide  
 
-A painful form of technical debt is inconsistent code style. This repository now contains the following file naming and code style rules.  
+This repository now contains the following file naming and code style rules.  
 
-+ Folders are no longer ordered with a numerical prefix and names are no longer case sensitive e.e.g `r_tips\tutorials\...` and `r_tips\figures\...`    
++ Folders are no longer ordered with a numerical prefix and names are no longer case sensitive e.g `r_tips\tutorials\...` and `r_tips\figures\...`    
 + Tutorial subtopics share the same prefix e.g. `r_tips\tutorials\dv-...` and   `r_tips\tutorials\st-...`  
 + File names contain `-` to separate file name prefixes and `_` instead of other white space e.g. `r_tips\figures\dv-using_diagrammer-simple_flowchart.svg`  
 + Comments are styled according to the [tidyverse style guide](https://style.tidyverse.org/functions.html?q=comments#comments-1):    
@@ -73,9 +44,9 @@ A painful form of technical debt is inconsistent code style. This repository now
   + Comments should not be followed by a blank line, unless the comment is a stand-alone paragraph containing in-depth rationale or an alternative solution  
 + R code chunks are styled as follows:  
   + Each R chunk should be named with a short unique description written in the active voice e.g. `create basic plot` and `modify plot labels`    
-  + Arguments inside code chunks should not contain white space and boolean argument options should be written in capitals e.g. `{r load libraries, message=FALSE, warning = FALSE}`   
+  + Arguments inside code chunks should not contain white space and boolean argument options should be written in capitals e.g. `{r load libraries, message=FALSE, warning=FALSE}`   
   + To render the github document, results are generally suppressed using `results='hide'` and manually entered in a new line beneath the code.  
-  + To render the github document, figures are generally outputed using `fig.show='hold'` and figure outputs can then be suppressed at the local chunk level using `fig.show='hide'`  
+  + To render the github document, figures are generally outputed using `fig.show='markdown'` and figure outputs can then be suppressed at the local chunk level using `fig.show='hide'`  
 + Set a margin of 80 characters length in RStudio through `Tools\Global options --> Code --> Display --> Show margin` and use this margin as the cut-off for code and comments length   
 
 # Citations  
@@ -88,4 +59,13 @@ Citing packages is a good practice when you are publishing research papers. To d
   1686, https://doi.org/10.21105/joss.01686
 + H. Wickham. `ggplot2`: Elegant Graphics for Data Analysis. Springer-Verlag New York, 2016.
 + Matt Dowle and Arun Srinivasan (2021). `data.table`: Extension of `data.frame`. R package
-  version 1.14.2. https://CRAN.R-project.org/package=data.table
+  version 1.14.2. https://CRAN.R-project.org/package=data.table  
+
+# Acknowledgements  
+
+Many kudos to [Dr Chuanxin Liu](https://github.com/codetrainee), my former PhD student and code editor, for teaching me how to code in R in my past life as an immunologist.   
+
+  <p align="center">  
+<img src="https://github.com/erikaduan/r_tips/blob/master/figures/r_milestones.jpg"
+width="600"></center>  
+</p>  
@@ -0,0 +1,80 @@
+# Load required packages -------------------------------------------------------
+if (!require("pacman")) install.packages("pacman")
+pacman::p_load(dplyr,
+               purrr) 
+
+# Create a manual list of survey responses -------------------------------------
+# Each list contains a vector containing 2 atomic elements (rating and comment)
+survey_list <- list(
+  expert_1 = c(
+    8,
+    '<textarea name="comment" form="1"> &lt;Grade A beans.&gt; Easily melts. 
+    Smooth chocolate shell, with a crunchy malty filling, and not so sweet <p> I
+    enjoyed this. </textarea>'
+    ), 
+  expert_2 = c(
+    7, 
+    '<textarea name="comment" form="1"> &lt;Grade A beans with subtle caramel 
+    hints.&gt; Melts well. Smooth exterior. Glossy coating. Malt-filled core may
+    be too sweet for some. </textarea>'
+    ),  
+  expert_3 = c(
+    8,
+    '<textarea name="comment" form="1"> &lt;Grade A beans.&gt; <p> Caramel and 
+    vanilla undertones complement the bitter dark chocolate - low sugar content 
+    and smooth chocolate shell. <p> Recommended. </textarea>'
+    ),  
+  expert_4 = c(
+    10, '<textarea name="comment" form="1"> &lt;Grade A cocoa beans.&gt; Melts 
+    easily. Smooth dark chocolate contrasts nicely against the crunchy malty 
+    filling. </textarea>'
+    ),  
+  expert_5 = c(
+    7,
+    '<textarea name="comment" form="1"> &lt;Grade A beans,&gt; likely of Ecuador
+    origin. Smooth dark chocolate coating. Malt filling ratio could be 
+    decreased. Easy to eat. </textarea>'
+    ),  
+  fan_1 = c(
+    9, 
+    '<textarea name="comment" form="1"> Delicious and melts in your mouth. The 
+    malt crunch is a nice touch <p> Would recommend. </textarea>'),  
+  fan_2 = c(
+    10,
+    '<textarea name="comment" form="1"> Smooth dark chocolate shell likely made 
+    from grade A beans. Has some nice crunch. <p> This is definiely one of my 
+    new favourites! </textarea>'
+    ),  
+  fan_3 = c(
+    8,
+    '<textarea name="comment" form="1"> Tastes great. Smooth and tasty 
+    chocolate. <p> Highly recommended. </textarea>'),  
+  fan_4 = c(
+    10,
+    '<textarea name="comment" form="1"> This will be one of my new favourites. 
+    Love the malty interior! </textarea>'
+    ),  
+  fan_5 = c(
+    9, 
+    '<textarea name="comment" form="1"> Ive loved Haighs since I was a kid! 
+    Love the caramels the most! </textarea>'
+    ),  
+  fan_6 = c(
+    9, 
+    '<textarea name="comment" form="1"> Delicious :)!!! </textarea>')
+)  
+
+# Convert list into tidy data frame --------------------------------------------
+# t() produces a nested data frame where every column contains a matrix array
+survey <- survey_list %>% 
+  map_df(~ as_tibble(t(.x), .name_repair = "unique")) %>%
+  mutate(respondee = names(survey_list)) %>%
+  rename("rating" = "...1",
+         "comment_field" = "...2") %>%
+  select(respondee, everything())  
+
+# Clean global environment by removing redundant objects ----------------------- 
+rm(list = setdiff(ls(), "survey"))
+
+# Print output -----------------------------------------------------------------
+cat("PASS: loaded survey into global R environment\n")