You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+6-24Lines changed: 6 additions & 24 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -50,6 +50,12 @@ Many kudos to [Dr Chuanxin Liu](https://github.com/codetrainee), my former PhD s
50
50
+[Introduction to hypergeometric, geometric, negative binomial and multinomial distributions](https://github.com/erikaduan/R_tips/blob/master/tutorials/2020-09-22_hypergeometric-and-other-discrete-distributions/2020-09-22_hypergeometric-and-other-discrete-distributions.md)
51
51
52
52
53
+
# Other resources
54
+
These resources also cover a comprehensive range of practical R usage tutorials.
55
+
56
+
+[Statistical Computing](https://36-750.github.io/) by Alex Reinhart and Christopher Genovese
57
+
+[Data Science Toolkit](https://benkeser.github.io/info550/lectures/) by David Benkeser
58
+
53
59
# Tutorial style guide
54
60
55
61
A painful form of technical debt is inconsistent code style. This repository now contains the following file naming and code style rules.
@@ -81,31 +87,7 @@ A painful form of technical debt is inconsistent code style. This repository now
81
87
version 1.4.0.
82
88
https://CRAN.R-project.org/package=stringr
83
89
84
-
+ Max Kuhn. (2019). `caret`: Classification and Regression
85
-
Training. R package version 6.0-84. https://CRAN.R-project.org/package=caret
86
-
+ Contributions from Jed Wing, Steve Weston, Andre Williams, Chris Keefer, Allan Engelhardt, Tony
87
-
Cooper, Zachary Mayer, Brenton Kenkel, the R Core Team, Michael Benesty, Reynald Lescarbeau, Andrew Ziem,
88
-
Luca Scrucca, Yuan Tang, Can Candan and Tyler Hunt.
89
-
90
-
+ Jacob Kaplan (2020). `fastDummies`: Fast Creation of Dummy (Binary) Columns and Rows from Categorical
91
-
Variables. R package version 1.6.1. https://CRAN.R-project.org/package=fastDummies
92
-
93
90
+ Kirill Müller (2017). `here`: A Simpler Way to Find Your Files. R package version 0.1.
94
91
https://CRAN.R-project.org/package=here
95
92
96
-
+ Paul Murrell (2015). `compare`: Comparing Objects for Differences. R package version 0.2-6.
97
-
https://CRAN.R-project.org/package=compare
98
-
99
-
+ A. Liaw and M. Wiener (2002). Classification and Regression by `randomForest`. R News 2(3), 18--22.
if (!require("pacman")) install.packages("pacman")
17
18
pacman::p_load(here,
18
-
ids, # for generating random ids
19
+
ids, # Generate random ids
19
20
tidyverse,
20
21
data.table,
21
-
compare, # compare between data frames
22
+
compare, # Compare between data frames
22
23
microbenchmark)
23
24
```
24
25
25
26
26
27
# Introduction
27
28
28
-
One of the great benefits of following Rstats conversations on Twitter is its access to user insights. I became curious about `data.table` after reading conversations about its superior performance yet decreased visibility compared to `tidyverse`.
29
+
I became curious about `data.table` after reading Twitter conversations about its superior performance yet decreased visibility compared to `tidyverse`. Because
30
+
29
31
30
32
Fast forward a few years and the [data processing efficiency](https://h2oai.github.io/db-benchmark/) of `data.table` has become extremely handy:
31
33
@@ -960,6 +962,8 @@ In contrast, `data.table` is efficient because it contains a very fast ordering
+ The definitive [stack overflow discussion](https://stackoverflow.com/questions/21435339/data-table-vs-dplyr-can-one-do-something-well-the-other-cant-or-does-poorly/27840349#27840349) about the best use cases for data.table versus dplyr (from tidyverse).
964
968
965
969
+ A great side by side comparison of data.table versus dplyr operations by [Atrebas](https://atrebas.github.io/post/2019-03-03-datatable-dplyr/).
@@ -974,3 +978,7 @@ In contrast, `data.table` is efficient because it contains a very fast ordering
974
978
Robin Lovelace](https://csgillespie.github.io/efficientR/data-processing-with-data-table.html).
975
979
976
980
+ A more detailed explanation of the usage of binary search based subset in `data.table` by [Arun Srinivasan](https://gist.github.com/arunsrinivasan/dacb9d1cac301de8d9ff).
Copy file name to clipboardExpand all lines: tutorials/p-automating_rmd_reports/p-automating_rmd_reports_part_2.Rmd
+2Lines changed: 2 additions & 0 deletions
Original file line number
Diff line number
Diff line change
@@ -314,3 +314,5 @@ jobs:
314
314
+ A [YouTube tutorial](https://www.youtube.com/watch?v=NwUijrm2U2w) by DVC on using GitHub Actions with R to automate data visualisation tasks.
315
315
+ A useful (online resource](https://explainshell.com/) for explaining shell commands required to create components of the GitHub Actions YAML workflow.
0 commit comments