This repository contains a series of R usage tips covering topics across data cleaning, visualisation and machine learning. The creation of this repository was precipitated by a call-out for RLadies-Melbourne lightning talks in 2018.
Many kudos to Dr Chuanxin Liu, my former PhD student and occasional code editor, for teaching me how to learn to code in R in my past life as an immunologist.
- How to draw a volcano plot using ggplot2 🎨
- You can use data.table or tidyverse! 📚
- Working with dummy variables and factors 📚🔮
- Untangling strings using regular expressions 📚
- Using DiagrammeR to draw flow charts 🎨
- Understanding centrality measures in statistics 🔢
- Topic modelling 📚🔮
- Automating R Markdown report generation - Part 1 📚🎨🔨
- Automating R Markdown report generation - Part 2 📚🔨
- Introduction to binomial distributions 🔢🎨
- Introduction to hypergeometric, geometric, negative binomial and multinomial distributions 🔢🎨
Legend | Category |
---|---|
📚 | Data wrangling |
🎨 | Data visualisation |
🔮 | Machine learning |
🔨 | Production |
🔢 | Statistics |
-
Hadley Wickham (2017).
tidyverse
: Easily Install and Load the 'Tidyverse'. R package version 1.2.1. https://CRAN.R-project.org/package=tidyverse -
Matt Dowle and Arun Srinivasan (2019). data.table: Extension of
data.frame
. R package version 1.12.6. https://CRAN.R-project.org/package=data.table -
Hadley Wickham (2019).
stringr
: Simple, Consistent Wrappers for Common String Operations. R package version 1.4.0. https://CRAN.R-project.org/package=stringr -
Max Kuhn. (2019).
caret
: Classification and Regression Training. R package version 6.0-84. https://CRAN.R-project.org/package=caret- Contributions from Jed Wing, Steve Weston, Andre Williams, Chris Keefer, Allan Engelhardt, Tony Cooper, Zachary Mayer, Brenton Kenkel, the R Core Team, Michael Benesty, Reynald Lescarbeau, Andrew Ziem, Luca Scrucca, Yuan Tang, Can Candan and Tyler Hunt.
-
Jacob Kaplan (2020).
fastDummies
: Fast Creation of Dummy (Binary) Columns and Rows from Categorical Variables. R package version 1.6.1. https://CRAN.R-project.org/package=fastDummies -
Kirill Müller (2017).
here
: A Simpler Way to Find Your Files. R package version 0.1. https://CRAN.R-project.org/package=here -
Paul Murrell (2015).
compare
: Comparing Objects for Differences. R package version 0.2-6. https://CRAN.R-project.org/package=compare -
A. Liaw and M. Wiener (2002). Classification and Regression by
randomForest
. R News 2(3), 18--22. -
Tianqi Chen, Tong He, Michael Benesty, Vadim Khotilovich, Yuan Tang, Hyunsu Cho, Kailong Chen, Rory Mitchell, Ignacio Cano, Tianyi Zhou, Mu Li, Junyuan Xie, Min Lin, Yifeng Geng and Yutian Li (2020).
xgboost
: Extreme Gradient Boosting. R package version 1.0.0.2. https://CRAN.R-project.org/package=xgboost -
Alexandros Karatzoglou, Alex Smola, Kurt Hornik, Achim Zeileis (2004).
kernlab
- An S4 Package for Kernel Methods in R. Journal of Statistical Software 11(9), 1-20. URL http://www.jstatsoft.org/v11/i09/ -
Microsoft Corporation and Steve Weston (2019).
doParallel
: Foreach Parallel Adaptor for theparallel
Package. R package version 1.0.15. https://CRAN.R-project.org/package=doParallel -
Richard Iannone (2020).
DiagrammeR
: Graph/Network Visualization. R package version 1.0.6.1. https://CRAN.R-project.org/package=DiagrammeR