Notes from a (data) scientist

Connecting RStudio to GitHub

Git
GitHub
Reproducibility

Gitting serious about version control

Advanced data.table operations

data cleaning
data.table
dplyr
R

Things get querysome and querysome.

Introduction to data.table

data cleaning
data.table
dplyr
R

To data.table or dplyr? That is the question.

Volcano plots with ggplot2

data visualisation
ggplot2
tidyverse
R

Revising my grammar of graphics.

Cleaning free text and wrangling strings

data cleaning
regex
R

These are some common data cleaning things.

More articles »

Notes from a (data) scientist

Notes about R and Python programming and data science workflows.