Data science tutorials

Data preprocessing and wrangling, tutorials, statistical analyses, modeling, and so on

Hause Lin (University of Toronto)
Jan 12, 2019

What is this site about?

This site contains introduction tutorials as well as relatively advanced data analysis tips/tricks I have used to analyze data. Occasionally, I’ll also explain statistical procedures and concepts with code. I have used these tutorials and articles to teach data science in graduate school. In my tutorials and articles, I usually try my best to explain how things work.

To learn more about me, visit my personal site.

Consider being a patron and supporting my work?

Donate and become a patron: If you find value in what I do and have learned something from my site, please consider becoming a patron. It takes me many hours to research, learn, and put together tutorials. Your support really matters.

Where to begin?

If you’re new to R, start with the tutorials. If you’re looking for tips/tricks to improve your existing R workflow or explanations of statistical concepts, check out my articles.

Questions or want to get in touch?

If you have any questions or comments, email me or create a new issue here or follow me on Twitter.

More resources?


If you see mistakes or want to suggest changes, please create an issue on the source repository.


Text and figures are licensed under Creative Commons Attribution CC BY 4.0. Source code is available at, unless otherwise noted. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".


For attribution, please cite this work as

Lin (2019, Jan. 12). Data science: Data science tutorials. Retrieved from

BibTeX citation

  author = {Lin, Hause},
  title = {Data science: Data science tutorials},
  url = {},
  year = {2019}