The Laurier Institute for the Study of Public Opinion and Policy is hosting a series of workshops on reproducible research in the social sciences during the Winter Reading Week from February 22nd to the 25th. A mix of remote courses will be offered introducing undergraduate and graduate students, faculty and researchers to introductory and advanced statistical analysis and data visualization using R, RMarkdown and GitHub. Register for one of the following courses:
Students receive 20% off with promo code 'Student20'
Course Description
Introduction to R will cover everything needed to begin working with data in RStudio, with a focus on tidyverse style. The workshop will begin with a module to cover installation, basic syntax, and file types. Data types and structures in R will be explained and participants will learn how to manipulate data tables to summarise and present key information. To facilitate working with real data, the workshop will cover data import for a variety of file types, cleaning, and joining. Data visualization will be taught using ggplot, focusing on histograms, scatter plots, and bar charts. Finally, the workshop will teach how to use R to look for relationships between variables using chi-squared tests, t-tests, and linear regressions.
Prerequisites: Participants should install R and RStudio before the workshop, but no previous knowledge of R or programming is required.
Instructor Bio
Amy Farrow is currently a master’s student at the Faculty of Information at the University of Toronto, studying Human-Centred Data Science. Her previous degrees are in sociology and math. Currently, she works as a research assistant for the University of Toronto’s Data Sciences Institute and Ryerson University and volunteer with the Toronto Data Workshop and my faculty’s Accessibility Interests Working Group. Previously, she has worked as a data and policy analyst and a tutor.
Course Description
This workshop will showcase the data visualization capabilities of R with an emphasis on the expansive and highly customizable graphics that can be produced with the ggplot2 package. The session will cover commonly produced graphs to communicate scientific results (bar plots, histograms, scatter plots, cumulative incidence curves) and touch on additional packages for spatial visualisation and interactive plots. Workshop participants will have the opportunity to wrangle and prepare a data set for visualization, and thoughtful presentation and communication of data will be discussed.
Prerequisites: Attendees should have R and RStudio installed on their local machines and know-how of reading data into the working environment.
Instructor Bio
Julia Ma is a Data Scientist at Precision-Analytics, a Montreal-based consultancy. She has extensive experience supporting clinical and academic research, working with data that range from electronic medical records to provincial-level administrative data. Julia’s work spans observational and experimental studies including quality improvement initiatives, health economic analyses, and predictive modelling. She holds a Master’s degree in Epidemiology from the Dalla Lana School of Public Health, University of Toronto.
Course Description
This course will introduce participants to the use of GitHub as a tool for encouraging collaboration, archiving, knowledge mobilization and reproducibility. Topics for the course will include:
Instructor Bio
Rachael Lam (Details forthcoming)