The Statistical Society of Australia warmly invites you to a workshop on
Data Wrangling with R, taught by Dr. Emi Tanaka
to be held over two sessions, 1:30 - 5:00pm (AEDT) on 1-2 December 2020.
About the workshop:
Data wrangling is one of the first key steps necessary before downstream analysis such as visualization or modelling. This workshop will teach you how to wrangle data in the statistical language R using the tidyverse suite of packages, i.e. dplyr, tidyr, stringr, lubridate and forcats. This will include learning about the concept of tidy data and learning the new verbs in dplyr v1.0.0 released early this year. The workshop will be hands-on with plenty of practical examples and time for participants to work through exercises to put what they learnt into practice.
About the presenter:
Emi Tanaka is a Lecturer in Statistics at Monash University and the Vice President of the SSA Victorian Branch. Her research interest is in producing useful statistical tools for practitioners, motivated primarily by applications in bioinformatics and agriculture. She is an experienced and enthusiastic R user and instructor, and regularly teaches university courses and workshops to the broader community on a variety of R-related workshops.
Target audience:
The workshop is suitable for those who know R but are not familiar or comfortable with using the tidyverse suite of R packages to do data wrangling.
Learning objectives:
· Transform messy data into tidy data using various R packages
· Learn to pivot data from longer to wider format and vice versa using the tidyr R package
· Complex data wrangling with the dplyr R package
· All about factors and how to manipulate it easily using the forcats R package
· Dealing with dates using the lubridate R package
· Manipulating characters with the stringr R package
Requirements:
· Basic R knowledge (e.g. you have used R to load data, create simple visualisations, perform basic analyses and write simple functions or more specifically, you are familiar with concepts in Cookbook for R by Winston Chang)
· Basic statistics (e.g. simple linear regression, hypothesis testing, basic summary statistics and plots)
· Computer (with ability to install R and R-packages), microphone and web camera
· Stable internet connection
· Install the video conferencing software, Zoom and know how to use Zoom
Desirable:
· Know about tidy data
· Some familiarity with tidyverse or ggplot2
· Know about regular expressions
Timetable
Please note times are in AEDT.
Day 1 (1 December 2020)
1:30pm – 3.00pm (1.5 hours)
|
Session 1
|
3.00pm – 3.30pm
|
Break / networking over virtual afternoon tea
|
3.30pm – 5:00pm (1.5 hours)
|
Session 2
|
12:30pm
|
End of first day
|
Day 2 (2 December 2020)
1.30pm – 3.00pm (1.5 hours)
|
Session 1
|
3.00pm – 3:30pm
|
Break / networking over virtual afternoon tea
|
3:30pm – 5:00pm (1.5 hours)
|
Session 2
|
5:00pm
|
End workshop
|
Expenses
Occasionally workshops have to be cancelled due to a lack of subscription. Early registration ensures that this will not happen. Please note that the Society will not be held responsible for any financial loss incurred due to a workshop cancellation.
Cancellation Policy
Cancellations received prior to 6 Nov 2020 will be refunded, minus a $20 administration fee. From then onwards no part of the registration fee will be refunded. However, registrations are transferable within the same organisation. Please advise any changes to eo@statsoc.org.au.
For more details and to register, click here.