CPD126 - Workshop: Data wrangling with R

  • 2 sessions
  • 1 Dec 2020, 1:30 PM – 5:00 PM (AEDT)
  • 2 Dec 2020, 1:30 PM – 5:00 PM (AEDT)
  • Online, times are in AEDT

Registration

  • Discounted registration for an SSA member. Available until 2 Nov.
  • Discounted price. Available until 2 Nov.
  • Discounted registration for an SSA student member. Available until 2 Nov.

Registration is closed

The Statistical Society of Australia warmly invites you to a workshop on Data Wrangling with R, taught by Dr. Emi Tanaka.

About the workshop:

Data wrangling is one of the first key steps before downstream analysis such as visualisation or modelling. This workshop will teach you how to wrangle data in the statistical language R using packages from the tidyverse suite, specifically dplyr, tidyr, stringr, lubridate and forcats. It covers the concept of tidy data and the new verbs introduced in dplyr v1.0.0, released earlier this year. The workshop will be hands-on, with plenty of practical examples and time for participants to work through exercises and put what they have learnt into practice.

About the presenter:

Emi Tanaka is a Lecturer in Statistics at Monash University and the Vice President of the SSA Victorian Branch. Her research interest is in producing useful statistical tools for practitioners, motivated primarily by applications in bioinformatics and agriculture. She is an experienced and enthusiastic R user and instructor, and regularly teaches university courses and workshops for the broader community on a variety of R-related topics.

Target audience:

The workshop is suitable for those who know R but are not familiar or comfortable with using the tidyverse suite of R packages to do data wrangling.

Learning objectives:

  • Transform messy data into tidy data using various R packages
  • Pivot data from longer to wider format and vice versa using the tidyr R package
  • Perform complex data wrangling with the dplyr R package
  • Understand factors and manipulate them easily using the forcats R package
  • Deal with dates using the lubridate R package
  • Manipulate character strings with the stringr R package

Requirements:

  • Basic R knowledge (e.g. you have used R to load data, create simple visualisations, perform basic analyses and write simple functions; more specifically, you are familiar with the concepts in Cookbook for R by Winston Chang)
  • Basic statistics (e.g. simple linear regression, hypothesis testing, basic summary statistics and plots)
  • Computer (with the ability to install R and R packages), microphone and web camera
  • Stable internet connection
  • Zoom video conferencing software installed, and knowledge of how to use it

Desirable:

  • Know about tidy data
  • Some familiarity with tidyverse or ggplot2
  • Know about regular expressions

Timetable

Please note times are in AEDT.

Day 1

1:30pm – 3:00pm (1.5 hours): Session 1

3:00pm – 3:30pm: Break / networking over virtual afternoon tea

3:30pm – 5:00pm (1.5 hours): Session 2

5:00pm: End of first day

Day 2

1:30pm – 3:00pm (1.5 hours): Session 1

3:00pm – 3:30pm: Break / networking over virtual afternoon tea

3:30pm – 5:00pm (1.5 hours): Session 2

5:00pm: End of workshop


Expenses

Occasionally workshops have to be cancelled because too few people register. Registering early helps prevent this. Please note that the Society will not be held responsible for any financial loss incurred due to a workshop cancellation.

Cancellation Policy

Registrations cancelled before 6 Nov 2020 will be refunded, minus a $20 administration fee. From that date onwards, no part of the registration fee will be refunded. However, registrations are transferable within the same organisation. Please send any changes to eo@statsoc.org.au.

