The SSA proudly presents Text Analysis with Large Language Models by Dr Emi Tanaka.
Unstructured text data are often hard to process downstream by traditional statistical methods without processing it to a structured or standardised form. In this workshop, we discuss methods to process text data such as cleaning text entries, sentiment analysis, topic identification, topic modelling, and text summarisation. This workshop also serves as an introduction to large language models (LLMs) with demonstration of LLM-assisted text analysis. The workshop will help participants understand how LLMs can be integrated into existing workflows with hands-on experience. Practical applications will be demonstrated through R using OpenAI LLMs or local Ollama models. Participants will have access to all the slides and code used in the demonstration, so you can replicate the analyses and continue exploring LLMs on your own machines.
About the presenter:
Dr Emi Tanaka is an Applied Statistician and Deputy Director at the Biological Data Science Institute at the Australian National University. Her primary interest is developing impactful methods and tools practitioners can readily use. She delivers numerous statistical workshops including data visualisation, data wrangling, reproducible practices, statistical modelling and statistical consulting. She was the inaugural recipient of the SSA Distinguished Presenter's Award based on the delivery of her workshops.
Required:
- Proficient in R programming basics,
- Own laptop/desktop with admin access and Zoom,
- Stable internet access, and
- Installation of Ollama (https://ollama.com) OR a developer account on OpenAI (https://platform.openai.com/) with some small credit, say US$5 (see Billing under Your profile).
Desirable:
- Prior experience or knowledge of machine learning methods.
- Basics of text analysis.
Cancellation Policy:
Occasionally workshops have to be cancelled due to a lack of subscription. Early registration ensures that this will not happen.
Cancellations received prior to two weeks before the event will be refunded, minus the Stripe processing fee (1.75% + $0.30 per transaction) and an SSA administration fee of $20.
From then onward no part of the registration fee will be refunded. However, registrations are transferable within the same organisation. Please advise any changes to events@statsoc.org.au.
For any questions, please email events@statsoc.org.au.