Training Lineup for 2025: January-June
All of our public training courses for the first half of 2025 are now open for registration! Head over to the public courses page on our website to book in and start building your programming skills in the new year! Below is a list of all of our upcoming courses with a description, bookable dates, course level and a link to the course webpage to find out more!
There is still time to book yourself on to the final public courses of 2024. We are running Reporting with Quarto and Advanced Machine Learning with Tidymodels, both on the 18 of November.
R Stats and Programming
Introduction to R
Course level: Foundation
Upcoming course dates: 15th January 2025 & 22nd April 2025
R is a versatile language for statistical computing and graphics. In this course you will learn the advantages of using R and how to get started. You will gain familiarity with the RStudio interface and learn the R basics. Also included is an introduction to the Tidyverse and how to use various packages for data storage, visualisation and manipulation. This course provides a great foundation to begin your R journey!
Data Wrangling in the Tidyverse
Course level: Foundation
Upcoming course dates: 22nd January 2025 & 29th April 2025
If you work with data, you probably spend a lot of time cleaning it and wrangling it into the correct shape. This course will show you how you can use R to efficiently clean and wrangle your data into a format that’s ready for analysis. You will learn about the Tidyverse, what tidy data really is, and how to practically achieve it with packages such as {dplyr}, {tidyr}, {lubridate} and {forcats}.
Programming with R
Course level: Intermediate
Upcoming course dates: 29th January 2025 & 20th May 2025
The benefit of using a programming language such as R is that we can automate repetitive tasks. This course covers the fundamental techniques such as functions, for loops and conditional expressions. By the end of this course, you will understand what these techniques are and when to use them. This is a one-day intensive course on R.
R Best Practices
Course level: Intermediate
Upcoming course dates: 12th February 2025
So you can write code? Great. But can you write code which is easy to read, simple to maintain, and reproducible? Under the pressure of deadlines even the best of us can fall victim to bad-practices. In this course we motivate the importance of good-practices, and show how we can make best practices second nature by incorporating them into our normal workflow.
Data Visualisation with ggplot2
Course level: Intermediate
Upcoming course dates: 5th February 2025 & 10th June 2025
Want to learn how to effectively visualise your data in R using the elegant {ggplot2} package? With {ggplot2} it’s easy to customise everything from plot layouts and themes to scales, colours, and more! This course will comprehensively take you through basic plot types such as bar and line charts as well as cover more advanced topics such as interactive graphics with {plotly}.
Statistical Modelling with R
Course level: Intermediate
Upcoming course dates: 26th February 2025 & 3rd June 2025
From the very beginning, R was designed for statistical modelling. Out of the box, R makes standard statistical techniques easy. This course covers the fundamental modelling techniques. We begin the day by revising hypotheses tests, before moving onto ANOVA tables and regression analysis. The class ends by looking at more sophisticated methods such as clustering and principal components analysis (PCA).
Machine Learning and Bayesian Techniques
Machine Learning with Tidymodels
Course level: Intermediate
Upcoming course dates: 4th March 2025 & 17th June 2025
Machine learning is the process of applying statistical techniques to gain systematic information about a quantity of interest. We will be specifically focusing on how we can use the {tidymodels} suite of packages to implement these techniques. We cover key reasons for model fitting, such as prediction and inference, on quantitative and qualitative responses.
Advanced Machine Learning with Tidymodels
Course level: Advanced
Upcoming course dates: 18th March 2025 & 24th June 2025
A course that builds on the material covered in our Machine Learning with Tidymodels course. We take a look at how we can fit linear discriminant analysis (LDA) models using {discrim}, assessing model reliability using V-fold cross validation, pre-processing, tree-based models & more. If you wish to explore the abundance of model fitting techniques {tidymodels} has to offer, then this course is certainly for you!
Introduction to Bayesian Inference using RStan
Course level: Intermediate
Upcoming course dates: 13th January 2025
Despite the promise of big data, inferences are often limited by its systematic structure. Only by carefully modelling this structure can we take full advantage of the data. Stan is a platform for facilitating this modelling, providing an expressive modelling language to implement state-of-the-art algorithms, to draw subsequent Bayesian inferences. This course will teach participants how to interface with Stan through R!
Automatic Reporting
Reporting with Quarto
Course level: Intermediate
Upcoming course dates: 25th March 2025 & 24th June 2025
Do you create interactive documents that always need to be updated when the data changes? Then this course is for you. In this course you will learn how to use Quarto to create high quality, dynamic, fully reproducible documents. Quarto is a multi-language open source publishing tool that allows for the creation of dynamic content with Python, R, Julia and Observable.
Python
Introduction to Python
Course level: Foundation
Upcoming course dates: 26th February 2025 & 13th May 2025
Python is a general-purpose programming language popular among data scientists and statisticians. In this one-day introductory course, participants will learn to import, summarise and visualise their data. At each step, we avoid using “magic code”, and stress the importance of understanding what Python is doing.
Programming with Python
Course level: Intermediate
Upcoming course dates: 4th March 2025 & 3rd June 2025
The benefit of using a programming language such as Python is that we can automate repetitive tasks. This course covers the fundamental techniques such as functions, for loops and conditional expressions. By the end of this course, you will understand what these techniques are and how they can be applied to solve real-world data wrangling tasks.
Data Visualisation with Python
Course level: Intermediate
Upcoming course dates: 18th March 2025 & 17th June 2025
Python has a number of packages for the effective creation of graphics to communicate your data insights. This course will examine two popular libraries for creating static 2D plots: Matplotlib and Seaborn. During the training session, we’ll cover plotting basics and customisation of figures with Matplotlib, before moving onto complex statistical visualisations with Seaborn.
SQL
Introduction to SQL
Course level: Foundation
Upcoming course dates: 12th February 2025
The Structured Query Language (SQL) defines a standard for communicating with a relational database. In this half-day introductory course, participants will learn the basic SQL syntax for data extraction, filtering and insertion. We will then discuss some considerations for working with databases on the cloud, and finish by learning basic techniques for joining tables.
The course can be taken either independently or as a precursor to our Intro to SQL with R and Intro to SQL with Python courses (see below).
An Introduction to SQL with R
Course level: Intermediate
Upcoming course dates: 15th April 2025
Using databases is a fundamental part of a data scientist’s role. The main focus of this training course is to introduce SQL databases, write your first SQL queries, and show how R can be used to retrieve and manipulate data stored in a relational database. The course uses both the {DBI} and {dbplyr} packages.
We use the PostgreSQL database as an example for public courses. For in-house training, we are happy to adapt the course to match your database requirements.
Introduction to SQL with Python
Course level: Intermediate
Upcoming course dates: 15th April 2025
Using databases is a fundamental part of a data scientist’s role. This training course introduces SQL databases and the SQL command syntax, and shows how Python can be used to retrieve and manipulate data held in a relational database. The course also discusses how SQLAlchemy can be used to define and interact with databases using object-oriented Python code.
We use a PostgreSQL database as an example, and communicate with this using a psycopg2 connection.
So what now?
If you’re interested in attending any of our public courses, then you can head straight over to the public booking page! If you’re looking for training for your team, or maybe even something a bit more bespoke, then get in touch and we’ll see what we can do! All of our training courses (including courses not mentioned above) can be found in our course catalogue.