Tutorium: Tidy{verse|models}: A modern approach to Data Science and Machine learning using R
Practitioners and researchers can agree on one thing: Knowing your way around data manipulation and statistical modeling is quintessential for succeeding in their respective fields. While many libraries enable working with data and developing statistical models, two stand out the most. The {tidyverse} and {tidymodels} libraries are collections of packages that are built on four unifying design principles:
- Human-centeredness
- Consistency
- Composability
- Inclusiveness
Sticking to these principles while coding sounds idealistic but is easy to achieve by using the framework brought to life by Hadley Wickham.
This workshop aims to introduce and apply popular packages from each collection that can benefit novice and experienced researchers or practitioners in their daily work.
The workshop consists of two parts:
- The first session starts with an introduction to popular data manipulation and exploration packages such as {dplyr}, {tidyr}, {lubridate}, and {ggplot2}. These tools are combined with the powerful statistical modeling libraries in the {tidymodels} collection to develop a simple regression and classification model. Here, the focus is on presenting best practices that align with the aforementioned design principles and showcasing methods that make life easier when developing models working with messy data.
- The second session is more focused on interactivity. The goal is for the audience to apply the tools introduced in the previous session to real-world data sets. By exploring different data sets and developing models independently, the participants will better understand the core principles behind the {tidy} philosophy. The participants themselves can choose the underlying data sets and models of choice to make sure that each participant can learn something independent of their current skill level.
Die Anmeldung zum Tutorium ist über das ConfTool während der Anmeldung zur Teilnahme an der Statistischen Woche 2025 möglich
Sprache: tba
Teilnahmegebühr: 30€
Mindestteilnehmendenzahl: 5
Maximalteilnehmendenzahl: tba
Für die Teilnahme am Tutorium ist das Mitbringen eines eigenen Laptops erforderlich.
Es gelten die allgemeinen Stornierungsbedingungen der Teilnahme an der Statistischen Woche 2025.