The segmentation approach used in this paper employs a Latent Class Analysis (LCA) methodology to identify sub-groups within the adult population that share similar characteristics. Similar to clustering methods, LCA identifies mutually exclusive and exhaustive latent (or unobserved) classes based on patterns in observed data. LCA estimates class membership probabilities and uses iterative numerical methods to determine the model that best fits the data, based on a statistical criterion.
LCA is widely used applied across disciplines to classify behaviour patterns, attitudes, and preferences. It has been used, for example, to identify consumer preferences (Dolšak, Hrovatin and Zorić, 2020[16]) and to analyse subpopulations based on responses to survey or test items (Bertrand and Hafner, 2013[17]). In the field of education, LCA has been employed to explore educational choices and patterns of learning engagement, particularly within formal education settings. For instance, Denson and Ing (2014[18]) used LCA to classify freshmen according to their ability to work effectively with others of diverse backgrounds, being open to new ideas and different perspectives and being empathetic with other perspectives. This helped higher education institutions target diversity-related interventions. Similarly, Ramesh et al (2014[19]) applied LCA to model and understand student engagement in online courses, using behavioural indicators to predict course completion and identify key survival factors associated with learner persistence.
However, the use of LCA in learning engagement outside formal education settings remains relatively limited. The OECD has applied LCA in multiple policy areas, including identifying different labour market groups based on employment barriers in the “Faces of Joblessness” reports (Fernandez et al., 2016[14]), and defining adult learning profiles in the “OECD Skills Strategy Implementation Guidance for Flanders, Belgium: The Faces of Learners in Flanders” report (OECD, 2022[3]). The methodology developed for Flanders has informed the approach used in this policy paper. The same model has been applied, with two main differences: this paper uses AES 2022 data instead of AES 2016 (see below), and the report for Flanders included additional characteristics for the profiles (e.g. socio-demographic characteristics, learning characteristics, and more).
LCA offers three key advantages over other common segmentation (or clustering) methods: 1) it relies on formal statistical tests guide the selection of the optimal number of profiles and other model parameters; 2) rather than assigning individuals to groups deterministically, LCA estimates the probability of membership in each profile, thereby reducing the risk of classification errors in subsequent analysis; and 3) it is well suited to handling common data challenges, such as missing values and complex survey designs (Collins and Lanza, 2009[20]).