The database of people who participated in the Youth Guarantee during the evaluation period served as the sampling frame for the survey. It contains the details of 849 1691 participants, including information on: i) gender; ii) age group at registration (16‑19 years, 20‑24 years, or 25‑29 years); iii) autonomous community of residence; iv) vulnerability status; and v) type of service(s) received during participation in the Youth Guarantee, i.e. career guidance, training or employment services.2
The OECD evaluation team agreed that the survey results for each type of service would need to be analysed using the gender and vulnerability status of the participants as variables, although other characteristics could also be used. They also established that the sample had to be representative of the whole group of Youth Guarantee participants, in terms of type of service received, gender, vulnerability status, age group and autonomous community of residence.
They decided, therefore, to explicitly stratify the sample using the type of service received and participants’ gender and vulnerability status as variables. As some participants received several types of service, before the selection was carried out, each participant was allocated to one of the following four groups: i) career guidance; ii) training; iii) employment services; or iv) no service received. This ensured that each participant appeared only once in the sample. Assignment to these groups was done randomly. The group of participants who had received two types of service was split in two, and one‑half allocated to each service. The group that had received three types was divided into three parts, and one‑third allocated to each service. This allocation approach did, however, maintain the composition of each group in terms of gender, vulnerability status, age group and autonomous community of residence.
Table A C.1. shows the results of this allocation, while Table A C.2. presents the distribution of the resulting sample, by explicit stratum.
To ensure that the sample reflected the distribution of the sampling frame, implicit stratification was used. This meant that the participants for each explicit stratum were selected using systematic random sampling, working from lists that had been ordered by age group and by autonomous community.