The use of econometric models to forecast international migration took off in the 1990s, when discussions about the enlargement of the European Union to include Central European countries began. Forecasting potential labour mobility from new Member States (NMS) was high on the policy agenda. Forecasts had to take into account the transitional arrangements (i.e. restrictions on mobility), some of which remained in place after accession in 2004. Many academic researchers ran econometric models before the anticipated enlargement (Boeri and Brucker, 2001[1]; Dustmann et al., 2003[2]) or after the enlargement (Brücker, Damelang and Wolf, 2009[3]). These regression-based models (see Bijak (2011[4]) for details on various approaches) estimated different dependent variables (e.g. migration flows, migrant stocks, immigration rates) between destination and selected EU candidate countries, conditional on a set of traditional predictors such as labour market outcomes or income differences, GDP per capita and dummy variables denoting geographic or cultural distance. These forecasts failed to correctly anticipate what was going to happen, giving results far lower than actual flows. However, unemployment rates have been confirmed since then as a powerful explanatory factor (Bijak et al., 2019[5]) for all categories of migration, and not only to forecast labour migration. The models employed at the time have nevertheless been criticised for shortcomings of model specification, especially with respect to demographic variables and country-specific effects, which were missing in many studies (Bijak, 2011[4]). In particular, most assumed that past times series were stationary, an assumption that cannot hold in uncertain time (such as the global economic crisis which occurred in the late 2000s) and with such a volatile phenomenon as migration.
An additional problem with using covariates of migration in forecasting models, or in scenarios, is that they also need forecasting, either separate from or with migration. This means that the predictive uncertainty of migration is compounded by the predictive uncertainty of the covariates, and by the uncertain nature of the relationships between migration and its drivers (Bijak, 2011[4]; Barker and Bijak, 2025[6]). Still, this approach continues to be used in forecasts, as well as in migration scenarios, which chart several possible rather than likely future trajectories of migration (e.g. Acostamadiedo et al. (2020[7]) or Wiśniowski et al. (2023[8])). Such scenarios can be based on pre‑selected trajectories of drivers, perhaps described qualitatively rather than necessarily being quantified. An alternative approach could involve a driver-less statistical approach to scenario-setting, based on the frequency and magnitudes of rare migration events (Bijak, 2024[9]).
In addition to the aforementioned econometric models involving explanatory variables, another important group of migration forecasting approaches can be based on the traditional analysis and extrapolation of time series. It includes standard approaches to time series extrapolation such as Auto-Regressive Integrated Moving Average (ARIMA) models, including Generalised Autoregressive Conditional Heteroskedasticity (GARCH) and Stochastic Volatility (SV) extensions, as well as Vector Auto-Regressive (VAR) models (Barker and Bijak, 2025[6]). Bijak, et al. (2019[10]) assessed migration forecasting approaches using the United Kingdom data with various extrapolation methods and econometric models. Econometric models evaluated yield poor or only reasonable calibration with middle‑sized or even high measurement errors. Traditional extrapolation of time series are also miscalibrated or biased in most cases. This is true when forecasts are based on shorter series of data, or/and when series are non-stationary.
Some migration phenomena are nevertheless sufficiently stationary to be forecasted through traditional time series analysis based on past data only (e.g. family migration, see Box 6.1 and Table 4.1). Therefore, except for these few migration categories that are likely stationary, forecasts of other migration flows increasingly rely on Bayesian or Machine learning models.