This study introduces a novel approach in analyzing the determinants of tourism expenditure in Italy, with a particular focus on the differences between domestic and incoming tourism. The analysis is based on microdata from two surveys: ISTAT’s “Travels and Holidays” and the Bank of Italy’s “International Tourism”. Expenditure forecasts for both categories were calculated using the eXtreme Gradient Boosting (XGBoost) machine learning algorithm, ensuring high predictive accuracy. To identify the main variables influencing tourism expenditure, SHapley Additive exPlanations (SHAP) values were calculated, providing a deeper interpretation of the contribution of each one of them. The results highlight both differences and common points in the determinants driving domestic and incoming tourism expenditure, underscoring the need for targeted policy interventions to address the distinct characteristics of these two segments. The methodological integration of machine learning techniques and SHAP values offers a tool for understanding and forecasting tourism expenditure trends in Italy, with relevant implications for policymakers and industry stakeholders.
Explaining Tourism Expenditure Patterns in Italy: A Comparative Study of Domestic and Incoming Tourists Using XGBoost and SHAP
Antolini, Fabrizio;Cesarini, Samuele;Terraglia, Ivan
2025-01-01
Abstract
This study introduces a novel approach in analyzing the determinants of tourism expenditure in Italy, with a particular focus on the differences between domestic and incoming tourism. The analysis is based on microdata from two surveys: ISTAT’s “Travels and Holidays” and the Bank of Italy’s “International Tourism”. Expenditure forecasts for both categories were calculated using the eXtreme Gradient Boosting (XGBoost) machine learning algorithm, ensuring high predictive accuracy. To identify the main variables influencing tourism expenditure, SHapley Additive exPlanations (SHAP) values were calculated, providing a deeper interpretation of the contribution of each one of them. The results highlight both differences and common points in the determinants driving domestic and incoming tourism expenditure, underscoring the need for targeted policy interventions to address the distinct characteristics of these two segments. The methodological integration of machine learning techniques and SHAP values offers a tool for understanding and forecasting tourism expenditure trends in Italy, with relevant implications for policymakers and industry stakeholders.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


