The preliminary recoding procedure from Ateco 2022 to Ateco 2025

Authors

  • Francesca Alonzi, Istat - Italian National Institute of Statistics
  • Annarita Mancini, Istat - Italian National Institute of Statistics
  • Caterina Viviano, Istat - Italian National Institute of Statistics

DOI:

https://doi.org/10.71014/sieds.v80i3.485

Keywords:

Classification of economic activities, Automatic matching algorithms, Correspondence tables

Abstract

On 1 January 2025, the revised Italian classification of economic activities, Ateco 2025, entered into force. To implement it for both statistical and administrative purposes, an automated procedure recodes enterprises according to the new scheme. The procedure relies on a mapping and operational correspondence table that automatically resolves one-to-many cases between Ateco 2022 and Ateco 2025. This paper describes how that table was developed: i) applying an automatic matching algorithm that compares the text strings (headings and inclusion notes) of the two classifications, Ateco 2022 and Ateco 2025; ii) analysing the results of the SEA (Survey of Economic Activities); and iii) involving classification experts. The results show that, when no information describing the economic activity is available at the individual level, the tool is useful for a preliminary large-scale recoding of the enterprise registers and archives maintained by various bodies.
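
As a rough illustration of the text-comparison step, the sketch below scores the heading of one source code against the headings of several candidate target codes and keeps the closest one. It is a minimal sketch under stated assumptions, not the authors' procedure: the abstract does not specify the similarity measure, so Python's standard `difflib.SequenceMatcher` stands in for it, and the codes `X1` and `X2`, the headings, and the helper names `similarity` and `best_target` are all invented for illustration.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a 0..1 similarity ratio between two case-normalised strings."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def best_target(source_text: str, candidates: dict[str, str]) -> tuple[str, float]:
    """Pick the candidate code whose description is most similar to source_text."""
    scored = {code: similarity(source_text, text) for code, text in candidates.items()}
    best = max(scored, key=scored.get)
    return best, scored[best]


# Hypothetical one-to-many case: the codes and texts are invented, not real Ateco entries.
source_heading = "Retail sale of clothing in specialised stores"
candidates = {
    "X1": "Retail sale of clothing",
    "X2": "Retail sale of footwear and leather goods",
}

code, score = best_target(source_heading, candidates)
print(code, round(score, 2))
```

In a workflow like the one the abstract outlines, a score of this kind would only propose a default target; survey results and expert review would still be needed to confirm or override it.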

Published

2026-02-26

Section

Articles