The paper reflects an interdisciplinary dialogue between legal science, terminology science and computer science aimed at analyzing one specific aspect within the process of digitization of legal operations, in particular via AI: what is the role of labelling and categorization in language and law, and how might legal interpretation be influenced by AI?. We show how a potential challenge in legal AI lies in the obliteration of a certain indeterminacy in legal terms (both concrete and abstract), by arguing that terminology could help controlling the use AI systems make of legal language (via the distinction and systematization of simple terms, concepts, etc.). Terminology-aware generative models would allow more controlled reasoning across languages and jurisdictions and could facilitate explainability by linking each generated statement to a definitional network, thereby exposing the underlying semantic structure of the legal domain. This might help build a trustworthy, explainable AI for law. In this view, terminology could provide an epistemic infrastructure for legal language within an AI operated world, by anchoring generative systems to the conceptual order of law, and by preserving the necessary interplay between determination and indeterminacy that characterises legal language, preventing the risk of current hallucination, over determinacy or the complete loss of penumbra and indeterminacy.

Law, Language, and Terminology. Challenges of Generative AI

Di Nunzio G. M.;Vezzani F.
2026

Abstract

The paper reflects an interdisciplinary dialogue between legal science, terminology science and computer science aimed at analyzing one specific aspect within the process of digitization of legal operations, in particular via AI: what is the role of labelling and categorization in language and law, and how might legal interpretation be influenced by AI?. We show how a potential challenge in legal AI lies in the obliteration of a certain indeterminacy in legal terms (both concrete and abstract), by arguing that terminology could help controlling the use AI systems make of legal language (via the distinction and systematization of simple terms, concepts, etc.). Terminology-aware generative models would allow more controlled reasoning across languages and jurisdictions and could facilitate explainability by linking each generated statement to a definitional network, thereby exposing the underlying semantic structure of the legal domain. This might help build a trustworthy, explainable AI for law. In this view, terminology could provide an epistemic infrastructure for legal language within an AI operated world, by anchoring generative systems to the conceptual order of law, and by preserving the necessary interplay between determination and indeterminacy that characterises legal language, preventing the risk of current hallucination, over determinacy or the complete loss of penumbra and indeterminacy.
2026
   HetERogeneous sEmantic Data integratIon for the guT-bRain interplaY
   HEREDITARY
   European Commission
   Horizon Europe Framework Programme - HORIZON Research and Innovation Actions
   101137074
File in questo prodotto:
File Dimensione Formato  
s11196-026-10430-z (3).pdf

Accesso riservato

Tipologia: Published (Publisher's Version of Record)
Licenza: Accesso privato - non pubblico
Dimensione 814.29 kB
Formato Adobe PDF
814.29 kB Adobe PDF Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11577/3591808
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
  • OpenAlex ND
social impact