Categorical Explaining Functors: Ensuring Coherence in Logical Explanations

Fioravanti S.; Frazzetto P.; Confalonieri R.; Navarin N.
2025

Abstract

Post-hoc methods in Explainable AI (XAI) elucidate black-box models by identifying the input features critical to the model's decision-making. Recent advances in these methods have enabled the generation of logic-based explanations that capture interactions among input features. However, these techniques often suffer from critical limitations, notably the inability to ensure logical consistency and fidelity between the generated explanations and the model's actual decision-making process. Such inconsistencies jeopardize the reliability of explanations, particularly in high-risk domains. To address this gap, we introduce a novel, theoretically rigorous approach rooted in category theory. Specifically, we propose the concept of an explaining functor, which structurally preserves logical entailment between the explanations and the decisions of a black-box model. By establishing a categorical framework, our method guarantees the coherence and accuracy of the extracted explanations, overcoming the common pitfalls of heuristic-based explanation methods. We demonstrate the practical efficacy of our theoretical contributions on two synthetic benchmarks, which show significant reductions in contradictory and unfaithful explanations. Our experiments show that our framework provides mathematically grounded, compositional, and coherent explanations.
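To make the idea concrete, the following is a minimal worked sketch in the spirit of the abstract; the choice of preorder categories and the symbols $\mathcal{D}$, $\mathcal{E}$, $F$ are illustrative assumptions, not the paper's exact definitions. Take $\mathcal{E}$ to be the preorder category whose objects are propositional formulas over the input features, with a (unique) morphism $\varphi \to \psi$ exactly when $\varphi \models \psi$, and take $\mathcal{D}$ to be a category whose objects are model decisions and whose morphisms are refinements between decisions. An explaining functor $F \colon \mathcal{D} \to \mathcal{E}$ assigns a formula to every decision and an entailment to every refinement, subject to functoriality:
\[
F(\mathrm{id}_d) = \mathrm{id}_{F(d)}, \qquad F(g \circ f) = F(g) \circ F(f) \quad \text{for } f \colon d \to d',\; g \colon d' \to d''.
\]
Because morphisms in $\mathcal{E}$ are entailments, any chain of decision refinements $d \to d' \to d''$ is sent to a transitive chain $F(d) \models F(d') \models F(d'')$, so explanations extracted along a decision path cannot contradict one another by construction.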
Proceedings of the 22nd International Conference on Principles of Knowledge Representation and Reasoning (KR 2025)
   Symbolic conditioning of Graph Generative Models (SymboliG)
   Funded by the European Union under the National Recovery and Resilience Plan (NRRP), Mission 4 Component 2 Investment 1.3, NextGenerationEU, Code PE0000013, Concession Decree No. 1555 of October 11, 2022, CUP C63C22000770006

Use this identifier to cite or link to this document: https://hdl.handle.net/11577/3590889