SegClarity: An Attribution-Based XAI Workflow for Evaluating Historical Document Layout Models

Iheb Brini; Najoua Rahal; Maroua Mehri; Rolf Ingold; Najoua Essoukri Ben Amara

doi:10.3390/jimaging11120424

,

and

¹

Ecole Nationale d’Ingénieurs de Sousse, Laboratory of Advanced Technology and Intelligent Systems (LATIS), Université de Sousse, Sousse 4054, Tunisia

²

DIVA Group, University of Fribourg, 1700 Fribourg, Switzerland

^*

Author to whom correspondence should be addressed.

J. Imaging2025, 11(12), 424;https://doi.org/10.3390/jimaging11120424
(registering DOI)

This article belongs to the Special Issue Explainable AI in Computer Vision

Version Notes

Order Reprints

Abstract

In recent years, deep learning networks have demonstrated remarkable progress in the semantic segmentation of historical documents. Nonetheless, their limited explainability remains a critical concern, as these models frequently operate as black boxes, thereby constraining confidence in the trustworthiness of their outputs. To enhance transparency and reliability in their deployment, increasing attention has been directed toward explainable artificial intelligence (XAI) techniques. These techniques typically produce fine-grained attribution maps in the form of heatmaps, illustrating feature contributions from different blocks and layers within a deep neural network (DNN). However, such maps often closely resemble the segmentation outputs themselves, and there is currently no consensus regarding appropriate explainability metrics for semantic segmentation. To overcome these challenges, we present SegClarity, a novel workflow designed to integrate explainability into the analysis of historical documents. The workflow combines visual and quantitative evaluations specifically tailored to segmentation-based applications. Furthermore, we introduce the Attribution Concordance Score (ACS), a new explainability metric that provides quantitative insights into the consistency and reliability of attribution maps. To evaluate the effectiveness of our approach, we conducted extensive qualitative and quantitative experiments using two datasets of historical document images, two U-Net model variants, and four attribution-based XAI methods. A qualitative assessment involved four XAI methods across multiple U-Net layers, including comparisons at the input level with state-of-the-art perturbation methods RISE and MiSuRe. Quantitatively, five XAI evaluation metrics were employed to benchmark these approaches comprehensively. Beyond historical document analysis, we further validated the workflow’s generalization by demonstrating its transferability to the Cityscapes dataset, a challenging benchmark for urban scene segmentation. The results demonstrate that the proposed workflow substantially improves the interpretability and reliability of deep learning models applied to the semantic segmentation of historical documents. To enhance reproducibility, we have released SegClarity’s source code along with interactive examples of the proposed workflow.

Keywords:

deep neural networks; document layout analysis; semantic segmentation; explainable artificial intelligence; attribution maps; evaluation metrics

Article Metrics

Citations

Article Access Statistics

Journal Statistics

Article metric data becomes available approximately 24 hours after publication online.