You are currently viewing a new version of our website. To view the old version click .
Journal of Imaging
  • This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
  • Article
  • Open Access

28 November 2025

SegClarity: An Attribution-Based XAI Workflow for Evaluating Historical Document Layout Models

,
,
,
and
1
Ecole Nationale d’Ingénieurs de Sousse, Laboratory of Advanced Technology and Intelligent Systems (LATIS), Université de Sousse, Sousse 4054, Tunisia
2
DIVA Group, University of Fribourg, 1700 Fribourg, Switzerland
*
Author to whom correspondence should be addressed.
J. Imaging2025, 11(12), 424;https://doi.org/10.3390/jimaging11120424 
(registering DOI)
This article belongs to the Special Issue Explainable AI in Computer Vision

Abstract

In recent years, deep learning networks have demonstrated remarkable progress in the semantic segmentation of historical documents. Nonetheless, their limited explainability remains a critical concern, as these models frequently operate as black boxes, thereby constraining confidence in the trustworthiness of their outputs. To enhance transparency and reliability in their deployment, increasing attention has been directed toward explainable artificial intelligence (XAI) techniques. These techniques typically produce fine-grained attribution maps in the form of heatmaps, illustrating feature contributions from different blocks and layers within a deep neural network (DNN). However, such maps often closely resemble the segmentation outputs themselves, and there is currently no consensus regarding appropriate explainability metrics for semantic segmentation. To overcome these challenges, we present SegClarity, a novel workflow designed to integrate explainability into the analysis of historical documents. The workflow combines visual and quantitative evaluations specifically tailored to segmentation-based applications. Furthermore, we introduce the Attribution Concordance Score (ACS), a new explainability metric that provides quantitative insights into the consistency and reliability of attribution maps. To evaluate the effectiveness of our approach, we conducted extensive qualitative and quantitative experiments using two datasets of historical document images, two U-Net model variants, and four attribution-based XAI methods. A qualitative assessment involved four XAI methods across multiple U-Net layers, including comparisons at the input level with state-of-the-art perturbation methods RISE and MiSuRe. Quantitatively, five XAI evaluation metrics were employed to benchmark these approaches comprehensively. Beyond historical document analysis, we further validated the workflow’s generalization by demonstrating its transferability to the Cityscapes dataset, a challenging benchmark for urban scene segmentation. The results demonstrate that the proposed workflow substantially improves the interpretability and reliability of deep learning models applied to the semantic segmentation of historical documents. To enhance reproducibility, we have released SegClarity’s source code along with interactive examples of the proposed workflow.

Article Metrics

Citations

Article Access Statistics

Article metric data becomes available approximately 24 hours after publication online.