Next Article in Journal
Mouse Data Defence Technology Using Machine Learning in Image-Based User Authentication: Based on the WM_INPUT Message
Previous Article in Journal
Intelligent Routing Optimization via GCN-Transformer Hybrid Encoder and Reinforcement Learning in Space–Air–Ground Integrated Networks
Previous Article in Special Issue
Weak-Cue Mixed Similarity Matrix and Boundary Expansion Clustering for Multi-Target Multi-Camera Tracking Systems in Highway Scenarios
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Efficient Object-Related Scene Text Grouping Pipeline for Visual Scene Analysis in Large-Scale Investigative Data

by
Enrique Shinohara
*,
Jorge García
,
Luis Unzueta
and
Peter Leškovský
Department of Intelligent Security Video Analytics, Vicomtech, 20009 Donostia-San Sebastián, Spain
*
Author to whom correspondence should be addressed.
Electronics 2026, 15(1), 12; https://doi.org/10.3390/electronics15010012 (registering DOI)
Submission received: 15 October 2025 / Revised: 10 December 2025 / Accepted: 16 December 2025 / Published: 19 December 2025
(This article belongs to the Special Issue Deep Learning-Based Scene Text Detection)

Abstract

Law Enforcement Agencies (LEAs) typically analyse vast collections of media files, extracting visual information that helps them to advance investigations. While recent advancements in deep learning-based computer vision algorithms have revolutionised the ability to detect multi-class objects and text instances (characters, words, numbers) from in-the-wild scenes, their association remains relatively unexplored. Previous studies focus on clustering text given its semantic relationship or layout, rather than its relationship with objects. In this paper, we present an efficient, modular pipeline for contextual scene text grouping with three complementary strategies: 2D planar segmentation, multi-class instance segmentation and promptable segmentation. The strategies address common scenes where related text instances frequently share the same 2D planar surface and object (vehicle, banner, etc.). Evaluated on a custom dataset of 1100 images, the overall grouping performance remained consistently high across all three strategies (B-Cubed F1 92–95%; Pairwise F1 80–82%), with adjusted Rand indices between 0.08 and 0.23. Our results demonstrate clear trade-offs between computational efficiency and contextual generalisation, where geometric methods offer reliability, semantic approaches provide scalability and class-agnostic strategies offer the most robust generalisation. The dataset used for testing will be made available upon request.
Keywords: object-level grouping; instance segmentation; planar surface detection; scene text recognition; scene understanding object-level grouping; instance segmentation; planar surface detection; scene text recognition; scene understanding

Share and Cite

MDPI and ACS Style

Shinohara, E.; García, J.; Unzueta, L.; Leškovský, P. Efficient Object-Related Scene Text Grouping Pipeline for Visual Scene Analysis in Large-Scale Investigative Data. Electronics 2026, 15, 12. https://doi.org/10.3390/electronics15010012

AMA Style

Shinohara E, García J, Unzueta L, Leškovský P. Efficient Object-Related Scene Text Grouping Pipeline for Visual Scene Analysis in Large-Scale Investigative Data. Electronics. 2026; 15(1):12. https://doi.org/10.3390/electronics15010012

Chicago/Turabian Style

Shinohara, Enrique, Jorge García, Luis Unzueta, and Peter Leškovský. 2026. "Efficient Object-Related Scene Text Grouping Pipeline for Visual Scene Analysis in Large-Scale Investigative Data" Electronics 15, no. 1: 12. https://doi.org/10.3390/electronics15010012

APA Style

Shinohara, E., García, J., Unzueta, L., & Leškovský, P. (2026). Efficient Object-Related Scene Text Grouping Pipeline for Visual Scene Analysis in Large-Scale Investigative Data. Electronics, 15(1), 12. https://doi.org/10.3390/electronics15010012

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop