1. Introduction
Cardiac impulse propagation depends on multiple structural and functional determinants. Among these, cardiomyocyte size and shape play a central role [
1,
2]. At the cellular level, membrane area contributes to capacitive properties, whereas cell volume influences intra-cellular resistance. Beyond the single cell, conduction is further modulated by tissue-level organization, including the spatial arrangement of myocytes, non-myocyte cells, and connective tissue within the extracellular space [
3]. At the molecular level, impulse propagation is critically shaped by the expression and spatial distribution of connexins.
Connexins form intercellular channels at gap junctions, providing direct electrical and metabolic coupling between adjacent cardiomyocytes and thereby sustaining the syncytial behavior of the myocardium [
4]. In healthy ventricular tissue, connexin-43 (CX43) is predominantly concentrated at the intercalated discs located at the longitudinal ends of cardiomyocytes. Under pathological or stress conditions, however, CX43 may redistribute toward the lateral cell borders, a phenomenon commonly referred to as lateralization [
5].
Because the spatial arrangement of gap junctions directly influences both the strength and anisotropy of intercellular coupling, changes in CX43 distribution are considered functionally relevant. In diseased myocardium, increased lateralization, often accompanied by an overall reduction in CX43 expression, has been associated with abnormal impulse propagation and increased susceptibility to arrhythmias [
6]. However, the functional consequences of this redistribution remain debated, as some studies suggest that laterally displaced CX43 may partially compensate for structural inhomogeneities by preserving some degree of coupling in remodeled regions [
7]. This unresolved issue highlights the need for quantitative approaches capable of characterizing CX43 distribution across large tissue areas.
Quantitative assessment of CX43 distribution in histological sections poses substantial practical challenges. The number of intercellular junctions within the myocardium far exceeds what can be comprehensively evaluated by conventional microscopy, while manual annotation is labor-intensive and therefore usually restricts analysis to small regions of interest. As a result, many studies may fail to capture the spatial heterogeneity of connexin remodeling across an entire tissue section [
8]. A further challenge lies in the segmentation of individual cardiomyocytes in intact tissue. Their complex branching morphology, dense packing, and irregular boundaries make reliable delineation difficult even for trained observers [
9]. In standard histological preparations, oblique sectioning and limited membrane contrast further increase the uncertainty associated with identifying individual cellular profiles [
10].
These limitations have motivated the development of automated and semi-automated image analysis methods. In cardiac tissue, deep learning approaches have been applied to histopathological classification and cellular quantification [
11,
12,
13]. More specialized methods have also been proposed to assess gap junction distribution in rat ventricular myocardium [
14] and to evaluate CX43 localization through co-localization with N-cadherin in atrial tissue [
15]. Automated quantification of intercellular coupling has likewise been reported in both non-cardiac cell cultures [
16] and cardiac co-culture systems [
17]. Despite these advances, no currently available method provides an automated framework for classifying CX43 distribution patterns across large myocardial tissue sections, particularly for distinguishing terminal from lateralized signal.
A method specifically designed for the automated quantification of CX43 lateralization in fluorescence immunohistochemistry images of ventricular myocardium has previously been reported [
18]. For clarity, this method is hereafter referred to as MARTA. In that approach, explicit segmentation of individual cardiomyocytes was required as a prerequisite for quantification. Once cell contours had been identified, each cell area was subdivided into terminal and central compartments using geometric rules, and the CX43 signal was assigned accordingly to estimate a lateral-to-total ratio. This design imposed several limitations. First, quantification was restricted to regions in which cardiomyocytes could be successfully segmented, leaving any CX43 signal outside delineated cells unassigned. Second, the method relied on a fixed geometric partition of the cell profile, assuming that compartment boundaries derived from a bounding rectangle adequately represented the underlying cellular morphology. Third, the pipeline consisted of a predefined sequence of morphological operations, including dilation, erosion, and contour extraction, and therefore did not learn from data, potentially limiting its robustness across images with different staining characteristics or tissue properties. Finally, extending this framework to supervised learning would require instance-level segmentation masks of individual cardiomyocytes, a type of annotation that is particularly difficult to generate in intact myocardial tissue.
Taken together, these considerations support the development of complementary strategies that reduce dependence on cardiomyocyte segmentation, without implying that segmentation-based methods are unsuitable in all contexts.
In the present work, we introduce CLARISA (Connexin Lateralization Automated ROI-based Image Signal Analyzer), an ROI-based approach in which CX43-positive regions, rather than cardiomyocytes, are the primary unit of analysis. Each region is classified directly as terminal or lateralized based on its appearance within the surrounding tissue context. This reframing also simplifies annotation: experts only need to identify CX43-positive regions and assign each to one of two classes—a substantially lighter task than producing cell-level segmentation masks, since CX43-positive regions appear as discrete, well-delimited structures, whereas cardiomyocyte boundaries are often ambiguous in intact tissue. This lighter annotation burden makes it feasible to build a labeled dataset, which in turn enables training a deep learning classifier that learns the morphological features distinguishing lateralized from terminal signal. At inference, the classifier assigns a lateralization probability to each detected CX43-positive region, independently of whether it falls within a delineated cell.
CLARISA’s framework comprises three main components: (i) generation of an expert-annotated dataset of CX43-positive regions labeled as terminal or lateralized; (ii) training of a deep learning classifier, based on a pretrained backbone, to predict the lateralization probability of individual regions from their local and contextual image appearance; and (iii) a semi-automated whole-section inference module that detects candidate regions using adjustable image-processing parameters and classifies them with the trained model, yielding an estimate of the lateralized fraction within the detected CX43-positive area. In addition, the framework includes an interactive annotation tool intended to facilitate the creation and extension of ROI-level datasets in future applications. Spatial probability maps are also generated to support visual inspection of predicted lateralization patterns. The approach does not require explicit cardiomyocyte segmentation and is presented here as a proof-of-principle framework rather than as a fully automated or broadly generalizable tool. An overview is provided in
Figure 1.
The aim of this study was to develop and evaluate CLARISA as a segmentation-free, ROI-based methodological framework for CX43 lateralization assessment in fluorescence immunohistochemistry images of rodent ventricular myocardium. The classifier was evaluated at the level of individual CX43-positive regions using held-out tissue sections, and whole-section applicability was assessed through the semi-automated inference module described above, including an exploratory comparison with the previously developed MARTA framework. The method, together with the annotated dataset, expert annotation tool, pretrained model, and code required to reproduce the pipeline, has been made publicly available through the resources listed in the
Supplementary Materials. Together, these resources are intended to provide a reproducible baseline that can be inspected, extended, or retrained with additional annotations for future datasets acquired under different experimental conditions.
3. Discussion
3.1. Training Data Diversity, Independent Deployment, and Annotation Uncertainty
A key limitation of the present study is the restricted diversity of the development dataset. Although the classifier was trained using transfer learning and a relatively large number of annotated CX43-positive ROIs, these annotations were derived from only two global images and a limited number of tissue sections. These global images corresponded to complete short-axis ventricular sections reconstructed from multiple microscopic fields of view and therefore contained within-section spatial heterogeneity. Nevertheless, the dataset remains limited in terms of the number of independent global images, tissue sections, animals, staining batches, and imaging conditions represented. As a result, the training data may not fully capture the broader variability that can affect CX43 appearance in practice, including differences in section orientation with respect to the myocardial fibers, staining intensity, signal fragmentation, image contrast, blur, or acquisition conditions.
The augmentation strategy used during training partially addressed geometric variability by introducing controlled changes in in-plane orientation, apparent scale, and crop positioning. However, we did not perform a dedicated robustness analysis against realistic imaging distortions such as systematic fluorescence intensity variation, defocus blur, noise, or slice-thickness-related signal degradation. Some of this variability may have been naturally present in the sampled ROIs, but it was not independently controlled or systematically simulated. Therefore, the performance estimates obtained within the training/validation/test framework should be interpreted as preliminary and dataset-dependent, rather than as evidence of broad robustness across heterogeneous imaging conditions.
To provide an additional evaluation beyond the original development images, we analyzed IM15, a section extracted from a global image not used during model development and acquired at a different spatial resolution. This focused whole-section experiment compared CLARISA outputs with expert visual annotation and with the previously published MARTA segmentation-based method. The results support the feasibility of deploying the framework outside the original development images, particularly because CLARISA produced a global percent lateralization estimate closely aligned with the expert annotation. However, this analysis involved a single additional section from a specific experimental context. It should therefore be interpreted as an initial independent deployment evaluation, not as evidence of broad external generalizability.
A further limitation concerns the annotation process used to define the reference labels. The original CX43-positive ROI annotations were generated by a single domain expert, which limits the objective characterization of ground-truth uncertainty. To partially address this issue, we performed a labeling-consistency experiment using the expert annotation workflow described above. As reported in
Section 2.3, both repeated annotation by the original expert and independent annotation by a second expert showed substantial agreement with the original labels. However, approximately 15–17% of ROIs changed label across repeated or independent annotation, indicating that a minority of regions remained annotation-sensitive. Visual inspection of these discordant cases suggested that they often corresponded to more challenging regions, including ROIs with weak or poorly defined fiber signal, lower image quality, local blur, fragmented staining, or insufficient contextual information to confidently infer the underlying fiber orientation and, consequently, the distribution of CX43 within it. These observations indicate that annotation uncertainty is not merely a procedural limitation but also reflects the biological and imaging complexity of some regions. Nevertheless, these experiments do not replace a full multi-expert annotation protocol for the complete dataset. Future extensions of the training set should therefore incorporate structured intra- and inter-rater procedures to more rigorously characterize annotation variability.
CLARISA should therefore be understood not only as a fixed classifier trained on the present dataset, but also as a transparent methodological framework that can be inspected, reproduced, and adapted to new experimental settings through additional annotation and retraining. For this purpose, the public repository includes both the full training pipeline and an expert annotation tool, described in
Supplementary Section S1.4 and based on the semi-automated ROI detection procedure detailed in
Section 4.6.1. This tool is intended to support faster and more standardized expert labeling when expanding the current dataset or generating new datasets under different experimental conditions. By presenting detected ROIs through a standardized interface and exporting structured ROI-level annotations, it may also improve the operational reproducibility of future annotation sessions. In the present work, the tool was used both to annotate IM15 for the focused comparison with expert visual inspection and MARTA, and to support the labeling-consistency experiments described above.
3.2. Spatially Mixed Error Patterns and Local Ambiguity
Spatial inspection of ROI-level predictions on the held-out test tissue slides IM1313 and IM1314 suggested that model errors were not organized into large, homogeneous regions of failure. Instead, both slides showed locally mixed neighborhoods in which correctly and incorrectly classified ROIs coexist (see
Figure 7). Thus, the main observation from the spatial analysis was not the presence of fully misclassified image regions, but rather the existence of locally ambiguous microenvironments in which prediction outcomes varied over short spatial distances. Because this analysis was based on two held-out sections, these spatial error patterns should be regarded as descriptive rather than as definitive evidence of general model behavior.
To examine this pattern more closely, we selected regions displaying local coexistence of correct and incorrect predictions, as well as high proximity or partial overlap between ROIs (see
Figure 7C,D). The zoomed views show that ROIs located very close to one another, and in some cases partially overlapping, could nevertheless yield different prediction outcomes. This suggests that the classifier may be sensitive to fine local differences in the visual content captured by each ROI. However, the bounding boxes displayed in
Figure 7 do not exactly correspond to the full 256 × 256 and 512 × 512 pixel crops used as model input. Therefore, spatial proximity or partial overlap between displayed ROI boxes does not necessarily imply that the model received identical visual information. Conversely, the fact that some partially overlapping ROIs remained correctly classified indicates that shared image content between neighboring crops was not, by itself, sufficient to produce systematic failure. Overall, these observations suggest that the model’s limitations are more closely related to fine-scale local ambiguity than to an inability to recognize larger tissue regions as a whole.
This interpretation is supported by the representative crops shown in
Supplementary Section S3. Several discordant or borderline cases occurred in morphologically challenging regions, where the distinction between terminal and lateralized patterns was visually difficult. For example, ROI 3012 in IM1313 and ROIs 3028 and 3034 in IM1314 were better interpreted as borderline examples than as clear-cut classification cases (
Supplementary Figures S6 and S7). These ROIs illustrate continuity between well-defined terminal or lateralized patterns and intermediate local morphologies in which the reference label itself may be difficult to assign with full confidence.
Other discordant cases were associated with blur, low contrast, dim signal, or atypical local morphology, as illustrated by ROIs 3011, 3010, 3009, and 3013 in IM1313 and ROIs 3052, 3055, 3063, and 3069 in IM1314 (
Supplementary Figures S6 and S7). In selected cases, such as ROI 3069, difficulties in centering the analysis window may also have contributed. This reflects a methodological limitation of the crop-generation procedure: when the bounding box lies close to the image border, or when the relevant signal is not optimally centered within the detected ROI, the corresponding local or contextual crop may not fully capture the surrounding tissue context used by a human observer.
Together with the annotation uncertainty discussed in
Section 3.1, these observations suggest that not every model–label disagreement should be interpreted as an unequivocal model failure. Some apparent model errors should therefore be interpreted in light of local ambiguity in the reference label itself, particularly for ROIs located in morphologically complex tissue regions.
3.3. Whole-Section Deployment and Methodological Comparison
3.3.1. Deployment on Held-Out Whole Sections
The two held-out sections exhibited distinct whole-section spatial patterns. IM1314 showed more prominent clustered high-probability regions and a higher global lateralization estimate than IM1313, whereas IM1313 displayed a more homogeneous pattern with fewer and more scattered lateralized foci. These observations suggest that CLARISA can generate spatially structured tissue-level outputs, rather than merely producing independent ROI-level classifications.
The ROI-level probability distributions also differed between sections. In IM1313, ROIs assigned to the terminal class were tightly concentrated near zero, whereas ROIs assigned to the lateralized class showed a broader spread of predicted probabilities above the decision threshold. In IM1314, lateralized ROIs were more compactly distributed at high probability values, while terminal ROIs showed broader dispersion. This section-specific confidence structure suggests that CLARISA’s predictions were not equally decisive across both images. In particular, the broader distribution observed in IM1313 is consistent with the local ambiguities discussed in
Section 3.2, although this correspondence should be interpreted descriptively rather than as evidence of a general slide-level error pattern.
This analysis should therefore be viewed primarily as a demonstration of whole-section deployment rather than as a formal validation of spatial accuracy. Because exhaustive expert annotation was not available for these held-out sections, its main value lies in showing that CLARISA can be operationally applied to complete tissue sections and can produce interpretable spatial maps and quantitative section-level summaries.
Whole-section deployment should nevertheless be considered semi-automated rather than fully automated. ROI classification is performed automatically by the trained model, but the preceding detection of candidate CX43-positive regions depends on user-defined image-processing parameters, including intensity thresholding and morphological operations. This parameter dependence may influence the number and type of ROIs entering the classifier and, consequently, the estimated global lateralization percentage. However, CLARISA requires substantially fewer user-defined parameters than MARTA, the segmentation-based method used for comparison in this work, which involves more than 25 adjustable parameters. Future work should focus on standardized or automated parameter selection and on systematic robustness analyses across slides with different staining intensity, resolution, contrast, and acquisition conditions.
3.3.2. Exploratory Comparison with Expert Annotation and MARTA in an Independent Section
Whereas
Section 2.4 examined whole-section deployment in held-out test sections,
Section 2.5 evaluated CLARISA on an additional image not used for model training or hyperparameter tuning. In this independent section, CLARISA outputs were compared with manual expert annotation and with MARTA, a previously published segmentation-based method. At the tissue-pattern level, the expert-derived and CLARISA-derived heatmaps showed similar large-scale spatial organization, including a central warm band and cooler peripheral regions. However, the agreement was not exact, with local differences in the extent and intensity of some regions.
At the ROI level, CLARISA generally assigned lower lateralization probabilities to ROIs annotated by the expert as terminal and higher probabilities to those annotated as lateralized. This supports an association between model output and expert labeling at the level of individual detected regions. However, the overlap around the decision threshold indicates that not all ROIs were cleanly separable into the two categories. Rather than representing only model disagreement, this pattern likely reflects, at least in part, the local ambiguity discussed in
Section 3.1 and
Section 3.2, where some CX43-positive regions were difficult to assign confidently even by visual inspection.
The comparison with MARTA highlights a methodological distinction that is central to whole-section applicability. MARTA depends on successful cardiomyocyte segmentation and therefore quantifies only the subset of CX43-positive signal located within segmented cells. By contrast, CLARISA operates directly on detected CX43-positive ROIs and is affected by a different set of assumptions, related to ROI detection, crop context, and classification. Accordingly, the comparison should be interpreted as an exploratory methodological benchmark rather than as a definitive test of superiority.
This distinction was reflected in the section-level estimates. In IM15, CLARISA yielded a global lateralization estimate much closer to the expert-derived %LatArea
all value than MARTA, with absolute differences of 1.30 and 20.78 percentage points, respectively. However, because CLARISA and MARTA do not quantify identical signal populations, this numerical difference should not be interpreted in isolation. MARTA’s dependence on successful cardiomyocyte segmentation reduced the CX43-positive signal available for quantification in this section, as visible in
Figure 4D. Thus, part of the difference between methods reflects their effective analysis domains.
The intra-cellular analysis partially addressed this issue by limiting CLARISA to the 147 ROIs located within MARTA-segmented cardiomyocytes. This reduced CLARISA’s estimate from 44.26% to 37.4%, indicating that spatial coverage contributed to the discrepancy. Nevertheless, an approximately 15 percentage-point difference between CLARISA and MARTA persisted after restriction, while the expert estimate over the same intra-cellular ROIs was 48.9%. This pattern suggests that the numerical gap between methods is not explained solely by spatial coverage, but also by their different operational definitions of lateralization. MARTA computes the fraction of CX43-positive fluorescence area located within geometrically defined lateral compartments of segmented cardiomyocytes, whereas CLARISA and the expert assign labels to individual ROIs based on their visual appearance in context. These definitions can yield different outputs even when applied to the same local CX43 foci.
The ROI-level cross-tabulations support this interpretation. CLARISA and the expert showed similar fair-to-moderate agreement across the full IM15 dataset and within the intra-cellular subset, indicating that restricting the analysis to MARTA-segmented cells did not substantially alter the CLARISA–expert relationship. In contrast, both CLARISA and the expert showed only slight agreement with MARTA’s geometric compartment assignments. This suggests that compartment location captures one aspect of CX43 organization but does not fully reproduce the visual criteria used by the expert or learned by CLARISA from ROI-level annotations.
Overall, this single-section comparison should be regarded as hypothesis-generating. It suggests that CLARISA and segmentation-based approaches provide related but non-equivalent readouts of CX43 lateralization, but systematic evaluation against larger independent datasets and, ideally, biological or functional endpoints will be required to determine their relative validity and use cases.
3.4. Biological Relevance and Potential Applications
CX43 is a central mediator of electrical and metabolic coupling in ventricular myocardium, and its redistribution from intercalated discs toward lateral cell membranes has been associated with altered impulse propagation and increased arrhythmogenic vulnerability [
20]. However, the quantitative characterization of this topological remodeling and its functional consequences remains a major challenge in the field of cardiac arrhythmias [
20].
This spatial component is biologically relevant because changes in CX43 organization may not be captured by bulk expression measurements alone. In several pathological settings, overall CX43 expression assessed by qPCR or Western blot can remain similar despite differences in tissue activation, impulse propagation, or arrhythmia susceptibility [
5,
21,
22]. In such cases, the distribution of CX43 across the myocardium may provide information that complements total expression levels. Methods capable of quantifying CX43 organization over whole tissue sections may therefore help identify structural correlates of conduction disturbances and support comparative studies of connexin remodeling across disease models, treatments, species, or experimental conditions.
A practical motivation for automated analysis is the scale of the annotation task. Manual classification of individual CX43-positive regions as terminal or lateralized is time-intensive; in our workflow, labeling fewer than 250 regions required more than three hours of expert time. By contrast, CLARISA can process thousands of automatically detected regions per image and generate both ROI-level classifications and tissue-scale spatial summaries. This throughput makes it possible to analyze CX43 distribution at a scale that would be impractical through manual inspection alone, particularly in studies involving entire tissue sections or multiple experimental groups.
Within these constraints, CLARISA should be viewed as a proof-of-principle framework rather than a broadly validated or clinically applicable tool. The current evaluation remains limited to one species, a restricted number of tissue sections, and imaging conditions close to those used during model development. Generalization to other species, pathological models, staining protocols, imaging platforms, and acquisition settings will require systematic validation. Nevertheless, by combining segmentation-free ROI detection, multi-scale classification, whole-section probability mapping, and an expert annotation workflow for dataset expansion, CLARISA provides a reproducible and scalable starting point for quantitative studies of CX43 spatial remodeling in experimental cardiac tissue.
5. Conclusions
In this study, we developed CLARISA, a segmentation-free, ROI-based framework for assessing CX43 lateralization in fluorescence images of ventricular myocardium without requiring explicit cardiomyocyte segmentation. An expert-annotated dataset of CX43-positive ROIs classified as terminal or lateralized was generated to support model training, and a dual-scale transfer-learning classifier was developed to predict the lateralization status of individual regions from their local morphology and surrounding tissue context. The classifier was integrated into a semi-automated whole-section inference workflow that generates spatial probability maps and quantifies the proportion of lateralized CX43 signal across complete tissue images.
The classifier was integrated into a semi-automated whole-section inference workflow that generates spatial probability maps and estimates the proportion of lateralized CX43 signal across complete tissue images. The present results support the feasibility of automated ROI classification and whole-section spatial mapping as a proof-of-principle methodological framework. However, CLARISA should not yet be interpreted as a fully automated, broadly generalizable, or clinically validated tool. The current evidence remains limited by the size and diversity of the dataset, the reliance on annotations initially generated by a single expert, the need for manual adjustment of some inference parameters, and the limited validation across heterogeneous imaging conditions.
Beyond the trained classifier, CLARISA is provided as a transparent and extensible framework. The pretrained model, annotation resources, expert annotation tool, and training scripts are publicly available, allowing users to inspect and reproduce the workflow, evaluate the model on comparable data, and generate new ROI-level datasets for dataset-specific adaptation.
Future work should include larger and more diverse datasets, structured multi-expert annotation protocols, automated or standardized parameter selection, and broader validation across independent tissue preparations, species, staining protocols, pathological models, and imaging pipelines. Overall, CLARISA provides a scalable starting point for segmentation-free CX43 lateralization assessment in experimental cardiac tissue, particularly in studies where manual ROI-level quantification would be labor-intensive and difficult to scale.