Next Article in Journal
Insights into Transport Function of the Murine Organic Anion-Transporting Polypeptide OATP1B2 by Comparison with Its Rat and Human Orthologues
Previous Article in Journal
Editorial—Pesticide Risk Assessment, Emerging and Re-Emerging Problems
Previous Article in Special Issue
Comprehensive In Vitro Safety Assessment of Acorus calamus Rhizome Oil Using OECD-Compliant New Approach Methods: Classification as a GHS Category 1B Sensitiser and Category 2 Irritant
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

RNA-Seq Can Be Used to Quantify Gene Expression Levels for Use in the GARDskin Assay

Senzagen AB, 22381 Lund, Sweden
*
Author to whom correspondence should be addressed.
Toxics 2026, 14(1), 9; https://doi.org/10.3390/toxics14010009 (registering DOI)
Submission received: 25 September 2025 / Revised: 4 December 2025 / Accepted: 18 December 2025 / Published: 20 December 2025

Abstract

Non-animal methods for identification and characterization of skin sensitizers are continuously evolving, advancing towards more effective, accurate, and informational assays. The GARDskin assay is a scientifically and regulatory recognized assay for the assessment of skin sensitizers. It currently relies on targeted gene expression measurement to derive hazard classifications. With the progression of next generation sequencing technologies, whole transcriptome analysis provides an interesting alternative to the currently implemented targeted gene expression approach. RNA-seq was evaluated for its use in the GARDskin assay as a gene expression quantification method. Based on 24 paired samples acquired on both RNA-seq and the NanoString nCounter platform (the currently standard GARDskin acquisition method), gene expression profiles were found to be highly similar. Comparisons of treatment effects yielded a Spearman’s correlation coefficient of 0.95 and a Lin’s concordance correlation coefficient of 0.87. RNA-seq data was also used to classify the sensitizing hazard of 24 treatments using the standard GARDskin analysis pipeline. The classifications corresponded completely with references, rendering correct classifications for all treatments. In conclusion, it was found that the RNA-seq data strongly resembled NanoString nCounter data, and that it could be used to derive reliable hazard classifications in the GARDskin assay.

1. Introduction

Allergic contact dermatitis (ACD) is caused by small molecular weight chemicals that are able to penetrate the protective skin barrier, bind to endogenous proteins while inducing an inflammatory milieu via, for example, triggering of innate danger signals, thereby initiating an immune response that results in the formation of a specific immunological memory towards the chemical [1,2,3]. Repeated exposure to the same chemical elicits the immune response that gives rise to the adverse effects typically associated with ACD.
While there are interindividual and environmental factors that affect both the susceptibility and severity of ACD [4], the underlying pathomechanism of the condition is relatively well established [5], which has facilitated the development of several assays for identification and characterization of skin sensitizers. The earliest such assays were the in vivo variants [6,7,8,9,10,11], which are progressively being replaced by non-animal alternatives, which include in chemico [12,13,14], in vitro [15,16,17,18,19,20,21], and in silico [22] approaches. Non-animal methods is a continuously evolving field, and new methods and improvements to existing assays are being actively developed and presented [23,24,25,26,27,28,29].
The Genomic Allergen Rapid Detection (GARD) assay for assessment of skin sensitizers (GARDskin) is one such in vitro test method. It is based on a human dendritic cell-like cell line and was developed for hazard identification of skin sensitizers [16,30,31]. It is described in the OECD test guideline 442E [32] as a method for monitoring dendritic cell activation, which is the third key event (KE) of the adverse outcome pathway (AOP) for skin sensitization [2]. The first developmental steps towards the formalization of the GARDskin assay were described by Johansson et al., 2011 [16], where whole transcriptome analyses of a dendritic cell-like cell line treated with a panel of established skin sensitizers and non-sensitizers were used to identify a predictive subset of genes capable of effectively separating the sensitizing chemicals from the non-sensitizing ones. The identified gene set included genes that could be linked to biologically meaningful cellular processes, including dendritic cell activation and maturation, and cytoprotective pathways. The wider biological effects of chemical sensitizers in the GARDskin cell system have been examined in detail elsewhere [33]. Following the initial discovery study, the GARDskin assay has been standardized [30], validated [31], shown to be effective on chemical subsets typically considered difficult to test [34,35,36], and further extended to enable medical device classifications [37] and potency predictions [23,38]. It predicts hazard outcomes from the induced gene expression patterns in the genomic prediction signature, termed the GARDskin prediction signature (GPS), using a trained and fixed support vector machine (SVM) (as detailed in the supporting document to the GARDskin assay [39]). The gene expression quantification is performed using a targeted acquisition approach, relying on the NanoString nCounter platform. While the assay was originally developed using Affymetrix arrays, standardization requirements prompted the transition towards a simpler quantification platform, as elaborated on by Forreryd et al. [40], which resulted in a transfer to the NanoString platform [30].
The NanoString nCounter platform has several properties that are advantageous for standardized quantification of defined gene sets. For example, the platform allows for simultaneous quantification of up to 800 target genes; the experimental protocols are simple and easy to implement and standardize; the required hands-on time is low, and the time from experimental initiation to data availability is short; the quantified data is simple to handle and analyze as the targeted transcripts are automatically quantified by the platform without need for further processing. Finally, the platform’s readouts have been shown to be robust and reproducible [41].
Despite this, it is still relevant to explore other gene expression quantification platforms that may be compatible with the GARDskin assay, which is an endeavor that has also been performed previously for other biomarker-based assays [42,43]. By, for example, extending the number of compatible acquisition platforms, the assay may become more accessible. One alternative technology capable of producing quantified gene expression measurements is next generation sequencing (NGS), particularly RNA sequencing (RNA-seq), which has a wide user base and is implemented in many laboratories worldwide [44]. It has matured into a robust acquisition technology over the last decades, and experimental components are available from several different providers [45,46]. While generated data still require significantly more processing before gene level abundance estimates are obtained compared with the NanoString platform, many available tools have been designed with both efficiency and accuracy in mind [47,48,49,50]. In addition, commonly used peer-reviewed pipelines have also been packaged into user-friendly interfaces which reduces the potential hurdle of handling and generating reproducible NGS data [51,52]. Finally, while a targeted quantification approach is effective when genomic targets are known, whole transcriptome data is a technically feasible alternative [53], with one of its main benefits being that the acquired data is not limited to a priori defined gene sets, which significantly extends the ability to perform secondary analyses, including exploratory analyses and differential expression testing [54]. In the context of standardized testing, these types of secondary analyses may be used to enhance the understanding of potential perturbances’ cellular effects.
Therefore, the main objective of this study was to investigate if NGS technology could be used as a method for quantifying gene expression data in the GARDskin assay. In this work, comparisons of NanoString nCounter and RNA-seq data from paired samples show that the genes in the GPS can be reproducibly quantified on both platforms. Notably, the sequencing experiments were performed by an independent laboratory with encoded sample identities. The study also showed that data quantified with RNA-seq can be efficiently used as a data source in the existing GARDskin analysis pipeline to produce accurate hazard classifications.

2. Materials and Methods

2.1. Overview of the Study Design

The main objective of the study was to evaluate if GARDskin classifications could be accurately obtained from gene expression data quantified using RNA-seq, as an alternative quantification method to the NanoString nCounter platform (Bruker Corporation, Billerica, MA, USA) described in the GARDskin protocols [31,55]. To this end, the study was set up in two main parts.
The first part aimed at comparing gene expression data between the two platforms using paired RNA samples. A total of 24 paired RNA samples were acquired on both platforms. These corresponded to 12 replicates of the GARDskin unstimulated control and 12 replicates of the GARDskin positive control p-phenylenediamine (PPD). These samples were collected to allow for direct comparisons across platforms, and to optimize the reconstruction of NanoString probe signals on the RNA-seq platform. The paired RNA samples were collected from experiments consistent with the GARDskin protocols [31,55], and did not rely on reference RNA such as the Universal Human Reference RNA (UHRR).
The second major part of the study included 75 additional RNA samples acquired on the RNA-seq platform only. These samples corresponded to 22 different chemical treatments and the GARDskin positive and unstimulated control. The main purpose of these samples was to assess the predictive capacity of gene expression measurements quantified with RNA-seq.
To further strengthen the integrity of the study and of any results obtained, the RNA-seq analysis was performed by an independent contract research organization (CRO). The CRO obtained all the RNA samples encoded, and the sequencing experiments were performed without disclosure of sample identities.

2.2. Chemicals for Exposure Experiments

All chemicals selected for the evaluation of the NGS platform are described in Table 1. All chemicals but 2-nitro-p-phenylenediamine, propyl gallate, and chlorobenzene were purchased from Sigma-Aldrich (MilliporeSigma, Burlington, MA, USA). 2-Nitro-p-phenylenediamine, propyl gallate, and chlorobenzene were purchased from Fisher Scientific (Thermo Fisher Scientific, Waltham, MA, USA). The chemicals included all of GARDskin’s proficiency chemicals (defined in TG442E [32]). The complete dataset was chosen to ensure that a wide potency range was included while also ensuring that non-sensitizers were present. Furthermore, all chemicals were described in annex 2 of guideline 497 [56] and had previously been correctly classified by the GARDskin assay [30,31,57]. The human and LLNA references, including LLNA Median-Like Location Parameter (MLLP) EC3 values and binary calls, were extracted from annex 2 of guideline 497 [56] (where human calls were based on the human predictive patch tests databased described by Strickland et al., 2023 [58]) for all chemicals except chlorobenzene. The LLNA reference for chlorobenzene was obtained from Gerberick et al., 2005 [59] and from the European Chemicals Agency (ECHA) registration dossier (https://echa.europa.eu/substance-information/-/substanceinfo/100.003.299, accessed on 21 October 2025). A consensus reference call was established for every chemical by coalescing human and LLNA data, prioritizing human evidence if the references were different.
Thus, the overall dataset included both sensitizers and non-sensitizers. The sensitizers included chemicals of varying potencies, ranging from very weak to extreme. Furthermore, the selected chemicals included a range of reaction chemistries, including reactivity domain alerts for SN2, SNAr, Schiff base formations, and Michael acceptors [60].

2.3. Cell Exposure Experiments and Total RNA Isolation

All exposure experiments were conducted in compliance with the standard GARDskin assay protocols [55].
The SenzaCell cell line (ATCC Depository PTA-123875) was exposed to each of the chemicals at a concentration that induced 85–95% relative viability while not exceeding a maximum exposure concentration of 500 µM. Chemicals that did not affect the cells’ viability were run at a maximum of 500 µM or the highest soluble concentration. All treatment concentrations are described in Table 1. Cells were incubated with the chemicals for 24 h, after which relative viabilities were reconfirmed, and cells were harvested and lysed in TRIzol (Thermo Fisher Scientific, Waltham, MA, USA), before total RNA was isolated. The quality and the concentration of the isolated total RNA were determined as described below (see the quantification sections) before being used as input into the respective quantification pipelines. The viability measurements were performed using propidium iodide staining and flow cytometry analysis.

2.4. NanoString nCounter Quantification

The NanoString quantification was performed in accordance with the provider’s protocols, as detailed in the GARDskin assay protocols [55]. The quality and the concentration of the RNA samples were determined using Agilent Bioanalyzer 2100 (Agilent Technologies, Santa Clara, CA, USA).

2.5. RNA Sequencing

Total RNA quality was evaluated using Agilent TapeStation 4200 (Agilent Technologies, Santa Clara, CA, USA), and the RNA concentration was estimated using Qubit Flex (Thermo Fisher Scientific, Waltham, MA, USA). The library was prepared using Illumina® Stranded mRNA Prep, Ligation (20040534, Illumina, San Diego, CA, USA) per the provider’s protocols (Illumina Stranded mRNA Prep, Ligation Reference Guide [Document # 1000000124518 v03]). The library’s quality and concentration were assessed using Agilent TapeStation 4200 and Qubit Flex, respectively. Sequencing was performed on a NovaSeq 6000 (Illumina, San Diego, CA, USA), generating paired-end reads of 150 bp, using NovaSeq S2 Reagent Kit (Illumina, San Diego, CA, USA), 300 cycles v1.5.

2.6. Quantification of RNA-Seq Data

The quality of the sequenced data was assessed using fastQC version 0.12.1 (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (accessed on 13 October 2025)). Reads were trimmed for adapter sequences and low-quality calls using fastp version 0.23.4 [49]. Transcripts were quantified using salmon version 1.10.0 [48] with selective alignment enabled [50], using the entire genome as background. The quantification was made against an index constructed from human GENCODE reference, release 47 (GRCh38.p14) [61]. The complete parameter set given to the respective program is detailed in Supplementary Table S1. No pre-defined exclusion criteria were set for the identification of outlier samples. The quality control revealed no samples of questionable quality, and all samples were retained and included in the analysis.

2.7. Mapping Candidate Transcripts to the NanoString GARDskin Prediction Signature

In order to reconstruct the NanoString signal on quantified RNA-seq data, transcripts that were likely targets of the NanoString probes were first identified. Target transcripts were identified by comparing the reference transcriptome’s sequences (GENCODE release 47) to the 100 bp NanoString probes’ binding sequences [39]. The alignments between probe sequences and the reference transcriptome were made using Bowtie 2 version 2.5.4 [62]. NanoString probes that did not obtain any mapped reference transcript were assigned all transcripts that were annotated to the NanoString probe’s target gene. Following this step, each of the 196 NanoString probes had been assigned at least one candidate transcript available for quantification using RNA-seq.

2.8. Refining the RNA-Seq Candidate Transcripts for NanoString Signal Reconstruction

The sets of candidate transcripts identified via sequence alignments were optimized to maximize the resemblance between the two platforms’ signals. The optimization was performed using a backward elimination procedure relying on the 24 paired samples acquired on both platforms (see above).
Firstly, for the optimization, the RNA-seq transcript abundance estimates were expressed in transcripts per million (TPM), as obtained from the salmon quantification step. TPMs were used as it corrects for both library size and for gene lengths. Similarly, individual NanoString samples were normalized using a scaling factor based on the total number of acquired counts [30], as described in Equation (1). This normalization was chosen as it constitutes the first step in the standard GARDskin analysis pipeline, aiming to reduce for variation in library size [30].
y i j = c o u n t i j j = 1 n c o u n t i j
where yij is the normalized expression value for gene j in sample i, and countij is the observed count for gene j in sample i.
Then, each of the 196 genes were processed independently, with the aim of minimizing the difference between NanoString data for an individual probe with the reconstructed signal from RNA-seq. A reconstructed RNA-seq signal was obtained by adding all remaining candidate transcripts’ abundance estimates for a particular sample. The difference between the two platforms were calculated using the mean absolute error, as expressed in Equation (2), of log-transformed and centered abundance estimates (see Equation (2)).
d j = i = 1 n = 24 ( log 2 y i j 1 N k = 1 N log 2 y k j ) ( log 2 z i j 1 N k = 1 N log 2 z k j ) 24
where dj is the difference for gene j between the platforms, yij is the normalized expression value for sample i of gene j quantified on the NanoString nCounter platform, and zij is the reconstructed RNA-seq signal for gene j and sample i.
For the backward elimination procedure, each of the remaining transcript candidates were held out from the signal reconstruction and the difference was recorded. The transcript whose exclusion produced the lowest difference compared to the NanoString data was excluded. This approach was repeated until only one transcript remained. The final set of transcripts used to reconstruct a NanoString probe’s signal was the set of transcripts that produced the overall lowest difference between the platforms.

2.9. Comparison of Expression Levels Between the Platforms

Having defined the transcript sets to reconstruct the GPS on RNA-seq data, the gene expression levels were compared between the platforms. To this end, the data from both platforms were first normalized using a similar strategy, based on the 196 quantified genes. The normalization was a simple scaling strategy, based on size factors calculated for each of the datasets independently.
The normalization was performed as follows: firstly, based on the 24 samples in each dataset, an average expression level was determined for every gene by calculating the geometric mean of its expression across samples (see Equation (3)).
μ j = e 1 N i = 1 N log y i j
where μj is the average expression for gene j, N is the number of samples (N = 24), and yij is the abundance estimate for sample i and gene j. For NanoString, the abundance estimate was the original counts, while it was the reconstructed probe signals based on TPMs for the RNA-seq data.
A scaling factor was then calculated for each sample by comparing its individual expression levels with the dataset’s average. The comparison was made based on a robust geometric mean of ratios. Compared with the standard geometric mean, the Huber M-estimator of location [63] was used to determine the average difference for the individual samples. Equation (4) shows the derivation of the individual samples’ scaling factors.
s f i = e f log y i j μ j
where sfi is the scaling factor for sample i, yij the expression level for sample i and gene j, μj the average expression level for gene j as determined in Equation (3), and f the calculation of Huber M-estimate of location, as implemented in the R-package DescTools version 0.99.47 [64].
Each sample was finally divided by the calculated scaling factor to obtain normalized expression levels for subsequent comparisons. To avoid issues with zeros, all NanoString counts less than an estimate of the limit of detection (LOD) were replaced by the value of the LOD. The LOD for the NanoString platform was estimated as the average across the samples’ median negative control probe counts (LOD = 14). The same strategy was also used for the RNA-seq data, where the LOD was set to a pseudocount of five.
The normalized abundance estimates were compared between the platforms by calculating the geometric average of each gene using the paired replicates of the unstimulated controls (N = 12) for both platforms. The association between the platforms’ estimates was examined visually (scatter plots) and by calculating both linear (Pearson) and non-linear (Spearman’s) correlation coefficients. Deviating abundance estimates between the platforms were identified using standard scores based on the log-transformed ratios between the estimates. The standardization was achieved using robust estimates of location (Huber M-estimator of location) [63] and scale (Qn scale estimator) [65], as implemented in the R-packages DescTools version 0.99.47 [64] and the robustbase version 0.99-0 [66].
The comparison of the effect estimates by the positive control treatment across the platforms was performed using linear regression on log-transformed normalized data. The main term of interest was the interaction effect between treatment and platform. Significantly deviating genes were identified as those with a p-value adjusted for multiple hypothesis testing [67] below 0.05 and a fold change difference of at least 1.5. In addition to these statistics, the association of the effect estimates was characterized by Spearman’s correlation. Furthermore, Lin’s concordance correlation coefficient was calculated as the association and was expected to be the same independent of the platform.

2.10. Generation of GARDskin Predictions

GARDskin predictions were obtained for the 75 samples acquired for assessment of predictive capacity with the complete standard GARDskin analysis [39,55]. The RNA-seq data input comprised the reconstructed signals as described above, which were fed into the standard GARDskin analysis pipeline. The pipeline includes the following steps: normalization using counts per total counts (CPTC; Equation (1)) based on the 196 genes, BARA normalization to reduce differences between batches [39,68], and classification using the already trained SVM. Complete details, including fitted parameter values are described in the Supporting Document to the Genomic Allergen Rapid Detection Test Method for Skin Sensitisation (GARDTMskin), described in Test Guideline 442E [39].
The GARDskin analysis pipeline has been developed and defined in a series of publications [16,30,31]. The 196 genes were identified in a data-driven manner by optimizing the discriminatory ability, characterized by the Kullback–Leibler divergence, of an SVM [16]. The CPTC normalization was discussed and described in Forreryd et al., 2016 [30] as a method for reducing technical variance associated with RNA content and quantification levels (library size). The BARA normalization was implemented to remove batch effects between the GARDskin training set that was used to fit the classification model (SVM) and every subsequently acquired test set. Finally, the classification model is a linear SVM trained on a dataset of 38 chemicals, of which 20 were known non-sensitizers and 18 were well-characterized skin sensitizers [30]. The entire pipeline has undergone extensive validation, including testing of previously unseen test chemicals, a ring-trial study [31], and an independent peer review by the Joint Research Center’s EU Reference Laboratory for alternatives to animal testing (EURL ECVAM) (Scientific Advisory Committee (ESAC)) [69].

2.11. Assessment of Sequencing Depth on GARDskin Classifications

The impact of the sequencing depth on GARDskin classifications was examined using two approaches. The first less computationally expensive method was binomial subsampling on the already quantified expression matrix. The quantified expression matrix was subsampled at levels between 50,000 and 25,000,000 counts. The subsampling was repeated 100 times for every level. The reduced data was fed through the standard GARDskin analysis pipeline as described previously to generate predicted decision values.
The second more computationally expensive approach was to apply the subsampling to the reads in the fastq files. The quantification and prediction pipelines were then performed as already described. Two levels of subsampling were performed and an average of five or ten million reads were retained. The subsampling was performed using fq version 0.12.0 (https://github.com/stjude-rust-labs/fq (accessed on 13 October 2025)). See Supplementary Table S1 for the detailed function call.
The performance of the subsampling was evaluated with respect to the likeness of the predictions obtained on the full dataset, and with respect to the overall classification performance. The deviation from the predictions on the complete dataset was quantified using the root mean squared error (RMSE) to compare decision values, and the prediction performance with respect to reference labels was evaluated using binary cross-entropy. Decision values were transformed with the standard logistic function before the binary cross-entropy was calculated.

2.12. Visualizations

All visualizations were created using R version 4.2.0 [70] and the ggplot2 package version 3.5.2 [71].
The principal components of the RNA-seq data were visualized by aggregating technical replicates using limma version 3.52.0 [72] prior to the singular value decomposition. Aggregated data on log-scale was centered and standardized before the principal components were calculated.

3. Results

3.1. Reconstructing the NanoString Probe Signals from RNA-Seq Data

The GARDskin assay is an existing, scientifically reviewed and validated [30,31], and regulatory-approved [32] assay for hazard assessment of skin sensitizers. The current state of the assay, as described in the OECD TG442E, uses an analysis pipeline and prediction model defined and trained on gene expression data originating from the NanoString nCounter platform. The prospect of using RNA-seq data in the GARDskin assay requires that the assay’s currently defined models remain unaltered and that cross-platform predictions can be accurately made. To evaluate this, RNA-seq data were acquired for 24 different treatments (see Table 1). Paired samples (acquired using both RNA-seq and NanoString) were acquired for the positive control (p-Phenylenediamine; N = 12) and the unstimulated control (N = 12).
The RNA-seq data were first adapted for compatibility with the existing GARDskin model’s components by attempting to reconstruct the signals of the 196 measured genes of the NanoString platform. The target sequences of the NanoString platform were mapped to the transcriptome’s FASTA sequences to identify candidate transcripts for signal reconstruction. All but one of the NanoString target sequences could be mapped to at least one reference transcript. The only unmapped target sequence, corresponding to the gene COX20, was assigned all reference transcripts annotated as belonging to the gene COX20. The sets of candidate transcripts were refined by comparing reconstructed signals with NanoString data for the paired samples, applying a backward elimination procedure to eliminate transcripts from the respective set to maximize the likeness between RNA-seq- and NanoString nCounter data.
Following mapping and refinement, a total of 872 transcripts were used to reconstruct the 196 NanoString probe signals. The transcript identities and their mapping to the NanoString probes are described in Supplementary Table S2. A gene-by-gene comparison between the two platforms, based on the 24 paired samples, are displayed in Figure 1. As seen, the platform renders similar signals for majority of the measurements. Clear linear relationships can be seen between the platforms’ abundance estimates (Figure 1a) as well as a particularly strong correlation between the relative effect estimates (Figure 1b). However, it is also apparent that a few genes deviate more markedly from the overall associations. For the abundance estimate, these genes include SNORA45, five histone cluster genes (HIST1H1C, HIST1H1E, HIST1H3G, HIST1H3J, HIST2H2AA3), and UFC1. For the abundance estimates, the Pearson correlation coefficient was calculated to be 0.66 and the Spearman correlation coefficient was calculated to be 0.69. For the effect estimates, five histone cluster genes (HIST1H1C, HIST1H1E, HIST1H3J, HIST2H2AA3, HIST2H2BF) together with ACER2 were identified as different between the platforms. The Pearson correlation coefficient was estimated to be 0.87, the Spearman correlation was estimated to be 0.95, and Lin’s concordance correlation coefficient was estimated to be 0.87.
The complete RNA-seq dataset, described using the 196 reconstructed probe signals, was visualized using PCA; see Figure 2. As seen, the reconstructed signals from RNA-seq data captures information relevant for discerning the sensitizers from the non-sensitizers. Indeed, the largest variance component allows for a complete separation between the sensitizing chemicals and the non-sensitizing ones.

3.2. Classifying Sensitizing Hazard with RNA-Seq Data

Hazard classifications based on RNA-seq data were then considered. The RNA-seq data for the 75 samples acquired for assessing predictive capacity comprised the reconstructed probe signals that were fed directly into the existing GARDskin analysis pipeline without making any modifications to it, i.e., all steps were performed as described in the GARDskin protocols with identical parameter values (as defined in the Supporting Document to the GARDskin assay [39]). Classifications and predicted decision values are visualized in Figure 3. As seen, all chemicals were correctly classified. Thus, on this dataset, the accuracy, sensitivity, and specificity were all 100%. Considering the decision values, the overall standard deviation between replicates was estimated to 0.64.

3.3. Assessment of Sequencing Depth

The sequencing depth required for making accurate GARDskin classifications was examined as an adjustable parameter in RNA-seq experiments. The effect of the sequencing depth was tested by subsampling the acquired RNA-seq data. Binomial subsampling was first used on the already quantified expression matrix, and the classification performance versus the number of quantified reads are visualized in Figure 4. As seen, the deterioration of the performance only becomes obvious at the lower sequencing depths, when ≤1 million counts were obtained.
From these observations, it was concluded that an average of 5 to 10 million reads per sample should be sufficient to maintain stable classification performance. The performance at both of these sequencing depths (5 M and 10 M) were verified by subsampling the reads already from the fastq files, and by rerunning the entire quantification and classification pipeline. The results were very similar to the data obtained by the binomial subsampling on the quantified reads. Indeed, the RMSE was estimated to be 0.153 and 0.116, and the cross-entropy was estimated to be 0.194 and 0.197 for the five- and ten-million count datasets, respectively.

4. Discussion

The development of non-animal methods (NAMs) for hazard assessment is continuously evolving, relying on technical advancement to make methods more capable of accurately and efficiently detecting and characterizing hazards. The assessment of skin sensitizers is one subfield where the progression of NAMs has led to the regulatory adoption of multiple individual assays [32,73,74], defined approaches (DAs), and integrated testing strategies (ITSs) [27]. The individual OECD-adopted assays currently include 10 test methods, with each described in a test guideline based on the understanding of the assay’s detection mechanism, in relation to the KEs of the AOP for skin sensitization. The GARDskin assay monitors dendritic cell activation by measuring changes in gene expression induced by test items, which include both neat chemicals and more complex formulations and solid materials [31,34,36,37].
The GARDskin biomarker signature (GPS) consists of 196 genes, which were identified and refined by exploratory analyses on whole transcriptome data [16]. The current testing protocol describes that gene level quantification should be performed using NanoString nCounter technologies. While it is an efficient platform for targeted acquisition, the ability to use gene expression levels from alternative quantification platforms could be beneficial. For example, technologies such as RNA-seq, which evolved rapidly during the last decades [75], have a wider user base and larger laboratory implementation compared with the NanoString nCounter platform. In addition, RNA-seq produces whole transcriptome data, offering interesting avenues for secondary analyses.
We have, therefore, explored the possibility of using gene expression data quantified with RNA-seq for GARDskin hazard classifications as an alternative technique to the currently supported NanoString nCounter platform. A total of 24 samples were acquired on both platforms as paired samples, and an addition set of 75 samples were acquired on the RNA-seq platform for assessment of predictive capacity. Using the paired samples, the acquired abundance estimates as well as the relative treatment effects were compared between the platforms. The abundance estimates showed a linear moderate or strong association between the platforms, as demonstrated in Figure 1a and as indicated by the Pearson correlation coefficient of 0.67. However, a few genes deviated more markedly from the observed relationship. The largest cross-platform differences, relative to the observed relationship, were seen for SNORA45, five histone cluster genes, and UFC1. The SNORA45 target is a small nucleolar RNA (snoRNA), and the library preparation method of the RNA-seq experiment should explain the discrepancy, as the poly-A selection may not effectively capture them [76,77]. A similar explanation can likely also be attributed to the observed differences in the histone cluster genes, as they too lack the poly-A tail [78]. No definitive reason explaining the observed difference for the remaining UFC1 gene has been identified.
While correlated abundance estimates would facilitate cross-platform predictions, potential differences could likely be mitigated if the relative effect differences between the platforms were similar. Figure 1b shows the estimated effects of the assay’s positive control, PPD, for both platforms. As seen, the estimates are strongly and linearly correlated. Furthermore, the estimates were very similar for most measurements, which can be observed in the graph as the datapoints are localized along the identity line (i.e., identical estimates for both platforms). However, again, a few genes showed a significantly different effect estimate. These corresponded to five histone cluster genes, which have already been discussed, and the ACER2 gene. The ACER2 gene is expressed barely above the limit of detection, and it is possible that the difference is linked to the assay’s dynamic range. It is to be noted that the effect size induced by PPD is estimated as relatively higher in the RNA-seq data compared with the NanoString platform. Taken together, and of great importance in relation to the context of the work, while small cross-platform discrepancies were observed, they have seemingly little to no effect on endpoint classifications, as derived from the GARDskin analysis pipeline, as further discussed below. The overall observations are also in line with previous comparisons between the two platforms, where strong agreements have been reported [53,79].
Having considered the performance of the individual genes, attention was given to the larger RNA-seq dataset and the chemicals acquired for assessment of predictive capacity. Figure 2 shows the PCA of the RNA-seq dataset based on the 196 target genes for the 24 treatments. As seen, the GARDskin prediction signature is capable of efficiently separating the skin-sensitizing chemicals from the non-sensitizing ones. Indeed, the first principal component achieves a complete separation between the classes. This strong separability further validates the biological relevance of the genes in the prediction signature and emphasizes that the variance is not dependent on specific quantification technologies.
The ability to generate accurate GARDskin classifications with RNA-seq data was then examined. Given the regulatory status of the assay, where the analysis pipeline and all of the fitted parameters have been explicitly defined, predictions and classifications were obtained by feeding the RNA-seq abundance estimates into the existing pipeline. Obtained predictions and classifications are displayed in Figure 3. As seen, all chemicals were correctly classified. This is also in complete concordance with historical results based on the standard protocols with NanoString quantification. Furthermore, the magnitudes of the decision values were very similar to what is routinely observed in the standard assay, and the variability between the replicates was also in line with expectations [27]. Thus, all observations based on the predictions indicate that the results were very similar to those of a typical testing scenario. No apparent deviations or biases could be identified based on the predicted decision values, which further corroborates the findings that quantification levels across the platforms were overall similar. It also indicates that the existing GARDskin analysis pipeline, which already includes steps designed to handle batch effects between a training set and a test set, is sufficiently robust to also handle differences identified for individual probes, as noted previously.
The final aspect that was considered concerned the required sequencing depth. The sequencing depth can be varied depending on requirements, and there is a natural trade-off between cost-effectiveness and data accuracy and reliability [80]. Based on subsampled data, GARDskin classifications could be effectively established from as little as five million quantified reads per sample.

5. Conclusions

GARDskin data quantified with the NanoString nCounter platform was compared with data from RNA-seq experiments. The results showed that the two platforms produced very similar estimates for majority of the genes. It was shown that gene expression levels quantified with RNA-seq could be used in the GARDskin assay to derive accurate hazard classifications that corresponded completely with the standard assay’s outcomes. The characteristics of the predictions were very similar to those observed in the standard assay, as currently described, suggesting that RNA-seq may be used as a gene expression quantification method to derive hazard classifications with the GARDskin assay. Indeed, given these results, the acquisition platforms could be considered interchangeably in a regulatory context for hazard assessment with GARDskin, following relevant approval procedures. Potential future research avenues could focus on the standardization of the RNAseq platform protocol and definition of quality criteria which would promote a more streamlined adoption in the GARDskin assay.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/toxics14010009/s1; Table S1: programs, versions, and parameters used for RNA-seq data processing; Table S2: transcript IDs used to reconstruct the NanoString signals on the RNA-seq platform.

Author Contributions

Conceptualization, R.G. and H.J.; methodology, R.G. and H.J.; software, R.G.; validation, R.G., J.A. and H.J.; formal analysis, R.G. and J.A.; investigation, R.G., J.A., A.F. and H.J.; resources, H.J.; data curation, R.G. and J.A.; writing—original draft preparation, R.G.; writing—review and editing, R.G., J.A., A.F. and H.J.; visualization, R.G.; supervision, H.J.; project administration, H.J. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets generated during and analyzed during the current study are available from the corresponding author on reasonable request.

Acknowledgments

The authors would like to acknowledge Clinical Genomics Lund, SciLifeLab and Center for Translational Genomics (CTG), Lund University for providing expertise and service with sequencing and analysis.

Conflicts of Interest

R.G., J.A., A.F. and H.J. are employees of Senzagen AB, which offers the GARD™ platform assays as commercial services.

Abbreviations

The following abbreviations are used in this manuscript:
AOPAdverse Outcome Pathway
CASRNChemical Abstracts Service Registry Number
CPTCCounts-per-total-counts
EC3The effective concentration that induces a stimulation index of three in the LLNA assay
GPSGARDskin Prediction Signature
HDSGHuman Data Sub-Group
KEKey Event
LFCLog Fold Change
LLNALocal Lymph Node Assay
MLLPMedian-Like Location Parameter
NGSNext Generation Sequencing
PCAPrincipal Component Analysis
RMSERoot Mean Squared Error
SVMSupport Vector Machine
TPMTranscripts per Million

References

  1. Kimber, I.; Basketter, D.A.; Gerberick, G.F.; Ryan, C.A.; Dearman, R.J. Chemical Allergy: Translating Biology into Hazard Characterization. Toxicol. Sci. 2011, 120, S238–S268. [Google Scholar] [CrossRef]
  2. OECD. The Adverse Outcome Pathway for Skin Sensitisation Initiated by Covalent Binding to Proteins; OECD Series on Testing and Assessment; OECD: Paris, France, 2014. [Google Scholar] [CrossRef]
  3. Aleksic, M.; Rajagopal, R.; de-Ávila, R.; Spriggs, S.; Gilmour, N. The Skin Sensitization Adverse Outcome Pathway: Exploring the Role of Mechanistic Understanding for Higher Tier Risk Assessment. Crit. Rev. Toxicol. 2024, 54, 69–91. [Google Scholar] [CrossRef] [PubMed]
  4. Peiser, M.; Tralau, T.; Heidler, J.; Api, A.M.; Arts, J.H.E.; Basketter, D.A.; English, J.; Diepgen, T.L.; Fuhlbrigge, R.C.; Gaspari, A.A.; et al. Allergic Contact Dermatitis: Epidemiology, Molecular Mechanisms, in Vitro Methods and Regulatory Aspects. Current Knowledge Assembled at an International Workshop at BfR, Germany. Cell. Mol. Life Sci. 2012, 69, 763–781. [Google Scholar] [CrossRef] [PubMed]
  5. Esser, P.R.; Martin, S.F. Pathomechanisms of Contact Sensitization. Curr. Allergy Asthma Rep. 2017, 17, 83. [Google Scholar] [CrossRef]
  6. Buehler, E.V. Delayed Contact Hypersensitivity In the Guinea Pig. Arch. Dermatol. 1965, 91, 171–175. [Google Scholar] [CrossRef]
  7. Magnusson, B.; Kligman, A.M. The Identification of Contact Allergens by Animal Assay. The Guinea Pig Maximization Test. J. Investig. Dermatol. 1969, 52, 268–276. [Google Scholar] [CrossRef]
  8. Ritz, H.; Buehler, E. Planning, Conduct and Interpretation of Guinea Pig Sensitization Patch Tests. In Current Concepts in Cutaneous Toxicity; Academic Press: New York, NY, USA, 1980; pp. 25–40. [Google Scholar]
  9. Kimber, I.; Weisenberger, C. A Murine Local Lymph Node Assay for the Identification of Contact Allergens. Arch. Toxicol. 1989, 63, 274–282. [Google Scholar] [CrossRef] [PubMed]
  10. Robinson, M.K.; Nusair, T.L.; Robert Fletcher, E.; Ritz, H.L. A Review of the Buehler Guinea Pig Skin Sensitization Test and Its Use in a Risk Assessment Process for Human Skin Sensitization. Toxicology 1990, 61, 91–107. [Google Scholar] [CrossRef]
  11. Kimber, I.; Dearman, R.J.; Basketter, D.A.; Ryan, C.A.; Gerberick, G.F. The Local Lymph Node Assay: Past, Present and Future. Contact Dermat. 2002, 47, 315–328. [Google Scholar] [CrossRef]
  12. Gerberick, G.F. Development of a Peptide Reactivity Assay for Screening Contact Allergens. Toxicol. Sci. 2004, 81, 332–343. [Google Scholar] [CrossRef]
  13. Fujita, M.; Yamamoto, Y.; Tahara, H.; Kasahara, T.; Jimbo, Y.; Hioki, T. Development of a Prediction Method for Skin Sensitization Using Novel Cysteine and Lysine Derivatives. J. Pharmacol. Toxicol. Methods 2014, 70, 94–105. [Google Scholar] [CrossRef] [PubMed]
  14. Natsch, A.; Haupt, T.; Wareing, B.; Landsiedel, R.; Kolle, S.N. Predictivity of the Kinetic Direct Peptide Reactivity Assay (kDPRA) for Sensitizer Potency Assessment and GHS Subclassification. Altex-Altern. Anim. Exp. 2020, 37, 652–664. [Google Scholar] [CrossRef]
  15. Ashikaga, T.; Yoshida, Y.; Hirota, M.; Yoneyama, K.; Itagaki, H.; Sakaguchi, H.; Miyazawa, M.; Ito, Y.; Suzuki, H.; Toyoda, H. Development of an in Vitro Skin Sensitization Test Using Human Cell Lines: The Human Cell Line Activation Test (h-CLAT): I. Optimization of the h-CLAT Protocol. Toxicol. Vitr. Int. J. Publ. Assoc. BIBRA 2006, 20, 767–773. [Google Scholar] [CrossRef]
  16. Johansson, H.; Lindstedt, M.; Albrekt, A.-S.; Borrebaeck, C.A.K. A Genomic Biomarker Signature Can Predict Skin Sensitizers Using a Cell-Based in Vitro Alternative to Animal Tests. BMC Genom. 2011, 12, 399. [Google Scholar] [CrossRef]
  17. Natsch, A.; Bauch, C.; Foertsch, L.; Gerberick, F.; Norman, K.; Hilberer, A.; Inglis, H.; Landsiedel, R.; Onken, S.; Reuter, H.; et al. The Intra- and Inter-Laboratory Reproducibility and Predictivity of the KeratinoSens Assay to Predict Skin Sensitizers in Vitro: Results of a Ring-Study in Five Laboratories. Toxicol. Vitr. 2011, 25, 733–744. [Google Scholar] [CrossRef]
  18. Takahashi, T.; Kimura, Y.; Saito, R.; Nakajima, Y.; Ohmiya, Y.; Yamasaki, K.; Aiba, S. An In Vitro Test to Screen Skin Sensitizers Using a Stable THP-1–Derived IL-8 Reporter Cell Line, THP-G8. Toxicol. Sci. 2011, 124, 359–369. [Google Scholar] [CrossRef]
  19. Saito, K.; Nukada, Y.; Takenouchi, O.; Miyazawa, M.; Sakaguchi, H.; Nishiyama, N. Development of a New in Vitro Skin Sensitization Assay (Epidermal Sensitization Assay; EpiSensA) Using Reconstructed Human Epidermis. Toxicol. Vitr. Int. J. Publ. Assoc. BIBRA 2013, 27, 2213–2224. [Google Scholar] [CrossRef]
  20. Ramirez, T.; Mehling, A.; Kolle, S.N.; Wruck, C.J.; Teubner, W.; Eltze, T.; Aumann, A.; Urbisch, D.; van Ravenzwaay, B.; Landsiedel, R. LuSens: A Keratinocyte Based ARE Reporter Gene Assay for Use in Integrated Testing Strategies for Skin Sensitization Hazard Identification. Toxicol. Vitr. Int. J. Publ. Assoc. BIBRA 2014, 28, 1482–1497. [Google Scholar] [CrossRef]
  21. Piroird, C.; Ovigne, J.-M.; Rousset, F.; Martinozzi-Teissier, S.; Gomes, C.; Cotovio, J.; Alépée, N. The Myeloid U937 Skin Sensitization Test (U-SENS) Addresses the Activation of Dendritic Cell Event in the Adverse Outcome Pathway for Skin Sensitization. Toxicol. Vitr. Int. J. Publ. Assoc. BIBRA 2015, 29, 901–916. [Google Scholar] [CrossRef]
  22. Wilm, A.; Kühnl, J.; Kirchmair, J. Computational Approaches for Skin Sensitization Prediction. Crit. Rev. Toxicol. 2018, 48, 738–760. [Google Scholar] [CrossRef]
  23. Gradin, R.; Tourneix, F.; Mattson, U.; Andersson, J.; Amaral, F.; Forreryd, A.; Alépée, N.; Johansson, H. In Vitro Prediction of Skin-Sensitizing Potency Using the GARDskin Dose–Response Assay: A Simple Regression Approach. Toxics 2024, 12, 626. [Google Scholar] [CrossRef]
  24. Nepal, R.U.; Jeong, T.C. A Rapid Spectrophotometric Test for Assessing Skin Sensitization Potential of Chemicals by Using N-Acetyl-L-Cysteine Methyl Ester in Chemico. J. Toxicol. Environ. Health Part A 2024, 87, 811–823. [Google Scholar] [CrossRef]
  25. Scheufen Tieghi, R.; Moreira-Filho, J.T.; Martin, H.-J.; Wellnitz, J.; Otoch, M.C.; Rath, M.; Tropsha, A.; Muratov, E.N.; Kleinstreuer, N. A Novel Machine Learning Model and a Web Portal for Predicting the Human Skin Sensitization Effects of Chemical Agents. Toxics 2024, 12, 803. [Google Scholar] [CrossRef]
  26. Tourneix, F.; Carron, L.; Jouffe, L.; Hoffmann, S.; Alépée, N. Deriving a Continuous Point of Departure for Skin Sensitization Risk Assessment Using a Bayesian Network Model. Toxics 2024, 12, 536. [Google Scholar] [CrossRef] [PubMed]
  27. OECD. Guideline No. 497: Defined Approaches on Skin Sensitisation; OECD Guidelines for the Testing of Chemicals, Section 4; OECD Publishing: Paris, France, 2025. [Google Scholar] [CrossRef]
  28. Reinke, E.N.; Reynolds, J.; Gilmour, N.; Reynolds, G.; Strickland, J.; Germolec, D.; Allen, D.G.; Maxwell, G.; Kleinstreuer, N.C. The Skin Allergy Risk Assessment-Integrated Chemical Environment (SARA-ICE) Defined Approach to Derive Points of Departure for Skin Sensitization. Curr. Res. Toxicol. 2025, 8, 100205. [Google Scholar] [CrossRef] [PubMed]
  29. Suzuki, S.; Mizumachi, H.; Miyazawa, M. Skin Sensitization Potency Prediction Based on Read-Across (RAx) Incorporating RhE-Based Testing Strategy (RTSv1)-Defined Approach: RTSv1-Based RAx. J. Appl. Toxicol. JAT 2025, 45, 620–635. [Google Scholar] [CrossRef] [PubMed]
  30. Forreryd, A.; Zeller, K.S.; Lindberg, T.; Johansson, H.; Lindstedt, M. From Genome-Wide Arrays to Tailor-Made Biomarker Readout—Progress towards Routine Analysis of Skin Sensitizing Chemicals with GARD. Toxicol. Vitr. Int. J. Publ. Assoc. BIBRA 2016, 37, 178–188. [Google Scholar] [CrossRef]
  31. Johansson, H.; Gradin, R.; Johansson, A.; Adriaens, E.; Edwards, A.; Zuckerstätter, V.; Jerre, A.; Burleson, F.; Gehrke, H.; Roggen, E.L. Validation of the GARDTMskin Assay for Assessment of Chemical Skin Sensitizers: Ring Trial Results of Predictive Performance and Reproducibility. Toxicol. Sci. 2019, 170, 374–381. [Google Scholar] [CrossRef]
  32. OECD. Test No. 442E: In Vitro Skin Sensitisation. 2023. [CrossRef]
  33. Albrekt, A.-S.; Johansson, H.; Börje, A.; Borrebaeck, C.; Lindstedt, M. Skin Sensitizers Differentially Regulate Signaling Pathways in MUTZ-3 Cells in Relation to Their Individual Potency. BMC Pharmacol. Toxicol. 2014, 15, 5. [Google Scholar] [CrossRef]
  34. Forreryd, A.; Gradin, R.; Humfrey, C.; Sweet, L.; Johansson, H. Exploration of the GARDTMskin Applicability Domain: Indirectly Acting Haptens, Hydrophobic Substances and UVCBs. Altex-Altern. Anim. Exp. 2023, 40, 53–60. [Google Scholar] [CrossRef]
  35. Forreryd, A.; Gradin, R.; Larne, O.; Rajapakse, N.; Deag, E.; Johansson, H. The GARDTMskin Assay: Investigation of the Applicability Domain for Metals. Altex-Altern. Anim. Exp. 2023, 40, 425–438. [Google Scholar] [CrossRef]
  36. Corvaro, M.; Henriquez, J.; Settivari, R.; Mattson, U.; Forreryd, A.; Gradin, R.; Johansson, H.; Gehen, S. GARDTMskin and GARDTMpotency: A Proof-of-Concept Study Investigating Applicability Domain for Agrochemical Formulations. Regul. Toxicol. Pharmacol. RTP 2024, 148, 105595. [Google Scholar] [CrossRef] [PubMed]
  37. Jenvert, R.-M.; Larne, O.; Johansson, A.; Berglin, M.; Pedersen, E.; Johansson, H. Evaluation of the Applicability of GARDskin to Predict Skin Sensitizers in Extracts from Medical Device Materials. Front. Toxicol. 2024, 6, 1320367. [Google Scholar] [CrossRef]
  38. Gradin, R.; Forreryd, A.; Mattson, U.; Jerre, A.; Johansson, H. Quantitative Assessment of Sensitizing Potency Using a Dose–Response Adaptation of GARDskin. Sci. Rep. 2021, 11, 18904. [Google Scholar] [CrossRef]
  39. OECD. Supporting Document to the Genomic Allergen Rapid Detection Test Method for Skin Sensitisation (Gardtmskin), Described in Test Guideline 442E. 2023. Available online: https://one.oecd.org/document/ENV/CBC/MONO(2022)13/en/pdf (accessed on 19 September 2025).
  40. Forreryd, A.; Johansson, H.; Albrekt, A.-S.; Lindstedt, M. Evaluation of High Throughput Gene Expression Platforms Using a Genomic Biomarker Signature for Prediction of Skin Sensitization. BMC Genom. 2014, 15, 379. [Google Scholar] [CrossRef]
  41. Geiss, G.K.; Bumgarner, R.E.; Birditt, B.; Dahl, T.; Dowidar, N.; Dunaway, D.L.; Fell, H.P.; Ferree, S.; George, R.D.; Grogan, T.; et al. Direct Multiplexed Measurement of Gene Expression with Color-Coded Probe Pairs. Nat. Biotechnol. 2008, 26, 317–325. [Google Scholar] [CrossRef]
  42. Li, H.-H.; Chen, R.; Hyduke, D.R.; Williams, A.; Frötschl, R.; Ellinger-Ziegelbauer, H.; O’Lone, R.; Yauk, C.L.; Aubrecht, J.; Fornace, A.J. Development and Validation of a High-Throughput Transcriptomic Biomarker to Address 21st Century Genetic Toxicology Needs. Proc. Natl. Acad. Sci. USA 2017, 114, E10881–E10889. [Google Scholar] [CrossRef] [PubMed]
  43. Chen, R. A High-Throughput and Highly Automated Genotoxicity Screening Assay. Altex 2021, 39, 71–81. [Google Scholar] [CrossRef] [PubMed]
  44. Phillips, K.A.; Douglas, M.P.; Wordsworth, S.; Buchanan, J.; Marshall, D.A. Availability and Funding of Clinical Genomic Sequencing Globally. BMJ Glob. Health 2021, 6, e004415. [Google Scholar] [CrossRef]
  45. Sarantopoulou, D.; Tang, S.Y.; Ricciotti, E.; Lahens, N.F.; Lekkas, D.; Schug, J.; Guo, X.S.; Paschos, G.K.; FitzGerald, G.A.; Pack, A.I.; et al. Comparative Evaluation of RNA-Seq Library Preparation Methods for Strand-Specificity and Low Input. Sci. Rep. 2019, 9, 13477. [Google Scholar] [CrossRef]
  46. Ura, H.; Togi, S.; Niida, Y. A Comparison of mRNA Sequencing (RNA-Seq) Library Preparation Methods for Transcriptome Analysis. BMC Genom. 2022, 23, 303. [Google Scholar] [CrossRef]
  47. Bray, N.L.; Pimentel, H.; Melsted, P.; Pachter, L. Near-Optimal Probabilistic RNA-Seq Quantification. Nat. Biotechnol. 2016, 34, 525–527. [Google Scholar] [CrossRef]
  48. Patro, R.; Duggal, G.; Love, M.I.; Irizarry, R.A.; Kingsford, C. Salmon Provides Fast and Bias-Aware Quantification of Transcript Expression. Nat. Methods 2017, 14, 417–419. [Google Scholar] [CrossRef]
  49. Chen, S.; Zhou, Y.; Chen, Y.; Gu, J. Fastp: An Ultra-Fast All-in-One FASTQ Preprocessor. Bioinformatics 2018, 34, i884–i890. [Google Scholar] [CrossRef]
  50. Srivastava, A.; Malik, L.; Sarkar, H.; Zakeri, M.; Almodaresi, F.; Soneson, C.; Love, M.I.; Kingsford, C.; Patro, R. Alignment and Mapping Methodology Influence Transcript Abundance Estimation. Genome Biol. 2020, 21, 239. [Google Scholar] [CrossRef] [PubMed]
  51. Ewels, P.A.; Peltzer, A.; Fillinger, S.; Patel, H.; Alneberg, J.; Wilm, A.; Garcia, M.U.; Di Tommaso, P.; Nahnsen, S. The Nf-Core Framework for Community-Curated Bioinformatics Pipelines. Nat. Biotechnol. 2020, 38, 276–278. [Google Scholar] [CrossRef] [PubMed]
  52. Verheijen, M.C.; Meier, M.J.; Asensio, J.O.; Gant, T.W.; Tong, W.; Yauk, C.L.; Caiment, F. R-ODAF: Omics Data Analysis Framework for Regulatory Application. Regul. Toxicol. Pharmacol. 2022, 131, 105143. [Google Scholar] [CrossRef] [PubMed]
  53. Rezapour, M.; Narayanan, A.; Mowery, W.H.; Gurcan, M.N. Assessing Concordance between RNA-Seq and NanoString Technologies in Ebola-Infected Nonhuman Primates Using Machine Learning. BMC Genom. 2025, 26, 358. [Google Scholar] [CrossRef]
  54. Chen, J.-W.; Shrestha, L.; Green, G.; Leier, A.; Marquez-Lago, T.T. The Hitchhikers’ Guide to RNA Sequencing and Functional Analysis. Brief. Bioinform. 2023, 24, bbac529. [Google Scholar] [CrossRef]
  55. EURL ECVAM. GARDskin Assay Protocol. Available online: https://tsar.jrc.ec.europa.eu/system/files/Published/GARDskin%20Assay%20Protocol%20TSAR.pdf (accessed on 19 September 2025).
  56. OECD. Supporting Document to the OECD Guideline 497 on Defined Approaches for Skin Sensitisation. 2023. Available online: https://one.oecd.org/document/ENV/CBC/MONO(2025)2/en/pdf (accessed on 19 September 2025).
  57. Johansson, H.; Gradin, R.; Forreryd, A.; Agemark, M.; Zeller, K.; Johansson, A.; Larne, O.; van Vliet, E.; Borrebaeck, C.; Lindstedt, M. Evaluation of the GARD Assay in a Blind Cosmetics Europe Study. Altex-Altern. Anim. Exp. 2017, 34, 515–523. [Google Scholar] [CrossRef]
  58. Strickland, J.; Abedini, J.; Allen, D.G.; Gordon, J.; Hull, V.; Kleinstreuer, N.C.; Ko, H.-S.; Matheson, J.; Thierse, H.-J.; Truax, J.; et al. A Database of Human Predictive Patch Test Data for Skin Sensitization. Arch. Toxicol. 2023, 97, 2825–2837. [Google Scholar] [CrossRef]
  59. Gerberick, G.F.; Ryan, C.A.; Kern, P.S.; Schlatter, H.; Dearman, R.J.; Kimber, I.; Patlewicz, G.Y.; Basketter, D.A. Compilation of Historical Local Lymph Node Data for Evaluation of Skin Sensitization Alternative Methods. Dermat. Contact Atopic Occup. Drug 2005, 16, 157–202. [Google Scholar]
  60. Enoch, S.J.; Madden, J.C.; Cronin, M.T.D. Identification of Mechanisms of Toxic Action for Skin Sensitisation Using a SMARTS Pattern Based Approach. SAR QSAR Environ. Res. 2008, 19, 555–578. [Google Scholar] [CrossRef]
  61. Mudge, J.M.; Carbonell-Sala, S.; Diekhans, M.; Martinez, J.G.; Hunt, T.; Jungreis, I.; Loveland, J.E.; Arnan, C.; Barnes, I.; Bennett, R.; et al. GENCODE 2025: Reference Gene Annotation for Human and Mouse. Nucleic Acids Res. 2025, 53, D966–D975. [Google Scholar] [CrossRef]
  62. Langmead, B.; Salzberg, S.L. Fast Gapped-Read Alignment with Bowtie 2. Nat. Methods 2012, 9, 357–359. [Google Scholar] [CrossRef] [PubMed]
  63. Huber, P.J. Robust Statistics, 1st ed.; Wiley Series in Probability and Statistics; Wiley: Hoboken, NJ, USA, 1981. [Google Scholar] [CrossRef]
  64. Signorell, A. DescTools: Tools for Descriptive Statistics. 2023. Available online: https://andrisignorell.github.io/DescTools/ (accessed on 25 September 2025).
  65. Rousseeuw, P.J.; Croux, C. Alternatives to the Median Absolute Deviation. J. Am. Stat. Assoc. 1993, 88, 1273–1283. [Google Scholar] [CrossRef]
  66. Maechler, M.; Rousseeuw, P.; Croux, C.; Todorov, V.; Ruckstuhl, A.; Saliban-Barrera, M.; Verbeke, T.; Koller, M.; Conceicao, E.L.T.; Anna di Palma, M. Robustbase: Basic Robust Statistics. 2023. Available online: http://robustbase.r-forge.r-project.org/ (accessed on 19 September 2025).
  67. Benjamini, Y.; Yekutieli, D. The Control of the False Discovery Rate in Multiple Testing under Dependency. Ann. Stat. 2001, 29, 1165–1188. [Google Scholar] [CrossRef]
  68. Gradin, R.; Lindstedt, M.; Johansson, H. Batch Adjustment by Reference Alignment (BARA): Improved Prediction Performance in Biological Test Sets with Batch Effects. PLoS ONE 2019, 14, e0212669. [Google Scholar] [CrossRef]
  69. European Commission. Joint Research Centre. ESAC Opinion on the Scientific Validity of the GARDskin and GARDpotency Test Methods; Publications Office-LU: Luxembourg, 2021. [Google Scholar]
  70. R Core Team. R: A Language and Environment for Statistical Computing. 2022. Available online: https://www.R-project.org/ (accessed on 19 September 2025).
  71. Wickham, H. Ggplot2, 2nd ed.; Use R! Springer International Publishing: Basel, Switzerland, 2016. [Google Scholar]
  72. Ritchie, M.E.; Phipson, B.; Wu, D.; Hu, Y.; Law, C.W.; Shi, W.; Smyth, G.K. Limma Powers Differential Expression Analyses for RNA-Sequencing and Microarray Studies. Nucleic Acids Res. 2015, 43, e47. [Google Scholar] [CrossRef] [PubMed]
  73. OECD. Test No. 442C: In Chemico Skin Sensitisation. 2023. [CrossRef]
  74. OECD. Test No. 442D: In Vitro Skin Sensitisation. 2022. [CrossRef]
  75. Satam, H.; Joshi, K.; Mangrolia, U.; Waghoo, S.; Zaidi, G.; Rawool, S.; Thakare, R.P.; Banday, S.; Mishra, A.K.; Das, G.; et al. Next-Generation Sequencing Technology: Current Trends and Advancements. Biology 2023, 12, 997. [Google Scholar] [CrossRef]
  76. Kufel, J.; Grzechnik, P. Small Nucleolar RNAs Tell a Different Tale. Trends Genet. 2019, 35, 104–117. [Google Scholar] [CrossRef]
  77. Dahlgren, A.R.; Scott, E.Y.; Mansour, T.; Hales, E.N.; Ross, P.J.; Kalbfleisch, T.S.; MacLeod, J.N.; Petersen, J.L.; Bellone, R.R.; Finno, C.J. Comparison of Poly-A+ Selection and rRNA Depletion in Detection of lncRNA in Two Equine Tissues Using RNA-Seq. Non-Coding RNA 2020, 6, 32. [Google Scholar] [CrossRef]
  78. Dávila López, M.; Samuelsson, T. Early Evolution of Histone mRNA 3′ End Processing. RNA 2008, 14, 1–10. [Google Scholar] [CrossRef] [PubMed]
  79. Speranza, E.; Altamura, L.A.; Kulcsar, K.; Bixler, S.L.; Rossi, C.A.; Schoepp, R.J.; Nagle, E.; Aguilar, W.; Douglas, C.E.; Delp, K.L.; et al. Comparison of Transcriptomic Platforms for Analysis of Whole Blood from Ebola-Infected Cynomolgus Macaques. Sci. Rep. 2017, 7, 14756. [Google Scholar] [CrossRef] [PubMed]
  80. Sims, D.; Sudbery, I.; Ilott, N.E.; Heger, A.; Ponting, C.P. Sequencing Depth and Coverage: Key Considerations in Genomic Analyses. Nat. Rev. Genet. 2014, 15, 121–132. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Comparison of quantified expression profiles between the platforms. Panel (a) shows average expression levels of the unstimulated control samples. Panel (b) compares the estimated effects induced by the positive control relative to the unstimulated control expressed in log2 fold changes (LFCs).
Figure 1. Comparison of quantified expression profiles between the platforms. Panel (a) shows average expression levels of the unstimulated control samples. Panel (b) compares the estimated effects induced by the positive control relative to the unstimulated control expressed in log2 fold changes (LFCs).
Toxics 14 00009 g001
Figure 2. Visualization of the two first principal components of the RNA-seq dataset. Points are colored with respect to the consensus reference sensitizing category. Each point represents a distinct chemical. Abbreviations: NS: non-sensitizer; S: sensitizer.
Figure 2. Visualization of the two first principal components of the RNA-seq dataset. Points are colored with respect to the consensus reference sensitizing category. Each point represents a distinct chemical. Abbreviations: NS: non-sensitizer; S: sensitizer.
Toxics 14 00009 g002
Figure 3. Graphical representation of GARDskin classifications. Points indicate predicted decision values for individual replicates and crossbars describe the mean decision values. Abbreviations: NS: non-sensitizer; S: sensitizer.
Figure 3. Graphical representation of GARDskin classifications. Points indicate predicted decision values for individual replicates and crossbars describe the mean decision values. Abbreviations: NS: non-sensitizer; S: sensitizer.
Toxics 14 00009 g003
Figure 4. Classification performance of subsampled data. Panel (a) shows the root mean squared errors (RMSEs), comparing decision values of subsampled data with decision values from the complete dataset. Panel (b) shows the cross-entropy (log loss), comparing classifications with reference labels. The points describe the average value observed across the simulations and the error bars extend one standard deviation from the mean.
Figure 4. Classification performance of subsampled data. Panel (a) shows the root mean squared errors (RMSEs), comparing decision values of subsampled data with decision values from the complete dataset. Panel (b) shows the cross-entropy (log loss), comparing classifications with reference labels. The points describe the average value observed across the simulations and the error bars extend one standard deviation from the mean.
Toxics 14 00009 g004
Table 1. Description of assayed chemicals.
Table 1. Description of assayed chemicals.
ChemicalCASRNInput ConcentrationLLNA MLLP (%)LLNA CallHDSG CallConsensus Reference CallN Replicates
4-Nitrobenzyl Bromide100-11-82 µM0.051-13
1-Chloro-2,4-dinitrobenzene97-00-74 µM0.0541113
p-Phenylenediamine (PPD)106-50-375 µM0.1111116 (12 + 4)
2-Nitro-p-phenylenediamine5307-14-2400 µM0.41-13
2-Aminophenol95-55-650 µM0.451-13
Isoeugenol97-54-1300 µM1.31113
3-(Dimethylamino)-1-propylamine109-55-7500 µM3.51-13
Resorcinol108-46-3500 µM6.31-13
alpha-Hexylcinnamaldehyde101-86-0300 µM10.81-13
Eugenol97-53-0500 µM11.61113
Geraniol106-24-1500 µM16.11113
Ethylene Glycol Dimethacrylate97-90-5500 µM281-13
2-Hydroxyethyl Acrylate818-61-1100 µM-1-13
Propyl Gallate121-79-9100 µM-1-13
1-Butanol71-36-3500 µM-0-03
1.2-Propanediol57-55-6500 µM-0003
Chlorobenzene108-90-7500 µM-0-03
DMSO (negative control)67-68-50.1%721004
Glycerol56-81-5500 µM-0-03
H2O (negative control)7732-18-50.1%-0003
Hexane110-54-3500 µM-0003
Isopropanol67-63-0500 µM-0-03
Lactic acid50-21-5500 µM-0-03
Unstimulated control-NA-00016 (12 + 4)
LLNA MLLP (%) describes the median-like location parameter for EC3 values. LLNA Call describes the binary sensitization category based on LLNA data. HDGS Call describes the binary sensitization category based on human data. Consensus reference Call describes the binary consensus reference category based on both human and LLNA data. N Replicates describes the total number of replicate acquired for treatment in this study. P-phenylenediamine and unstimulated control have a total of 16 replicates. A total of twelve of these were used for the technical transfer based on paired samples, and four samples were used for assessment of predictive capacity. Abbreviations: CASRN: Chemical Abstract Service Registry Number; LLNA MLLP: local lymph node assay median location-like parameter; HDSG Call: binary sensitization call based on human data collated and interpreted by the human data sub-group [27].
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Gradin, R.; Andersson, J.; Forreryd, A.; Johansson, H. RNA-Seq Can Be Used to Quantify Gene Expression Levels for Use in the GARDskin Assay. Toxics 2026, 14, 9. https://doi.org/10.3390/toxics14010009

AMA Style

Gradin R, Andersson J, Forreryd A, Johansson H. RNA-Seq Can Be Used to Quantify Gene Expression Levels for Use in the GARDskin Assay. Toxics. 2026; 14(1):9. https://doi.org/10.3390/toxics14010009

Chicago/Turabian Style

Gradin, Robin, Johan Andersson, Andy Forreryd, and Henrik Johansson. 2026. "RNA-Seq Can Be Used to Quantify Gene Expression Levels for Use in the GARDskin Assay" Toxics 14, no. 1: 9. https://doi.org/10.3390/toxics14010009

APA Style

Gradin, R., Andersson, J., Forreryd, A., & Johansson, H. (2026). RNA-Seq Can Be Used to Quantify Gene Expression Levels for Use in the GARDskin Assay. Toxics, 14(1), 9. https://doi.org/10.3390/toxics14010009

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop