Deep Mutational Scanning in Immunology: Techniques and Applications

Shao, Chengwei; Jia, Siyue; Li, Yue; Li, Jingxin

doi:10.3390/pathogens14101027

Open AccessReview

Deep Mutational Scanning in Immunology: Techniques and Applications

¹

School of Public Health, Southeast University, Nanjing 210009, China

²

Jiangsu Provincial Medical Innovation Center, National Health Commission Key Laboratory of Enteric Pathogenic Microbiology, Jiangsu Provincial Center for Disease Control and Prevention, Nanjing 210009, China

³

Nanjing Vazyme Biotech Co., Ltd., Nanjing 210033, China

^*

Author to whom correspondence should be addressed.

Pathogens 2025, 14(10), 1027; https://doi.org/10.3390/pathogens14101027

Submission received: 11 September 2025 / Revised: 5 October 2025 / Accepted: 9 October 2025 / Published: 10 October 2025

Download

Browse Figure

Versions Notes

Abstract

Mutations may cause changes in the structure and function of immune-related proteins, thereby affecting the operation of the immune system. Deep mutational scanning combines saturation mutagenesis, functional selection, and high-throughput sequencing to evaluate the effects of mutations on a large scale and with high resolution. By systematically and comprehensively analyzing the impact of mutations on the functions of immune-related proteins, the immune response mechanism can be better understood. However, each stage in deep mutation scanning has its limits, and the approach remains constrained in several ways. These include data and selection biases that affect the robustness of effect estimates, insufficient library coverage and editability leading to uneven representation of sites and alleles, system-induced biased signals that deviate phenotypes from their true physiological state, and imperfect models and statistical processing that limit extrapolation capabilities. Therefore, this technology still needs further development. Herein, we summarize the principles and methods of deep mutational scanning and discuss its application in immunological research. The aim is to provide insights into the broader application prospects of deep mutational scanning technology in immunology.

Keywords:

immunology; mutation; sequencing; antibody

1. Introduction

In recent years, advances in gene synthesis, gene editing, and high-throughput sequencing, together with continual improvements in bioinformatics, have markedly expanded the use of deep mutational scanning (DMS, also known as saturation mutation screening) in biomedicine [1,2,3]. DMS is a technology that combines deep sequencing with gene or genome libraries generated by programmed allelic mutations, thereby linking genotypes with phenotypes through high-throughput platforms to detect the functional effects of mutations at each single nucleotide position in a gene or genome [4]. Because genetic variants frequently alter amino acid sequences and thus protein structure and function, DMS has become a powerful approach for dissecting sequence–function relationships. At scale, it quantifies the impact of genetic variation efficiently and at comparatively low cost [5].

The normal function of the immune system depends on the coordinated activity of multiple proteins, including but not limited to antibodies, cell surface receptors, a part of signal transduction molecules, and effector molecules, which play a key role in identifying, responding to, and clearing pathogens and other foreign substances. The structure and function of these proteins may be affected by their own gene mutations, which may lead to enhanced, diminished, or dysregulated immune function [6,7]. Therefore, DMS holds substantial potential and broad prospects in immunology. Its key strength lies in the ability to link “site–variant–function” relationships in high-throughput platforms, thereby mapping molecular-level site effects onto immune outcomes at the cellular and even organismal levels. Compared with traditional mutational analysis, DMS affords broader coverage and higher resolution for assessing the consequences of variation [5]. Systematic, high-throughput characterization of immune-related gene variants is expected to illuminate mutation-driven mechanisms of immune function, refine our understanding of immune responses, and inform the development of immunotherapeutics, vaccines, and immune modulators.

This review describes the basic principles and methods of DMS and introduces the widely utilized technical frameworks at each step of the DMS process. It compares the advantages, limitations, and appropriate use cases of DMS in studies of antibody, antigen, and T-cell receptor (TCR), as well as in immunological disorders. We aim to highlight the significant role of DMS technology in advancing the forefront of immunological research and inspiring deeper exploration in this field in the future.

2. Deep Mutational Scanning Methods and Process

DMS primarily comprises three main components: construction of mutant libraries, functional screening, and high-throughput sequencing analysis (Figure 1) [8]. The central concept is to link “site-variant-function” in a high-throughput framework: First, a mutant library containing many mutations is constructed using synthetic biology or directed mutagenesis, encompassing single-site or multi-site substitutions across defined regions or the full protein sequence. Second, an appropriate selection or screening strategy is applied to report on variant activity. In addition to molecular readouts such as binding affinity, catalytic activity and stability, functional phenotypes of cellular immune activities such as secretion, infection/entry, signaling pathway activation, and survival/proliferation can also be considered. Third, high-throughput sequencing quantifies variant abundances before and after selection to infer effect sizes and establish robust genotype–phenotype relationships [9]. This process allows for the stable mapping of site-level molecular effects to the cellular and even individual levels and can be combined with in vivo models for validation to obtain more physiologically relevant immunological conclusions.

2.1. Construction of the Mutational Library

Ideally, the DMS library encodes every possible amino acid substitution in the protein of interest. Therefore, the success of DMS depends on high coverage of mutant sequences, which requires the use of efficient DNA synthesis and cloning strategies to ensure inclusion of all desired mutations [10].

Mutations are typically introduced by primer synthesis followed by polymerase chain reaction (PCR) to integrate mutations into DNA products for the construction of subsequent libraries [9]. Additionally, oligonucleotides containing mutation fragments can also be used as templates for homologous recombination and introduced into target cells (such as yeast or mammalian cells) through electroporation, liposome-mediated transfection, or other methods. The mutations are integrated into the genome by replacing corresponding regions of the target gene in the cells through homologous recombination mechanisms [11]. Early libraries were often generated by error-prone PCR, which lowers polymerase fidelity to stochastically introduce mutations and rapidly diversify sequences. However, this approach yields uneven mutational spectra with pronounced biases, leading to gaps in substitution coverage and frequent loss-of-function variants. An ideal random mutagenesis scheme would sample all nucleotides uniformly and maximize amino acid diversity—particularly when substituting three consecutive nucleotides. Therefore, the mutation rate needs to strike a balance between maintaining clone uniqueness and maintaining function to achieve optimal results [12,13]. This limitation is particularly prominent in immunology because small structural regions, such as antibody complementarity-determining regions (CDRs), are highly complex and have delicate functions. A single amino acid substitution can significantly change the affinity and specificity of antigen binding. Accordingly, systematic and precise amino acid saturation mutagenesis is essential to elucidate sequence–structure–function relationships in immune proteins. To overcome the limitations of error-prone PCR, programmed allelic series (PALs) were subsequently developed. PALs use synthetic oligonucleotides with degenerate codons (e.g., NNN/NNS/NNK) at specific sites, which can systematically cover all amino acid substitutions, thereby significantly reducing the bias caused by error-prone PCR [4]. In immunological applications, PALs have enabled targeted modification of antibody CDRs, such as combining single-stranded DNA with lambda exonuclease for DNA shuffling and achieving full coverage mutagenesis of the Complementarity-Determining Region 3 (CDR3) through NNK codons, facilitating systematic identification of residues governing antigen recognition [14]. However, PALs still have problems with uneven amino acid distribution and a large number of stop codons. To address these issues, the trinucleotide cassette (T7 Trinuc) design proposed by Krumpe et al. can achieve an equiprobable distribution of amino acids at each site while avoiding the introduction of stop codons, thereby further enhancing the diversity and effectiveness of the library [15,16].

In site-directed mutagenesis, traditional cassette approaches build on Kunkel mutagenesis, but these are time-consuming and limited in efficiency. Based on this, Firnberg and Ostermeier developed PFunkel, which combines Kunkel mutagenesis with Pfu DNA polymerase to enable rapid site-directed mutagenesis on double-stranded plasmid templates—typically within a single day [17]. PFunkel has been used to construct mutant libraries of tumor necrosis factor (TNF), pertussis toxin, and cancer target trophoblast cell surface antigen 2 (TROP2) antibodies, supporting high-resolution epitope mapping [18]. Although PFunkel has greatly improved the efficiency of site-directed mutagenesis, its scalability in long genes or multi-site mutagenesis is still limited. To further address these problems, scalable and uniform nicking mutagenesis (SUNi) has been developed in recent years. SUNi achieves higher uniformity and coverage, significantly reducing wild-type residues, by implementing double nicking sites on the template, optimizing the annealing temperature of the flanking homology arms, and introducing a GC clamp at the 5‘ end. This approach achieves greater uniformity and coverage while significantly reducing wild type. Compared with PFunkel, SUNi not only maintains strong scalability for long fragments and multi-gene targets, but also significantly improves overall library quality and screening efficiency [19].

Beyond strategies relying on oligonucleotide synthesis and plasmid cloning, CRISPR/Cas9 provides a versatile approach for generating high-coverage variants in situ across the genome. The basic idea is to generate programmable cuts at the target locus using Cas9, and then use oligonucleotides or fragment donors to guide homology-directed repair (HDR), thereby completing site-by-site replacements or small insertions/deletions. This allows for barcoding to read diversity and track allelic series. Complementary to random or degenerate codon strategies, CRISPR-mediated saturation mutagenesis emphasizes functional mapping in situ post-transcriptional/post-translational contexts, helping to reduce spurious or biased phenotypic signals caused by overexpression and ectopic expression. Its technical limitations mainly include heterogeneous editing accessibility (PAM/sequence context dependence), differences in HDR efficiency, and potential unintended indel/splicing effects. Therefore, it is recommended to include editing spectrum and diversity monitoring (such as targeted sequencing to assess substitution/indel distribution and wild-type residues) during the library construction stage, set positive/negative sites and neutral sites as controls, and include “editability/editing efficiency” as a covariate in subsequent analyses to improve the robustness and comparability of effect estimates [20,21].

2.2. Functional Screening

DMS requires a model suitable for high-throughput assays to link genotype with phenotype. Typically, the DMS display medium includes cell models that express the gene through steps such as transfection or transduction; for example, in the case of non-cell models, variants were synthesized or expressed through in vitro transcription and translation systems or reconstructed in vitro translation systems (PURE system) [22,23]. Each modality has distinct advantages and use cases: Cellular models can not only be used to analyze the binding or stability of immune-related proteins, but also through integrating mammalian cells and engineered T cell platforms, support systematic screening of multi-level functions such as antibody post-translational modification, immune cell secretion, viral infection response, and TCR specificity remodeling, thereby providing key support for immunology research and the development of immunotherapy strategies. By contrast, non-cell models offer tightly controlled biochemical environments that minimize cellular confounders and are well suited for screening variants affecting binding affinity or other intrinsic biochemical activities [16] (Table 1).

The cell models most commonly employed in DMS screening include yeast and mammalian cells [22,24,25,26]. Yeast offers rapid growth and short doubling times, facilitating large-scale culture and high-throughput screening, thus promoting its widespread application in DMS [26]. In addition, yeast benefits from mature genetic manipulation techniques, with broad applications in transfection and gene editing. Traxlmayr et al. combined yeast display with high-temperature selection to construct a human Immunoglobulin G1 Fragment crystallizable (IgG1-Fc) region mutation library and together with high-throughput sequencing to analyze the effects of residues on IgG1 stability, mapped the “stability landscape” of the IgG1 CH3 domain and inform its structural and evolutionary constraints [26]. However, yeast is not suitable for some human membrane proteins with complex folding or requiring specific glycosylation modification, so DMS related to immune proteins is increasingly performed in mammalian cells, which can provide post-translational modifications (such as glycosylation, phosphorylation, and ubiquitination) more closely resembling physiological states. These modifications are crucial for immune protein function and disease-related phenotypes [22]. Therefore, they have become an important platform for antibody drug engineering and immune receptor research. For example, during the antibody optimization process, DMS can guide the evaluation and modification of single-chain variable fragments (scFv) or antigen-binding fragments (Fab), which can subsequently be converted them into full-length glycosylated IgG in mammalian cells to meet the optimization requirements of therapeutic antibodies in terms of affinity and immunogenicity [21]. In addition, CRISPR-engineered human T-cell platforms enable systematic functional screens. 437 single amino acid variants and 260,000 combinatorial variants of more than 30 TCRs were analyzed by DMS, revealing the differences between TCR binding and antigen activation, and finding that some TCRs can still be strongly activated by antigens even if they cannot be detected by peptide/major histocompatibility complex (pMHC) tetramer binding [27]. This finding highlights the limitations of relying solely on binding assays. In terms of TCR specificity modification, systematic saturation and combinatorial mutagenesis of the CDR3 region have been used to generate large-scale libraries, which were introduced into Jurkat TCRβ-/- cells via lentiviral transduction to establish T-cell libraries. Through tetramer staining and flow cytometric sorting, researchers isolated TCR variants capable of recognizing the original epitope, novel epitopes, or both, followed by high-throughput sequencing for detailed analysis. The results showed that systematic DMS of the CDR3 region could achieve specific remodeling among closely related peptides, providing an important basis for analyzing the mechanism of TCR-pMHC interaction and developing more effective immunotherapies [28].

Non-cell models can efficiently evaluate the affinity phenotypes of many protein mutations while avoiding the complexity and variability that cellular systems might introduce. Common non-cell models include phage and ribosome display [2,29,30]. Phage display is the most widely used non-cell models and has enabled the discovery of hundreds of antibodies for research, diagnostic and therapeutic applications, especially antibodies against challenging targets and antibodies with tailored binding properties [2,23]. Phage display can be used to create and evaluate libraries of up to 1 × 10¹² clones, and high-abundance libraries are conducive to the discovery of high-affinity antibodies against antigens [16,29]. Schofield et al. reported a high-quality phage display library containing over a billion human antibodies, from which over 38,000 recombinant antibodies targeting 292 antigens were produced. They validated 7200 unique clones through specificity testing, sequence analysis, and various biochemical assays, highlighting the immense potential of whole-genome monoclonal antibody development [31]. In contrast to phage display, ribosome display does not require cloning or living cells, thus avoiding many limiting factors such as growth environment control. Therefore, it has the advantage in expressing toxic proteins or selection in hostile conditions, it also supports extremely high diversity (more than 1 × 10¹²). However, because RNA is susceptible to hydrolysis and nuclease degradation, limited molecular stability remains a principal drawback of the ribosome display system [16,30].

2.3. High-Throughput Sequencing and Data Analysis

Analyzing mutation libraries is a powerful strategy to understand the functional implications of variants, but the traditional first-generation Sanger sequencing identifies only a small number of mutants per batch, limiting downstream functional analyses. However, the rapid development and application of high-throughput sequencing have enabled the simultaneous profiling of hundreds of thousands of variants, efficiently enriching for advantageous mutations and establishing a foundation for DMS. Multiple next-generation sequencing (NGS) platforms, including Illumina, Nanopore, ABI/SOLiD, Polonator and Pacific Biosciences, support DMS by delivering billions of bases at low per-base cost [32,33,34].

At the data analysis level, DMS analysis usually relies on the calculation of amino acid variant enrichment ratios to identify key mutations that affect function. Early tools and frameworks, such as Enrich v1.0 and EMPIRIC v1.0 [35,36] compare variant frequencies before and after selection to infer relative fitness and provide basic statistical modeling and visualization for large datasets. However, the core of such methods is still based on the comparison of variant frequencies, and the results are easily affected by experimental conditions, sample processing, and sequencing bias. Especially in immunological research, this bias may lead to the neglect of low-frequency mutants, which often correspond to key immunological functions. For example, the SARS-CoV-2 receptor binding domain (RBD) mutation E484K was initially rare in DMS outputs, but was later confirmed to be a major antibody escape site [37]. Similarly, in TCR studies, certain low-frequency clones may be difficult to detect in peptide-MHC tetramer binding assays yet display strong immune responses in functional readouts [27,28]. These findings suggest that ignoring low-frequency mutants may lead to biased interpretations of immune response mechanisms or pathogen evolutionary trajectories, posing significant challenges to the analysis of results. Therefore, to improve reliability and accuracy, more advanced bioinformatics tools and computational models are necessary to address these limitations. On this basis, Fowler et al. developed Enrich2 v1.1.0, a comprehensive statistical model for analyzing DMS data applicable to datasets with any number of time points. Enrich2 is based on a random effects model with repeated results. It not only provides improved scoring methods to effectively reduce noise and detect small-effect mutations but also estimates mutation scores and standard errors to reflect sampling errors and experimental consistency. However, it relies more on normalization and good repeated design when the depth is very low or there are strong batch differences [36]. DiMSum v1.1.3 also relies on Poisson sequence count distributions and considers empirical variance to estimate errors. DiMSum differs by introducing specific additive and multiplicative modifications to handle empirical variance. By sharing empirical variance across variants, DiMSum is more robust to overdispersion and has a lower requirement for replicate numbers. However, when there is a large dispersion or insufficient barcode collapse, the error estimates may be conservative or introduce systematic biases. DiMSum demonstrates similar performance to Enrich2 on datasets with less pronounced overdispersion [38]. Apart from Enrich2 and DiMSum, Bloom et al. introduced dms_tools v1.0.1 as an analysis software that infers the preference of each codon for each amino acid from the given selection pressure or assesses the extent to which these preferences change under different selection pressures and has been shown to be more accurate in inference from simulated data than simply calculating the ratio of counts before and after selection. The intuitive visualizations created by this software aid in result interpretation, guiding protein engineering, understanding sequence–structure–function relationships, and providing insights for developing better evolutionary models for sequence analysis, but does not integrate three-dimensional structure priors or epistasis modeling in the native statistical framework [39]. In practice, dms_tools has been widely used in immunology, including the analysis of neutralizing antibody escape of HIV-1 envelope protein (Env) [40], the mapping of mutational functions of influenza virus hemagglutinin (HA) [41], and SARS-CoV-2 RBD immune escape studies [37]. These studies have demonstrated the important value of dms_tools in the analysis of viral immune escape and antigenic evolution. Similarly, all three methods are limited by the inherent issues of count data, namely sensitivity to PCR bias, sequencing depth, and batch variation. Consequently, they may underestimate low-frequency, functionally relevant substitutions in immunological applications, covering a wide range of loci, from antigenicity/immunogenicity (including escape) to affinity and specificity, signal transduction and conformational regulation, folding stability, and expression/secretion/localization. In practice, selection can be based on data conditions, such as prioritizing dms_tools for low depth, using Enrich2 for multiple time points, and selecting DiMSum for overdispersion. Threshold calibration and reproducibility testing should be performed in independent batches or cohorts. Furthermore, structural and evolutionary features should be incorporated as covariates into subsequent models, and strict control should be exercised for data exceeding model expectations [36,38,39].

3. Application of Deep Mutational Scanning in Immunology

The application of DMS in immunology focuses on three key molecules: antibodies, antigens and TCRs, which correspond to the core components of humoral immunity, pathogen escape, and cellular immunity, respectively. By systematically mapping mutational effects, DMS enables us to understand the interactions between these core molecules and their mechanisms of action in immune responses, from molecular mechanisms to integrated immune functions.

3.1. Antibody Engineering

Antibody engineering has become an important field in biomedical research and drug development, aiming to improve the biological and functional properties of antibody candidates to enhance clinical efficacy [42]. As a versatile platform, DMS has shown its unique advantages among many technologies and methods through protein mutagenesis and functional screening combined with deep sequencing and bioinformatics, providing the possibility of improving the affinity, specificity, and stability of antibodies [43,44]. In addition, DMS can accurately map the fine conformational epitopes targeted by a given antibody, providing a better understanding of the structural basis of its protective mechanisms, which can enhance preventive or therapeutic interventions for human diseases.

Monoclonal antibodies (mAbs) isolated from immune or synthetic libraries can be further optimized to enhance their therapeutic properties, with affinity to their homologous antigens being a key determinant of functional efficacy. Protein therapeutics with precise in vitro affinity tuning have also provided new insights due to the application of DMS. For example, Fujino et al. implemented a systematic affinity engineering strategy for the Fab fragment of an antibody against the tumor necrosis factor receptor (TNF-αR). They first performed single-site DMS across the six CDRs of the heavy chain (VH) and light chain (VL) to constructed a comprehensive library of single amino acid substitutions, then used ribosome display for functional screening to identify beneficial mutations that increased antigen binding. These single-point mutations were then combinatorially optimized, and a significant improvement in antibody affinity was achieved with only seven amino acid substitutions, reducing the dissociation constant (Kd) from 7.28 nM to 3.45 pM, an overall improvement of more than 2000 times. This result not only demonstrated the powerful ability of DMS in rapidly screening beneficial mutations, but also demonstrated its unique advantages in guiding the combination of multiple mutations and achieving fine optimization of antibody affinity [23]. Building on this foundation, subsequent studies have further utilized DMS to not only to identify advantageous substitutions, but also systematically map the comprehensive fitness landscape of antibody–antigen binding, thus expanding the application of DMS in antibody engineering. Forsyth et al. developed DMS for antibody CDRs, capable of systematically assessing the impact of every possible single-point amino acid substitution on antigen binding. This method utilizes a full-length IgG library containing over 1000 CDR point mutations, displayed in mammalian cells, and sorted based on antigen affinity using flow cytometry. High-throughput sequencing is then used to analyze the enrichment or depletion of different mutations, thereby mapping the functional landscape of high-affinity, low-affinity and neutral mutations. When applied to the humanized anti-EGFR antibody hu225 (the parent antibody of cetuximab), this method covered 1121 single-point substitutions at 59 CDR positions across VH and VL regions, yielding a nearly comprehensive fitness landscape for antibody–antigen binding. Most substitutions were neutral or deleterious, but 67-point mutations that significantly improved affinity were identified. These not only verified the existing structural and functional data, but also revealed new optimized residues. DMS thus provides a robust tool for systematically analyzing the antibody–antigen interface and guiding antibody engineering [24]. Beyond mAb affinity optimization, DMS also informs potential in bispecific antibody design. Given the promise of dual-targeting antibodies in enhancing efficacy, Koenig et al. used a vascular endothelial growth factor (VEGF)/angiopoietin 2 (Ang2) dual action Fab (DAF) as a model, performed systematic DMS on the CDRs, and combined it with phage display and high-throughput sequencing. They not only identified beneficial mutations that enhanced VEGF or Ang2 binding, but also revealed synergistic effects and key stability sites between different CDR residues. Through further combinatorial optimization, the 5A12 antibody, which had an initial affinity of approximately 5 nM, was modified into multiple variants with sub-nanomolar affinities for both antigens, with blocking effects comparable to high-affinity monospecific antibodies. This provides the first demonstration that DMS can achieve dual affinity maturation, thereby expanding the design concept of bispecific antibodies. In addition to showing application prospects in neovascular age-related macular degeneration, this strategy is also applicable to VEGF/Ang2-driven abnormal tumor vascular remodeling and immunosuppression, and is expected to improve the tumor immune microenvironment, enhance immune cell infiltration, and enhance anti-tumor effects [45]. In addition to optimizing affinity and specificity, DMS enables high-resolution conformational epitope mapping. Using yeast display coupled with DMS, Kowalsky et al. mapped the binding epitopes of infliximab and TNF, Hu1B7 and pertussis toxin, and confirmed that the conformational epitopes obtained by this method were highly consistent with existing data, demonstrating the reliability of the experimental process. Compared with traditional low-throughput methods that rely on co-crystallization or mass spectrometry, this new strategy combining comprehensive mutagenesis, cell surface display and deep sequencing can quickly map high-resolution epitope maps of multiple antibody–antigen complexes in a shorter time, at a lower cost and with reduced antibody consumption. These datasets not only revealed the functional contribution of single-point mutations in antigen binding, but also provided data support for predicting escape mutations, evaluating cross-reactivity, and improving protein–protein interaction computational models. Although this method still has certain limitations when dealing with antigens with complex structures or those that rely on glycosylation modifications, it has been successfully applied to complex proteins such as TNF and TROP2, showing its broad prospects in immunology research and antibody drug development [18]. Based on this, researchers have gradually realized that the potential of DMS is not limited to structural and functional elucidation but can also extend to clinically relevant engineering problems. For example, maintaining antibody binding activity while reducing its immunogenicity is a pressing challenge in antibody drug development. After antigen processing, antibody variable regions are cleaved into peptides of approximately 15 amino acids, which fit into the HLA-II binding groove with varying registers. Because CDRs (particularly CDRH3) are often enriched with aromatic and hydrophobic residues, these side chains physiochemically favorably match key pockets (typically P1, P4, P6, and P9) within the HLA-II groove. Consequently, high-affinity HLA-II epitopes are formed at the variable region-framework interface or within the CDRs, resulting in structural overlap with functional CDRs. On this basis, strategies to reduce HLA-II binding affinity include first identifying the registration and anchoring sites of the epitope, followed by site-specific substitutions with mismatched physiochemical properties. For example, aromatic or branched residues that fit the hydrophobic pocket can be replaced with charged or strongly hydrophilic residues, or perturbations can be introduced at key positions to alter registration, thereby weakening the peptide’s anchoring in the pocket, reducing binding free energy and stability, and thus diminishing presentation and T cell recognition. To minimize the impact on antigen binding function, such replacements should be preferentially placed on side chains or CDR edges or adjacent framework sites that are not directly involved in antigen binding, and verified using DMS functional readout [46,47]. To this end, Sivelle et al. proposed a strategy combining T cell epitope prediction with DMS to reduce the immunogenicity of therapeutic antibodies. Because CD4+ T cell epitopes often overlap with antibody CDRs, and direct removal is challenging. They first used the netMHCIIpan3.2 algorithm to predict HLA class II binding in the heavy chain complementarity-determining region 2 (HCDR2) and heavy chain complementarity-determining region 3 (HCDR3) regions, covering 46 alleles representing approximately 90% of the global population. Heatmap was used to identify permissive substitutions that could reduce HLA binding. DMS was then combined with yeast display to screen for mutations that both weakened human leukocyte antigen (HLA) binding and maintained antigen binding. Based on this information, combinatorial libraries were constructed to isolate functional clones. Using the anti-TNF-α antibody adalimumab as a model, the study identified approximately 200 mutants with lower HLA binding scores than the original antibody. When constructed as full-length antibodies, three of these mutants showed higher TNF-α affinity and neutralizing activity than adalimumab. These results indicate a degree of immunogenicity tolerance in antibody sequences and demonstrate that integrating DMS with epitope prediction can reduce immunogenicity, prolong in vivo efficacy, and mitigate anti-drug antibody (ADA) development while maintaining or even enhancing function [48].

3.2. Antigen Epitope Identification

Antigen epitope identification is a critical step in vaccine development, enabling precise characterization of how antibodies engage their antigenic targets. Unlike traditional approaches such as Enzyme-Linked Immunosorbent Assay (ELISA), enzymatic digestion, or chemical cleavage, which only offer limited sequence analysis, the high resolution of DMS can interrogate saturated mutational coverage of antigenic targets, thereby accurately identifying and analyzing epitopes to provide key escape mutation information. In recent years, DMS has been widely applied in virology, especially for rapidly evolving pathogens including SARS-CoV-2, influenza virus, and Zika virus. DMS can better understand how viruses evade the surveillance of the immune system and thus assist in designing effective vaccines, which is crucial for addressing the challenges posed by evolving viruses [49].

The RBD of the SARS-CoV-2 spike glycoprotein mediates viral attachment to the angiotensin converting enzyme 2 (ACE2), determining host range and serving as a dominant target of neutralizing antibodies [50,51,52]. Starr et al. utilized the yeast surface display platform to perform DMS on a library of mutations in the SARS-CoV-2 RBD region, investigating how these variants affect ACE2 binding affinity and protein expression. Although several substitutions increased ACE2 binding, these mutations did not exhibit a selective advantage in the circulating strains of SARS-CoV-2 [53]. The complex relationship between the biochemical phenotype of RBD presented by yeast and viral fitness, along with the method primarily focusing on measuring antibody binding limit its application scope [53]. By contrast, assays that evaluate neutralization using full-length spike in cellular infection models are considered more directly relevant to protection and better capture viral adaptability and immune responses [54]. Dadonaite et al. used non-replicative pseudotyped lentiviruses to create libraries of the Omicron BA.1 and Delta spikes, with DMS directly quantifying the impact of mutations on antibody neutralization. These measurements showed good correlation with traditional pseudovirus neutralization assays [54]. On this basis, they expanded the evaluation to over 9000 mutations in the XBB.1.5 and BA.2 spike proteins, analyzing multiple functions including ACE2 binding, cell entry, and escape from human serum. DMS analysis results revealed mutation sites on the spike protein that significantly affect ACE2 binding or enhance the ability to escape from the serum of breakthrough infected individuals [55]. These data show that DMS provides important insights into the evolution of SARS-CoV-2.

Similarly, mutation maps based on DMS have also been applied to reveal epitope escape information for other viruses, such as Zika virus (ZIKV) and influenza virus. Sourisseau et al. performed DMS to systematically saturate mutagenesis of the Zika virus E protein to generate high-resolution functional landscapes of viral growth and antigenicity [56]. The study not only revealed differences in mutation tolerance among different residues, such as disulfide bond cysteines and histidines associated with low-pH conformational transitions being highly sensitive to mutation, while surface residues were relatively tolerant, but also verified the consistency of these patterns with existing structural understanding. Further analysis showed that mutations in the fusion loop and key linker regions of domains I-III were strictly restricted, which are the core of viral receptor binding and membrane fusion. They also systematically mapped the antigenic escape mutations of two monoclonal antibodies and found that the selective pressure of neutralizing antibodies would promote the enrichment of escape mutations in the population generated by Vero cells [56,57,58]. These results not only validated the reliability of DMS within a single virus system, but also indicate its potential to reveal immune-escape patterns across diverse viruses. Therefore, DMS has been extended to the hemagglutinin (HA) protein of influenza virus. Using a modified DMS strategy for H1N1 strains, researchers systematically evaluated the functional effects of nearly all amino acid substitutions in the influenza A virus HA gene and achieving 98% coverage. By combining a large-scale mutant library with deep sequencing and employing a “helper virus” approach to generate the viral library, they effectively overcame the bottlenecks inherent in traditional plasmid construction, significantly improving the accuracy and reproducibility of the measurements. The resulting fitness maps illuminated the diverse contributions of HA residues to viral replication and immune function: antigenic sites in the globular head were highly tolerant to mutation, consistent with their propensity for immune escape. In contrast, the stem region targeted by broadly neutralizing antibodies, disulfide bond-forming cysteines, and key histidines involved in low-pH conformational changes were subject to strict mutational constraints, underscoring their potential as vaccine or therapeutic targets. Phylogenetic analyses further confirmed that these experimental results accurately reflected evolutionary constraints on HA, with most sites exhibiting conserved amino acid preferences across homologs, thereby supporting the robustness and generalizability of the approach. Moreover, by comparing in vitro and in vivo fitness, DMS can identify variants that achieve high production titers yet display attenuated phenotypes in the host, informing the design of live-attenuated vaccine candidates. Overall, DMS not only deepens our understanding of the HA structure–function relationship and immune escape mechanisms, but also provides important references for predicting influenza evolutionary pathways, building more accurate viral evolution models, and guiding the design of antiviral drugs and vaccines. It also highlights the broad applicability of DMS to other genetically tractable viruses and microbial genomes [59,60,61]. Lee et al. also revealed that favorable mutations are enriched in evolutionary successful lineages by measuring the effects of all single amino acid mutations in H3N2 strain HA on viral growth based on DMS. However, comparisons between H3 HA and H1 HA data showed significant differences in amino acid preferences and mutation tolerances, emphasizing the importance of experimental measurements in understanding viral evolution, although their effectiveness depends on the similarity between experimental and natural strains [62].

3.3. Recognition by T Cell Receptors

TCRs coordinate cellular immunity by recognizing short peptide antigens bound and presented by MHC molecules [63]. DMS has made significant advances in the TCR field, providing new strategies to enhance the affinity and specificity of TCRs for pMHC complexes. Screening of mutation libraries based on DMS has laid a solid foundation for the development of efficient and safe TCR-based immunotherapies.

TCR affinity is inherently low, making it a target for protein engineering to improve stability and affinity. Affinity maturation efforts typically concentrate on residues within the CDR loops at the binding interface [64]. Sharma et al. used DMS to comprehensively analyze the functional residues of TCRs in their recognition of the cancer antigen MART-1·HLA-A2. Unlike previous affinity maturation approaches that focused solely on CDRs, this study systematically included Vα/Vβ interfacial and framework residues for analysis, constructing a single-codon library covering both CDR and non-CDRs. The results revealed that in addition to CDR residues, the Vα/Vβ interface is critical for proper folding and stable binding. Notably, some interface mutations (e.g., F45βY) significantly enhanced TCR stability and affinity: affinity increased approximately 50-fold in yeast display and 60-fold in vitro assays, while maintaining high specificity for MART-1·HLA-A2. Further experimental validation using a two-codon interface library confirmed the potential of these interface residues for affinity and stability optimization. These results demonstrate that interface residues distal to the binding site can substantially modulate TCR function, supporting their inclusion in DMS and directed evolution strategies. This discovery not only broadens our understanding of the TCR structure–function relationship but also provides a viable optimization path and theoretical basis for soluble TCR engineering and adoptive T cell therapy [65]. In addition to the systematic exploration of interface residues, some studies have further combined DMS with directed evolution to map the sequence fitness landscape of TCR-peptide-MHC interactions more comprehensively and to propose principled optimization strategies. Harris et al. optimized the interactions of cancer antigen-specific RD1-MART1HIGH TCR with pMHC by prioritizing DMS enriched substitutions. Research demonstrates that combining directed evolution with DMS can comprehensively characterize the sequence fitness landscape of TCR-peptide-MHC interactions, overcoming limitations of traditional phage or yeast display approaches, which focus on point-by-point modifications within the CDRs. Using the cancer antigen-specific TCR RD1-MART1HIGH as an example, DMS systematically screened for beneficial mutations at both interface and non-interface residues. These substitutions were found not only to enhance binding individually but also to markedly enhance TCR affinity and cell-surface expression when combined. Results showed that affinity could be increased by over 200-fold and yeast surface expression by approximately 6-fold. The study also compared the fitness landscape-informed mutation combination strategy with a multi-codon library screening approach and corroborated this strategy with computational modeling. While computational simulations performed well in predicting mutations at the binding interface, they struggled to accurately predict affinity changes for mutations distal to the interface, highlighting the role of DMS data in complementing and calibrating computational models. Overall, this study demonstrates that DMS can not only efficiently identify and combine affinity mutations, but also take stability factors into account, providing strong experimental and theoretical support for TCR optimization design, cancer immunotherapy and soluble TCR engineering [66].

However, increasing TCR affinity usually does not improve its specificity, and the development of TCRs as immunotherapeutic agents is hindered by low specificity (intrinsic TCR cross-reactivity). Mechanistically, TCR recognition of pMHC is degenerate and relies on the plastic CDR3 loops to achieve induced fit, so a single TCR can accommodate multiple pMHCs with similar physicochemical features. Even when not located at the binding interface, engineered mutations can, via long-range allostery from the framework, alter CDR3 pre-organization and flexibility and indirectly rearrange interfacial interactions (hydrogen bonds/water bridges, salt bridges, hydrophobic packing, and aromatic stacking/cation–π), thereby redistributing binding energy and rewriting the energetic landscape of recognizable peptides. These structural changes manifest kinetically as shifts in k_on/k_off and K_D, with small decreases in k_off being amplified by kinetic proofreading to cross activation thresholds. When mutations restrict CDR3 conformational sampling and increase the k_off for off-target ligands, the recognition repertoire narrows and specificity improves; conversely, if mutations lower the off-target k_off or increase CDR3 flexibility, selectivity may relax and cross-reactivity rise. Thus, fine allosteric tuning of CDR3 pre-organization and dissociation kinetics—without directly altering the interface—is a key route to remodel the TCR recognition landscape [67,68,69]. In clinical applications, affinity-specificity is directly related to immune safety: increased affinity is often accompanied by increased cross-reactivity, and the risks are mainly manifested as on-target/off-tumor (low-level target antigens in normal tissues are attacked) and off-target/cross-reactivity (host peptides like the target peptide are used as targets). Especially when the CD8-independent activation threshold is obtained, it is more likely to be amplified. The consequences can be upgraded from general adverse reactions to lethal toxicity, including organ damage caused by misidentification of myocardial, neural or endocrine-related autologous peptides, systemic toxicity caused by attack of widely low-expressed targets, and cytokine storm and neurotoxicity induced by non-specific strong activation [63,67,68,69]. Various protein engineering strategies have been explored to enhance TCR specificity; thus, designing more specific TCRs has proven to be challenging [70,71,72]. Rosenberg et al. investigated the 868 TCR specific for the HLA-A2 presented HIV SL9 epitope and used DMS to assess whether framework mutations distal from the binding interface could modulate specificity. Single-site mutation libraries of α and β chains were generated using a yeast display system and these libraries covered all 6 CDR loops and adjacent framework regions. DMS analysis revealed that substitutions above the CDR3β loop did not significantly affect SL9 binding affinity but weakened recognition of escape variants, while mutations near the tip of CDR3α reduced the adaptability of TCR to different ligands. These findings indicate that non-interface framework mutations can enhance TCR specificity without changing binding affinity or introducing novel reactivities via direct interface alterations [63]. Thus, systematic mutation analysis combined with DMS provides a new approach to resolving the contradiction between TCR affinity and specificity optimization. However, improving affinity is often accompanied by increased cross-reactivity, which may lead to unexpected recognition of non-target antigens or host peptides by TCR, thereby causing serious side effects or even fatal reactions. Mitigating off-target recognition while improving efficacy and ensuring immune safety remains a central challenge for future TCR engineering.

4. Conclusions and Future Perspectives

As a rapidly advancing technology, DMS has introduced substantial innovations to immunology research. This review not only systematically introduces the core principles and experimental workflow of DMS, but also comprehensively analyzes its advantages and limitations in library construction, display platforms, screening models, and data analysis. Compared to traditional systematic point mutagenesis or directed mutagenesis, DMS achieves high mutation coverage and analytical depth by coupling saturation mutagenesis with NGS. Firstly, it can generate nearly all possible amino acid substitutions across the entire protein, thus overcoming the limitations of previously targeted modifications limited to individual sites. Secondly, NGS offers the ability to simultaneously analyze millions of variants, far exceeding the throughput and resolution limitations of Sanger sequencing. More importantly, DMS has demonstrated distinctive value in multiple core areas of immunology. In antibody engineering, it can systematically map the functional landscape of antibody CDRs, identify key residues that improve affinity and stability, and guide the optimization of multiple mutational combinations. In epitope mapping, DMS provides high-resolution antigen escape maps, revealing the fine structural features of conformational epitopes and providing predictive evidence for vaccine development. In TCR research, DMS has transcended the limitations of focusing solely on CDRs, expanding its analysis to include framework regions and the Vα/Vβ interface, enabling a systematic analysis of TCR-peptide/MHC interactions. These applications not only deepen our understanding of the structure–function relationship between antibodies and receptors but also provide strong technical support for optimizing immunotherapy strategies, modifying antibody drugs, and predicting immune escape, demonstrating its enormous potential to advance immunology research and applications.

4.1. Challenges and Limitations

Despite the importance of DMS in immunology, the interpretation and generalization of DMS findings to actual immune responses or clinical applications remain limited. First, regarding experimental systems, most available DMS datasets are derived from living cell environments, including mammalian cells, yeast, and bacterial models. These platforms enable the expression of human or pathogenic proteins under conditions more closely resembling physiological conditions, thus more realistically reflecting structure–function relationships. These methods offer advantages in terms of high throughput and rapid analysis of relatively simplified biochemical phenotypes, such as protein-ligand binding, but their ability to characterize complex cellular processes and upstream and downstream signaling is limited. It is important to emphasize that even DMS performed in living cell systems, such as mammalian cells, cannot fully recapitulate the complexity of the in vivo immune microenvironment. Gradients of antibodies and cytokines across tissues, the spatial organization, and interactions among immune cells, as well as dynamic regulation driven by inflammation, infection, and metabolic states, all influence the true outcomes of antigen–antibody binding or receptor-ligand interactions. Such differences may result in functional effects observed in experiments diverging from those occurring in vivo, thereby constraining the translational potential and predictive accuracy of DMS data.

Secondly, DMS relies heavily on a designable, quantifiable functional screening readout. This means that DMS is most effective when the biological function of the target protein is well-defined and can be operationalized as quantifiable, high-throughput screening metrics (such as binding, secretion, infection, signaling activation, and cell growth/survival). By contrast, scalable, generalizable screening paradigms remain scarce for immune molecules with poorly characterized functions or those dependent on multicellular circuits or tissue structures. The overall physiological effects of such molecules (such as immune response integration, metabolic regulation, and neuro-immune coupling) are difficult to quantify within existing standardized DMS workflows, limiting the potential for systematic research. When quantifiable phenotypes are limited, dependence on library quality and sampling balance is further magnified. On this basis, library representativeness and coverage also pose practical constraints: oligonucleotide synthesis bias and bottlenecks in cloning and expression/viability lead to uneven sampling, reducing the actual proportion of certain amino acid substitutions and their combinations in the dataset, thereby undermining the stability of effect estimates and tending to systematically underestimate epistasis as well as regions constrained by post-translational modifications. Likewise, data analysis faces stability challenges. Because DMS relies on sequencing-based count readouts, it is highly sensitive to PCR amplification and sequencing depth, and batch effects are difficult to eliminate. Different normalization strategies and choices of hierarchical or mixed-effects modeling can also introduce substantial discrepancies, diminishing cross-platform and cross-study comparability and reproducibility, and further amplifying uncertainty in threshold setting (e.g., positive/negative, or pathogenic/benign) and evidence grading when extrapolating to clinical contexts [36,73].

Thirdly, existing evidence remains insufficient regarding the critical issue of the immune microenvironment and in vivo complexity. While DMS applications for antibodies, antigens, and TCRs are relatively mature, DMS work targeting immune checkpoints or complex receptor networks that can be functionally engineered and validated under conditions resembling the in vivo microenvironment remains limited. At present, only a few related explorations can provide indirect inspiration. For example, the structural study of the programmed cell death protein-1 (PD-1)/PD-L2 high-affinity complex provides residue-level information to support the development of small molecule immune checkpoint drugs, but it has not been fully integrated with the systematic mutation-function mapping of DMS [74]. these observations underscore the urgent need to extend DMS into research systems that more closely approximate in vivo immune complexity.

Finally, translational applications for clinical diagnosis and pathogenic mechanism elucidation are still in early stages, and systematic practices for typing diagnosis of immune diseases, grading of functional evidence of pathogenic variants, and even supporting personalized treatment decisions are still limited. Although current explorations have mostly focused on cross-analysis combining DMS data with population genetic variation databases (such as ClinVar and gnomAD) to determine whether a specific mutation is potentially pathogenic. For example, in the fields of cancer genetics and genetic diseases, several studies have linked DMS functional scores to clinical databases for variant reclassification and evidence grading. Taking breast cancer gene 1 (BRCA1) as an example, functional scores derived from saturated gene editing correlate well with pathogenic/benign annotations in ClinVar and provide actionable evidence for reclassification of variants of uncertain significance (VUS). Similarly, DMS readouts of transcriptional activation in tumor protein p53 (TP53) can distinguish between clinically pathogenic and benign alleles, and functional scores for phosphatase and tensin homolog (PTEN) are also consistent with clinical phenotypes and database annotations. This practice demonstrates a relatively general workflow: accurately mapping the site-variant-effect size matrix generated by DMS to variant coordinates in ClinVar/gnomAD, removing low-depth or low-confidence sites, calculating consistency metrics between the scores and clinical labels, and performing threshold calibration and reproducibility testing in independent cohorts or prospective samples [75,76,77,78,79,80,81,82,83]. However, the overall process is still at the stage of methodological validation and small-scale application, especially in autoimmune disease-related research. It should be noted that in the overall field of clinical diseases, systematic cases of immune-related diseases are still relatively scarce. On the one hand, the pathological phenotypes of many immune molecules require the coordination of multicellular circuits and tissue structures to be manifest, and it is difficult to obtain stable and quantifiable DMS readouts in the short term. On the other hand, there is a lack of prospective validation cohorts that are directly aligned with clinical outcomes (diagnosis, disease subtype, efficacy, or prognosis), resulting in insufficient calibration of thresholds and strength of evidence (Table 2).

Overall, the barriers limiting the clinical translation of DMS primarily include the gap between experimental settings and the in vivo immune microenvironment: in vitro and heterologous systems cannot recapitulate tissue-level antibody/cytokine gradients and cell–cell interactions, thereby reducing the reliability of clinical extrapolation. At the same time, DMS readouts depend on clearly defined, quantifiable phenotypes, which makes it difficult to cover complex immune functions that manifest only through multicellular circuits and tissue-level coordination. Furthermore, library representativeness and coverage are constrained by bottlenecks in oligonucleotide synthesis, cloning, and expression/viability, leading to uneven sampling and a tendency to underestimate epistasis and regions constrained by post-translational modifications. On the analysis side, sequencing-based count data are sensitive to batch effects and modeling choices, which weakens cross-platform comparability and makes threshold setting and evidence grading for clinical extrapolation less stable. In addition, functional engineering and validation under more physiologically relevant conditions remain insufficient, and clinical applications are largely at the stage of methodological validation and small-scale pilot use. Consequently, DMS is presently better suited to provide mechanistic and supportive evidence; its clinical interpretability will continue to depend on calibration and validation within near-physiological models and more robust analytical frameworks.

4.2. Future Perspectives

Despite the limitations, DMS holds great promise in immunology. Looking ahead, the key lies in translating the “site-variant-function” readout at the cellular level and mapping it to immune outcomes at the tissue and individual levels. To this end, DMS can be deeply coupled with single-cell multi-omics to simultaneously analyze cell states, lineage relationships, and functional outputs within the same mutational context. In the complex immune system, numerous correlations often exist, but causal relationships are difficult to directly confirm. By combining CRISPR-Cas9 and DMS technologies, precise, controllable perturbations at key nodes of immune pathways are expected to more clearly determine whether changes in specific molecules directly lead to downstream immune responses, thereby substantially improving the reliability of causal inferences [85,86,87,88,89,90]. Furthermore, by utilizing phenotype-barcode and time-series sampling, longitudinal selection pressure tracking can be performed under conditions such as cytokine secretion, pathogen infection, or drug pressure, thereby characterizing the dynamic trajectory of mutation effects as the microenvironment changes. Organoid and tissue slice co-culture models enable readouts of complex phenotypes such as secretion, migration, chemotaxis, and cell interactions in contexts that more closely recapitulate in vivo spatial structure, cell composition and matrix composition. In vivo platforms such as humanized mice can help connect the above readouts to endpoint indicators of tissue remodeling and clinical patterns, thereby enhancing the physiological relevance and inferential value of DMS results [91,92,93,94,95,96,97]. At the data and model level, multimodal integration of DMS count data with proteomics, transcriptomics, and other data can improve the detection and interpretability of low-frequency but critical mutations using hierarchical/mixed effects models and causal inference methods [98,99,100]. At the same time, structural biology and computational modeling should be incorporated the effects of mutations on three-dimensional conformation, binding interface, and homeostasis into the analytical framework. Combined with machine learning and deep learning, variants with better stability, affinity, and specificity can be pre-prioritized during the in vitro design stage [101]. At the translational level, the combination of all the above strategies is expected to elevate the cell-level readout of DMS to evidence that can be used for clinical interpretation, including functional annotation and classification of immune-related variants, to support the classification, diagnosis, and therapeutic stratification of immune diseases. This will also facilitate the optimization of antibodies, receptors, and ligands, providing a quantitative basis for target and site identification in diagnostics. Furthermore, by linking patient samples (such as peripheral blood mononuclear cells (PBMCs) and tumor-infiltrating immune cells), a functional evaluation pipeline aligned with clinical efficacy endpoints will be established, enabling the design and optimization of personalized treatment plans. Overall, through technological integration and methodological innovation, DMS will provide more precise and comprehensive tools for immunology research, advancing our understanding and clinical utilization of the immune system.

Author Contributions

Conceptualization came from C.S. and J.L. C.S. and Y.L. contributed to the literature search. C.S. drafted the manuscript. J.L. and S.J. contributed to the revision of this manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Major Research Plan of the National Natural Science Foundation of China (NSFC), grant number 92269205.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this review.

Acknowledgments

Figure support was provided by Figdraw.

Conflicts of Interest

Yue Li is an employee of Nanjing Vazyme Biotech Co., Ltd. The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

ACE2	angiotensin converting enzyme 2
ADA	anti-drug antibodies
Ang2	angiopoietin 2
CDR	complementarity determining region
DAF	dual action Fab
DMS	deep mutational scanning
ELISA	enzyme-linked immunosorbent assay
Fab	antigen-binding fragments
HCDR	heavy chain complementarity-determining region
HDR	homology-directed repair
HLA	human leukocyte antigen
IgG1	immunoglobulin G1
mAbs	monoclonal antibodies
MHC	major histocompatibility complex
PALs	programmed allelic series
PCR	polymerase chain reaction
PD-1	programmed cell death protein-1
RBD	receptor binding domain
scFv	single-chain variable fragments
SUNi	scalable and uniform nicking mutagenesis
TCR	T-cell receptor
TROP2	trophoblast cell surface antigen 2
TNF	tumor necrosis factor
TP53	tumor protein p53
PBMC	peripheral blood mononuclear cell
PTEN	phosphatase and tensin homolog
VEGF	vascular endothelial growth factor
VUS	variants of uncertain significance

References

Hietpas, R.T.; Jensen, J.D.; Bolon, D.N.A. Experimental illumination of a fitness landscape. Proc. Natl. Acad. Sci. USA 2011, 108, 7896–7901. [Google Scholar] [CrossRef] [PubMed]
Ernst, A.; Gfeller, D.; Kan, Z.; Seshagiri, S.; Kim, P.M.; Bader, G.D.; Sidhu, S.S. Coevolution of PDZ domain–ligand interactions analyzed by high-throughput phage display and deep sequencing. Mol. Biosyst. 2010, 6, 1782–1790. [Google Scholar] [CrossRef] [PubMed]
Kinney, J.B.; McCandlish, D.M. Massively parallel assays and quantitative sequence–function relationships. Annu. Rev. Genom. Hum. Genet. 2019, 20, 99–127. [Google Scholar] [CrossRef] [PubMed]
Wei, H.; Li, X. Deep mutational scanning: A versatile tool in systematically mapping genotypes to phenotypes. Front. Genet. 2023, 14, 1087267. [Google Scholar] [CrossRef]
Fowler, D.M.; Araya, C.L.; Fleishman, S.J.; Kellogg, E.H.; Stephany, J.J.; Baker, D.; Fields, S. High-resolution mapping of protein sequence-function relationships. Nat. Methods 2010, 7, 741–746. [Google Scholar] [CrossRef]
Parkin, J.; Cohen, B. An overview of the immune system. Lancet 2001, 357, 1777–1789. [Google Scholar] [CrossRef]
Van Beek, E.M.; Cochrane, F.; Barclay, A.N.; Berg, T.K.v.D. Signal regulatory proteins in the immune system. J. Immunol. 2005, 175, 7781–7787. [Google Scholar] [CrossRef]
Weile, J.; Roth, F.P. Multiplexed assays of variant effects contribute to a growing genotype–phenotype atlas. Hum. Genet. 2018, 137, 665–678. [Google Scholar] [CrossRef]
Burton, T.D.; Eyre, N.S. Applications of deep mutational scanning in virology. Viruses 2021, 13, 1020. [Google Scholar] [CrossRef]
Kemble, H.; Nghe, P.; Tenaillon, O. Recent insights into the genotype–phenotype relationship from massively parallel genetic assays. Evol. Appl. 2019, 12, 1721–1742. [Google Scholar] [CrossRef]
Fowler, D.M.; Fields, S. Deep mutational scanning: A new style of protein science. Nat. Methods 2014, 11, 801–807. [Google Scholar] [CrossRef]
Tee, K.L.; Wong, T.S. Polishing the craft of genetic diversity creation in directed evolution. Biotechnol. Adv. 2013, 31, 1707–1721. [Google Scholar] [CrossRef]
Drummond, D.A.; Iverson, B.L.; Georgiou, G.; Arnold, F.H. Why High-error-rate Random Mutagenesis Libraries are Enriched in Functional and Improved Proteins. J. Mol. Biol. 2005, 350, 806–816. [Google Scholar] [CrossRef]
Lim, B.N.; Choong, Y.S.; Ismail, A.; Glökler, J.; Konthur, Z.; Lim, T.S. Directed evolution of nucleotide-based libraries using lambda exonuclease. BioTechniques 2012, 53, 357–364. [Google Scholar] [CrossRef]
Krumpe, L.R.; Schumacher, K.M.; McMahon, J.B.; Makowski, L.; Mori, T. Trinucleotide cassettes increase diversity of T7 phage-displayed peptide library. BMC Biotechnol. 2007, 7, 65. [Google Scholar] [CrossRef]
Hanning, K.R.; Minot, M.; Warrender, A.K.; Kelton, W.; Reddy, S.T. Deep mutational scanning for therapeutic antibody engineering. Trends Pharmacol. Sci. 2022, 43, 123–135. [Google Scholar] [CrossRef] [PubMed]
Firnberg, E.; Ostermeier, M. PFunkel: Efficient, expansive, user-defined mutagenesis. PLoS ONE 2012, 7, e52031. [Google Scholar] [CrossRef] [PubMed]
Kowalsky, C.A.; Faber, M.S.; Nath, A.; Dann, H.E.; Kelly, V.W.; Liu, L.; Shanker, P.; Wagner, E.K.; Maynard, J.A.; Chan, C.; et al. Rapid fine conformational epitope mapping using comprehensive mutagenesis and deep sequencing. J. Biol. Chem. 2015, 290, 26457–26470. [Google Scholar] [CrossRef] [PubMed]
Mighell, T.L.; Toledano, I.; Lehner, B. SUNi mutagenesis: Scalable and uniform nicking for efficient generation of variant libraries. PLoS ONE 2023, 18, e0288158. [Google Scholar] [CrossRef]
Oh, E.J.; Liu, R.; Liang, L.; Freed, E.F.; Eckert, C.A.; Gill, R.T. Multiplex evolution of antibody fragments utilizing a yeast surface display platform. ACS Synth. Biol. 2020, 9, 2197–2202. [Google Scholar] [CrossRef]
Mason, D.M.; Weber, C.R.; Parola, C.; Meng, S.M.; Greiff, V.; Kelton, W.J.; Reddy, S.T. High-throughput antibody engineering in mammalian cells by CRISPR/Cas9-mediated homology-directed mutagenesis. Nucleic Acids Res. 2018, 46, 7436–7449. [Google Scholar] [CrossRef]
Ahmed, S.; Bhasin, M.; Manjunath, K.; Varadarajan, R. Prediction of residue-specific contributions to binding and thermal stability using yeast surface display. Front. Mol. Biosci. 2022, 8, 800819. [Google Scholar] [CrossRef]
Fujino, Y.; Fujita, R.; Wada, K.; Fujishige, K.; Kanamori, T.; Hunt, L.; Shimizu, Y.; Ueda, T. Robust in vitro affinity maturation strategy based on interface-focused high-throughput mutational scanning. Biochem. Biophys. Res. Commun. 2012, 428, 395–400. [Google Scholar] [CrossRef] [PubMed]
Forsyth, C.M.; Juan, V.; Akamatsu, Y.; DuBridge, R.B.; Doan, M.; Ivanov, A.V.; Ma, Z.; Polakoff, D.; Razo, J.; Wilson, K.; et al. Deep mutational scanning of an antibody against epidermal growth factor receptor using mammalian cell display and massively parallel pyrosequencing. In mAbs; Taylor & Francis: Abingdon, UK, 2013; Volume 5, pp. 523–532. [Google Scholar]
Koch, P.; Schmitt, S.; Heynisch, A.; Gumpinger, A.; Wüthrich, I.; Gysin, M.; Shcherbakov, D.; Hobbie, S.N.; Panke, S.; Held, M. Optimization of the antimicrobial peptide Bac7 by deep mutational scanning. BMC Biol. 2022, 20, 114. [Google Scholar] [CrossRef] [PubMed]
Traxlmayr, M.W.; Hasenhindl, C.; Hackl, M.; Stadlmayr, G.; Rybka, J.D.; Borth, N.; Grillari, J.; Rüker, F.; Obinger, C. Construction of a stability landscape of the CH3 domain of human IgG1 by combining directed evolution with high throughput sequencing. J. Mol. Biol. 2012, 423, 397–412. [Google Scholar] [CrossRef] [PubMed]
Vazquez-Lombardi, R.; Jung, J.S.; Schlatter, F.S.; Mei, A.; Mantuano, N.R.; Bieberich, F.; Hong, K.-L.; Kucharczyk, J.; Kapetanovic, E.; Aznauryan, E.; et al. High-throughput T cell receptor engineering by functional screening identifies candidates with enhanced potency and specificity. Immunity 2022, 55, 1953–1966.e10. [Google Scholar] [CrossRef]
Abdelfattah, N.S.; Kula, T.; Elledge, S.J. T-Switch: A specificity-based engineering platform for developing safe and effective T cell therapeutics. Immunity 2024, 57, 2945–2958.e5. [Google Scholar] [CrossRef]
Ledsgaard, L.; Ljungars, A.; Rimbault, C.; Sørensen, C.V.; Tulika, T.; Wade, J.; Wouters, Y.; McCafferty, J.; Laustsen, A.H. Advances in antibody phage display technology. Drug Discov. Today 2022, 27, 2151–2169. [Google Scholar] [CrossRef]
Villemagne, D.; Jackson, R.; Douthwaite, J.A. Highly efficient ribosome display selection by use of purified components for in vitro translation. J. Immunol. Methods 2006, 313, 140–148. [Google Scholar] [CrossRef]
Schofield, D.J.; Pope, A.R.; Clementel, V.; Buckell, J.; Chapple, S.D.; Clarke, K.F.; Conquer, J.S.; Crofts, A.M.; Crowther, S.R.; Dyson, M.R.; et al. Application of phage display to high throughput antibody generation and characterization. Genome Biol. 2007, 8, R254. [Google Scholar] [CrossRef]
Araya, C.L.; Fowler, D.M. Deep mutational scanning: Assessing protein function on a massive scale. Trends Biotechnol. 2011, 29, 435–442. [Google Scholar] [CrossRef]
Van Dijk, E.L.; Jaszczyszyn, Y.; Naquin, D.; Thermes, C. The third revolution in sequencing technology. Trends Genet. 2018, 34, 666–681. [Google Scholar] [CrossRef]
Wang, Y.; Zhao, Y.; Bollas, A.; Wang, Y.; Au, K.F. Nanopore sequencing technology, bioinformatics and applications. Nat. Biotechnol. 2021, 39, 1348–1365. [Google Scholar] [CrossRef] [PubMed]
Fowler, D.M.; Araya, C.L.; Gerard, W.; Fields, S. Enrich: Software for analysis of protein function by enrichment and depletion of variants. Bioinformatics 2011, 27, 3430–3431. [Google Scholar] [CrossRef] [PubMed]
Rubin, A.F.; Gelman, H.; Lucas, N.; Bajjalieh, S.M.; Papenfuss, A.T.; Speed, T.P.; Fowler, D.M. A statistical framework for analyzing deep mutational scanning data. Genome Biol. 2017, 18, 150. [Google Scholar] [CrossRef] [PubMed]
Greaney, A.J.; Loes, A.N.; Crawford, K.H.; Starr, T.N.; Malone, K.D.; Chu, H.Y.; Bloom, J.D. Comprehensive mapping of mutations in the SARS-CoV-2 receptor-binding domain that affect recognition by polyclonal human plasma antibodies. Cell Host Microbe. 2021, 29, 463–476.e6. [Google Scholar] [CrossRef]
Faure, A.J.; Schmiedel, J.M.; Baeza-Centurion, P.; Lehner, B. DiMSum: An error model and pipeline for analyzing deep mutational scanning data and diagnosing common experimental pathologies. Genome Biol. 2020, 21, 207. [Google Scholar] [CrossRef]
Bloom, J.D. Software for the analysis and visualization of deep mutational scanning data. BMC Bioinform. 2015, 16, 168. [Google Scholar] [CrossRef]
Dingens, A.S.; Haddox, H.K.; Overbaugh, J.; Bloom, J.D. Comprehensive mapping of HIV-1 escape from a broadly neutralizing antibody. Cell Host Microbe 2017, 21, 777–787.e4. [Google Scholar] [CrossRef]
Lee, J.M.; Eguia, R.; Zost, S.J.; Choudhary, S.; Wilson, P.C.; Bedford, T.; Stevens-Ayers, T.; Boeckh, M.; Hurt, A.C.; Lakdawala, S.S.; et al. Mapping person-to-person variation in viral mutations that escape polyclonal serum targeting influenza hemagglutinin. eLife 2019, 8, e49324. [Google Scholar] [CrossRef]
Norman, R.A.; Ambrosetti, F.; Bonvin, A.M.J.J.; Colwell, L.J.; Kelm, S.; Kumar, S.; Krawczyk, K. Computational approaches to therapeutic antibody design: Established methods and emerging trends. Brief. Bioinform. 2020, 21, 1549–1567. [Google Scholar] [CrossRef]
Olson, C.A.; Wu, N.C.; Sun, R. A comprehensive biophysical description of pairwise epistasis throughout an entire protein domain. Curr. Biol. 2014, 24, 2643–2651. [Google Scholar] [CrossRef] [PubMed]
Otwinowski, J.; McCandlish, D.M.; Plotkin, J.B. Inferring the shape of global epistasis. Proc. Natl. Acad. Sci. USA. 2018, 115, E7550–E7558. [Google Scholar] [CrossRef] [PubMed]
Koenig, P.; Lee, C.V.; Sanowar, S.; Wu, P.; Stinson, J.; Harris, S.F.; Fuh, G. Deep Sequencing-guided Design of a High Affinity Dual Specificity Antibody to Target Two Angiogenic Factors in Neovascular Age-related Macular Degeneration. J. Biol. Chem. 2015, 290, 21773–21786. [Google Scholar] [CrossRef] [PubMed]
Liang, S.; Zhang, C. Prediction of immunogenicity for humanized and full human therapeutic antibodies. PLoS ONE 2020, 15, e0238150. [Google Scholar] [CrossRef]
Sarri, C.A.; Papadopoulos, G.E.; Papa, A.; Tsakris, A.; Pervanidou, D.; Baka, A.; Politis, C.; Billinis, C.; Hadjichristodoulou, C.; Mamuris, Z.; et al. Amino acid signatures in the HLA class II peptide-binding region associated with protection/susceptibility to the severe West Nile Virus disease. PLoS ONE 2018, 13, e0205557. [Google Scholar] [CrossRef]
Sivelle, C.; Sierocki, R.; Lesparre, Y.; Lomet, A.; Quintilio, W.; Dubois, S.; Correia, E.; Moro, A.M.; Maillère, B.; Nozach, H. Combining deep mutational scanning to heatmap of HLA class II binding of immunogenic sequences to preserve functionality and mitigate predicted immunogenicity. Front. Immunol. 2023, 14, 1197919. [Google Scholar] [CrossRef]
Narayanan, K.K.; Procko, E. Deep mutational scanning of viral glycoproteins and their host receptors. Front. Mol. Biosci. 2021, 8, 636660. [Google Scholar] [CrossRef]
Greaney, A.J.; Starr, T.N.; Gilchuk, P.; Zost, S.J.; Binshtein, E.; Loes, A.N.; Hilton, S.K.; Huddleston, J.; Eguia, R.; Crawford, K.H.; et al. Complete mapping of mutations to the SARS-CoV-2 spike receptor-binding domain that escape antibody recognition. Cell Host Microbe 2021, 29, 44–57.e9. [Google Scholar] [CrossRef]
Greaney, A.J.; Starr, T.N.; Barnes, C.O.; Weisblum, Y.; Schmidt, F.; Caskey, M.; Gaebler, C.; Cho, A.; Agudelo, M.; Finkin, S.; et al. Mapping mutations to the SARS-CoV-2 RBD that escape binding by different classes of antibodies. Nat. Commun. 2021, 12, 4196. [Google Scholar] [CrossRef]
Javanmardi, K.; Segall-Shapiro, T.H.; Chou, C.-W.; Boutz, D.R.; Olsen, R.J.; Xie, X.; Xia, H.; Shi, P.-Y.; Johnson, C.D.; Annapareddy, A.; et al. Antibody escape and cryptic cross-domain stabilization in the SARS-CoV-2 Omicron spike protein. Cell Host Microbe 2022, 30, 1242–1254.e6. [Google Scholar] [CrossRef] [PubMed]
Starr, T.N.; Greaney, A.J.; Hilton, S.K.; Ellis, D.; Crawford, K.H.D.; Dingens, A.S.; Navarro, M.J.; Bowen, J.E.; Tortorici, M.A.; Walls, A.C.; et al. Deep mutational scanning of SARS-CoV-2 receptor binding domain reveals constraints on folding and ACE2 binding. Cell 2020, 182, 1295–1310.e20. [Google Scholar] [CrossRef]
Dadonaite, B.; Crawford, K.H.; Radford, C.E.; Farrell, A.G.; Yu, T.C.; Hannon, W.W.; Zhou, P.; Andrabi, R.; Burton, D.R.; Liu, L.; et al. A pseudovirus system enables deep mutational scanning of the full SARS-CoV-2 spike. Cell 2023, 186, 1263–1278.e20. [Google Scholar] [CrossRef]
Dadonaite, B.; Brown, J.; McMahon, T.E.; Farrell, A.G.; Asarnow, D.; Stewart, C.; Logue, J.; Murrell, B.; Chu, H.Y.; Veesler, D.; et al. Full-spike deep mutational scanning helps predict the evolutionary success of SARS-CoV-2 clades. bioRxiv 2023. [Google Scholar] [CrossRef]
Sourisseau, M.; Lawrence, D.J.P.; Schwarz, M.C.; Storrs, C.H.; Veit, E.C.; Bloom, J.D.; Evans, M.J. Deep mutational scanning comprehensively maps how Zika envelope protein mutations affect viral growth and antibody escape. J. Virol. 2019, 93, e01291-19. [Google Scholar] [CrossRef]
Yang, M.; Dent, M.; Lai, H.; Sun, H.; Chen, Q. Immunization of Zika virus envelope protein domain III induces specific and neutralizing immune responses against Zika virus. Vaccine 2017, 35, 4287–4294. [Google Scholar] [CrossRef]
Stettler, K.; Beltramello, M.; Espinosa, D.A.; Graham, V.; Cassotta, A.; Bianchi, S.; Vanzetta, F.; Minola, A.; Jaconi, S.; Mele, F.; et al. Specificity, cross-reactivity, and function of antibodies elicited by Zika virus infection. Science 2016, 353, 823–826. [Google Scholar] [CrossRef]
Wu, N.C.; Young, A.P.; Al-Mawsawi, L.Q.; Olson, C.A.; Feng, J.; Qi, H.; Chen, S.-H.; Lu, I.-H.; Lin, C.-Y.; Chin, R.G.; et al. High-throughput profiling of influenza A virus hemagglutinin gene at single-nucleotide resolution. Sci. Rep. 2014, 4, 4942. [Google Scholar] [CrossRef]
Doud, M.B.; Bloom, J.D. Accurate measurement of the effects of all amino-acid mutations on influenza hemagglutinin. Viruses 2016, 8, 155. [Google Scholar] [CrossRef] [PubMed]
Thyagarajan, B.; Bloom, J.D. The inherent mutational tolerance and antigenic evolvability of influenza hemagglutinin. eLife 2014, 3, e03300. [Google Scholar] [CrossRef] [PubMed]
Lee, J.M.; Huddleston, J.; Doud, M.B.; Hooper, K.A.; Wu, N.C.; Bedford, T.; Bloom, J.D. Deep mutational scanning of hemagglutinin helps predict evolutionary fates of human H3N2 influenza variants. Proc. Natl. Acad. Sci. USA 2018, 115, E8276–E8285. [Google Scholar] [CrossRef] [PubMed]
Rosenberg, A.M.; Ayres, C.M.; Medina-Cucurella, A.V.; Whitehead, T.A.; Baker, B.M. Enhanced T cell receptor specificity through framework engineering. Front. Immunol. 2024, 15, 1345368. [Google Scholar] [CrossRef] [PubMed]
Foote, J.; Eisen, H.N. Breaking the affinity ceiling for antibodies and T cell receptors. Proc. Natl. Acad. Sci. USA 2000, 97, 10679–10681. [Google Scholar] [CrossRef] [PubMed]
Sharma, P.; Kranz, D.M. Subtle changes at the variable domain interface of the T-cell receptor can strongly increase affinity. J. Biol. Chem. 2018, 293, 1820–1834. [Google Scholar] [CrossRef]
Harris, D.T.; Wang, N.; Riley, T.P.; Anderson, S.D.; Singh, N.K.; Procko, E.; Baker, B.M.; Kranz, D.M. Deep mutational scans as a guide to engineering high affinity T cell receptor interactions with peptide-bound major histocompatibility complex. J. Biol. Chem. 2016, 291, 24566–24578. [Google Scholar] [CrossRef]
Calis, J.J.A.; De Boer, R.J.; Keşmir, C. Degenerate T-cell recognition of peptides on MHC molecules creates large holes in the T-cell repertoire. PLoS Comput. Biol. 2012, 8, e1002412. [Google Scholar] [CrossRef]
Armstrong, K.M.; Piepenbrink, K.H.; Baker, B.M. Conformational changes and flexibility in T-cell receptor recognition of peptide–MHC complexes. Biochem. J. 2008, 415, 183–196. [Google Scholar] [CrossRef]
Mason, D. A very high level of crossreactivity is an essential feature of the T-cell receptor. Immunol. Today 1998, 19, 395–404. [Google Scholar] [CrossRef]
Bowerman, N.A.; Crofts, T.S.; Chlewicki, L.; Do, P.; Baker, B.M.; Garcia, K.C.; Kranz, D.M. Engineering the binding properties of the T cell receptor: Peptide: MHC ternary complex that governs T cell activity. Mol. Immunol. 2009, 46, 3000–3008. [Google Scholar] [CrossRef][Green Version]
Holler, P.D.; Holman, P.O.; Shusta, E.V.; O’Herrin, S.; Wittrup, K.D.; Kranz, D.M. In vitro evolution of a T cell receptor with high affinity for peptide/MHC. Proc. Natl. Acad. Sci. USA 2000, 97, 5387–5392. [Google Scholar] [CrossRef]
Cole, D.K.; Sami, M.; Scott, D.R.; Rizkallah, P.J.; Borbulevych, O.Y.; Todorov, P.T.; Moysey, R.K.; Jakobsen, B.K.; Boulter, J.M.; Baker, B.M.; et al. Increased peptide contacts govern high affinity binding of a modified TCR whilst maintaining a native pMHC docking mode. Front. Immunol. 2013, 4, 168. [Google Scholar] [CrossRef]
Matuszewski, S.; E Hildebrandt, M.; Ghenu, A.-H.; Jensen, J.D.; Bank, C. A statistical guide to the design of deep mutational scanning experiments. Genetics 2016, 204, 77–87. [Google Scholar] [CrossRef]
Tang, S.; Kim, P.S. A high-affinity human PD-1/PD-L2 complex informs avenues for small-molecule immune checkpoint drug discovery. Proc. Natl. Acad. Sci. USA 2019, 116, 24500–24506. [Google Scholar] [CrossRef] [PubMed]
Findlay, G.M.; Daza, R.M.; Martin, B.; Zhang, M.D.; Leith, A.P.; Gasperini, M.; Janizek, J.D.; Huang, X.; Starita, L.M.; Shendure, J. Accurate classification of BRCA1 variants with saturation genome editing. Nature 2018, 562, 217–222. [Google Scholar] [CrossRef] [PubMed]
Starita, L.M.; Young, D.L.; Islam, M.; O Kitzman, J.; Gullingsrud, J.; Hause, R.J.; Fowler, D.M.; Parvin, J.D.; Shendure, J.; Fields, S. Massively parallel functional analysis of BRCA1 RING domain variants. Genetics 2015, 200, 413–422. [Google Scholar] [CrossRef] [PubMed]
Kotler, E.; Shani, O.; Goldfeld, G.; Lotan-Pompan, M.; Tarcic, O.; Gershoni, A.; Hopf, T.A.; Marks, D.S.; Oren, M.; Segal, E. A systematic p53 mutation library links differential functional impact to cancer mutation pattern and evolutionary conservation. Mol. Cell 2018, 71, 178–190.e8. [Google Scholar]
Matreyek, K.A.; Starita, L.M.; Stephany, J.J.; Martin, B.; Chiasson, M.A.; Gray, V.E.; Kircher, M.; Khechaduri, A.; Dines, J.N.; Hause, R.J.; et al. Multiplex assessment of protein variant abundance by massively parallel sequencing. Nat. Genet. 2018, 50, 874–882. [Google Scholar] [CrossRef]
Mighell, T.L.; Evans-Dutson, S.; O’Roak, B.J. A saturation mutagenesis approach to understanding PTEN lipid phosphatase activity and genotype-phenotype relationships. Am. J. Hum. Genet. 2018, 102, 943–955. [Google Scholar] [CrossRef]
Funk, J.S.; Klimovich, M.; Drangenstein, D.; Pielhoop, O.; Hunold, P.; Borowek, A.; Noeparast, M.; Pavlakis, E.; Neumann, M.; Balourdas, D.-I.; et al. Deep CRISPR mutagenesis characterizes the functional diversity of TP53 mutations. Nat. Genet. 2025, 57, 140–153. [Google Scholar] [CrossRef]
Livesey, B.J.; Marsh, J.A. Variant effect predictor correlation with functional assays is reflective of clinical classification performance. Genome Biol. 2025, 26, 104. [Google Scholar] [CrossRef]
Bennett, G.; Karbassi, I.; Chen, W.; Harrison, S.M.; Lebo, M.S.; Meng, L.; Nagan, N.; Rigobello, R.; Rehm, H.L. Distinct rates of VUS reclassification are observed when subclassifying VUS by evidence level. Genet. Med. 2025, 27, 101400. [Google Scholar] [CrossRef]
Blutt, S.E.; Estes, M.K. Organoid models for infectious disease. Annu. Rev. Med. 2022, 73, 167–182. [Google Scholar] [CrossRef]
Starita, L.M.; Ahituv, N.; Dunham, M.J.; Kitzman, J.O.; Roth, F.P.; Seelig, G.; Shendure, J.; Fowler, D.M. Variant interpretation: Functional assays to the rescue. Am. J. Hum. Genet. 2017, 101, 315–325. [Google Scholar] [CrossRef] [PubMed]
Doudna, J.; Charpentier, E. Genome editing. The new frontier of genome engineering with CRISPR-Cas9. Science 2014, 346, 6213. [Google Scholar] [CrossRef] [PubMed]
Rees, H.A.; Liu, D.R. Base editing: Precision chemistry on the genome and transcriptome of living cells. Nat. Rev. Genet. 2018, 19, 770–788. [Google Scholar] [CrossRef] [PubMed]
Wang, T.; Wei, J.J.; Sabatini, D.M.; Lander, E.S. Genetic screens in human cells using the CRISPR-Cas9 system. Science 2014, 343, 80–84. [Google Scholar] [CrossRef]
Hart, T.; Chandrashekhar, M.; Aregger, M.; Steinhart, Z.; Brown, K.R.; MacLeod, G.; Mis, M.; Zimmermann, M.; Fradet-Turcotte, A.; Sun, S.; et al. High-resolution CRISPR screens reveal fitness genes and genotype-specific cancer liabilities. Cell 2015, 163, 1515–1526. [Google Scholar] [CrossRef]
Sadhu, M.J.; Bloom, J.S.; Day, L.; Siegel, J.J.; Kosuri, S.; Kruglyak, L. Highly parallel genome variant engineering with CRISPR–Cas9. Nat. Genet. 2018, 50, 510–514. [Google Scholar] [CrossRef]
Hanna, R.E.; Hegde, M.; Fagre, C.R.; DeWeirdt, P.C.; Sangree, A.K.; Szegletes, Z.; Griffith, A.; Feeley, M.N.; Sanson, K.R.; Baidi, Y.; et al. Massively parallel assessment of human variants with base editor screens. Cell 2021, 184, 1064–1080.e20. [Google Scholar] [CrossRef]
Ménoret, S.; Tesson, L.; Remy, S.; Usal, C.; Ouisse, L.-H.; Brusselle, L.; Chenouard, V.; Anegon, I. Advances in transgenic animal models and techniques. Transgenic Res. 2017, 26, 703–708. [Google Scholar] [CrossRef]
Houdebine, L.M. Transgenic animal models in biomedical research. In Target Discovery and Validation Reviews and Protocols: Volume 1, Emerging Strategies for Targets and Biomarker Discovery; Humana Press: Totowa, NJ, USA, 2007; pp. 163–202. [Google Scholar]
Cervenak, J.; Kurrle, R.; Kacskovics, I. Accelerating antibody discovery using transgenic animals overexpressing the neonatal Fc receptor as a result of augmented humoral immunity. Immunol. Rev. 2015, 268, 269–287. [Google Scholar] [CrossRef]
Shakweer, W.M.E.; Krivoruchko, A.Y.; Dessouki, S.; Khattab, A.A. A review of transgenic animal techniques and their applications. J. Genet. Eng. Biotechnol. 2023, 21, 55. [Google Scholar] [CrossRef]
Yuki, K.; Cheng, N.; Nakano, M.; Kuo, C.J. Organoid models of tumor immunology. Trends Immunol. 2020, 41, 652–664. [Google Scholar] [CrossRef] [PubMed]
Bar-Ephraim, Y.E.; Kretzschmar, K.; Clevers, H. Organoids in immunological research. Nat. Rev. Immunol. 2020, 20, 279–293. [Google Scholar] [CrossRef] [PubMed]
Han, Y.; Yang, L.; Lacko, L.A.; Chen, S. Human organoid models to study SARS-CoV-2 infection. Nat. Methods 2022, 19, 418–428. [Google Scholar] [CrossRef] [PubMed]
Bruskin, S.; Ishkin, A.; Nikolsky, Y.; Nikolskaya, T.; Piruzian, E. Analysis of Transcriptomic and Proteomic Data in Immune-Mediated Diseases. In Computational Biology and Applied Bioinformatics; IntechOpen: Rijeka, Croatia, 2011. [Google Scholar]
Franciosa, G.; Kverneland, A.H.; Jensen, A.W.P.; Donia, M.; Olsen, J.V. Proteomics to study cancer immunity and improve treatment. In Seminars in Immunopathology; Springer: Berlin/Heidelberg, Germany, 2023; Volume 45, pp. 241–251. [Google Scholar]
Berge, T.; Eriksson, A.; Brorson, I.S.; Høgestøl, E.A.; Berg-Hansen, P.; Døskeland, A.; Mjaavatten, O.; Bos, S.D.; Harbo, H.F.; Berven, F. Quantitative proteomic analyses of CD4+ and CD8+ T cells reveal differentially expressed proteins in multiple sclerosis patients and healthy controls. Clin. Proteom. 2019, 16, 19. [Google Scholar] [CrossRef]
Rollins, N.J.; Brock, K.P.; Poelwijk, F.J.; Stiffler, M.A.; Gauthier, N.P.; Sander, C.; Marks, D.S. Inferring protein 3D structure from deep mutation scans. Nat. Genet. 2019, 51, 1170–1176. [Google Scholar] [CrossRef]

Figure 1. The workflow of deep mutational scanning. ➀ Saturation mutation is performed on the DNA codons at the target site of the target gene region, so that each position contains codons for all amino acids except the wild type, and each mutant sequence generated is collected to generate a mutation library. ➁ The mutant library is expressed based on a suitable protein display model (cell model or non-cell model), and functional variants are screened by giving specific functional selection (affinity or stability, etc.). ➂ The sequence information of the screened variants is obtained by high-throughput sequencing, and data visualization is achieved through a heatmap, etc. The data information generated based on DMS helps to establish the association between mutant sequences and protein functions.

Table 1. Display platform for DMS.

Display Type	Characteristic	Advantages	Limitation	Refs.
Yeast display	The target fragment (such as an antibody fragment, receptor or antigen mutant) is anchored and fused to the yeast cell surface so that it is fixed on the yeast surface for display.	Eukaryotic system and capable of some post-translational modifications Proven gene manipulation and library construction methods Suitable for large-scale mutation library screening	Human proteins are not suitable for complex folding or require specific glycosylation	[22]
Mammalian display	Rely on viral or plasmid vectors to introduce mutants into cells one by one, and display them inside cells or on the surface through transmembrane anchoring or secretion capture.	Closest to physiological conditions Retains intact post-translational modifications (glycosylation, phosphorylation, etc.) Suitable for antibody drug and immune receptor research	High cost Lower throughput than yeast/phage, limited library size (typically ≤1 × 10⁶)	[16,23,24]
Phage display	The target protein (usually an antibody fragment) is fused and expressed on the phage coat protein, and screening is achieved through phage proliferation and selection.	Large library size (1 × 10⁹–1 × 10¹²) Low cost and high throughput Widely used in antibody discovery	Lack of eukaryotic post-translational modifications Some proteins fold inefficiently	[16]
Ribosome display	Generating ribosome-mRNA-protein complexes through stop-codon-free translation enables genotype-phenotype coupling, which is then screened through ligand binding and high-throughput sequencing.	No cloning or cell culture required Library size can reach >1 × 10¹² Suitable for proteins that are toxic or difficult to express in cells	RNA is prone to degradation and the system exhibits limited stability Lacks post-translational modifications	[16]

Table 2. Current Challenges and Potential Solutions for DMS.

Application	Challenge	Potential Solutions	Refs.
Antibody optimization	Some mutations may significantly improve antigen binding (high affinity), but may also disrupt antibody expression or structure, making it difficult to assess in vivo function.	Combine structural modeling to predict conformational stability and verify antibody function through organoid models or other in vivo experiments.	[16,42]
Antigen escape	Epitope regions are complex and constantly mutate under host immune pressure. A single cell-based or cell-free screening platform may not truly reflect the infection process (it cannot simulate the interaction between immune cells and pathways).	Design multi-site combination mutations and verify escape variants in combination with organoid or animal models.	[49,53,82]
TCR recognition	TCR–MHC interactions are highly dependent on MHC context and peptide conformation.	Designing multi-MHC parallel DMS, combining structural simulation and functional screening.	[63]
Complex or poorly defined immune molecules	Unable to build a function-dependent screening system, making it difficult to quantify mutation function through high-throughput.	Joint proteome/transcriptome prediction functional modules.	[11,84]
Clinical diagnosis	Lack of large-scale immunology database combined with DMS research, the immune system is highly personalized.	Combined database cross-analysis.	[81]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Shao, C.; Jia, S.; Li, Y.; Li, J. Deep Mutational Scanning in Immunology: Techniques and Applications. Pathogens 2025, 14, 1027. https://doi.org/10.3390/pathogens14101027

AMA Style

Shao C, Jia S, Li Y, Li J. Deep Mutational Scanning in Immunology: Techniques and Applications. Pathogens. 2025; 14(10):1027. https://doi.org/10.3390/pathogens14101027

Chicago/Turabian Style

Shao, Chengwei, Siyue Jia, Yue Li, and Jingxin Li. 2025. "Deep Mutational Scanning in Immunology: Techniques and Applications" Pathogens 14, no. 10: 1027. https://doi.org/10.3390/pathogens14101027

APA Style

Shao, C., Jia, S., Li, Y., & Li, J. (2025). Deep Mutational Scanning in Immunology: Techniques and Applications. Pathogens, 14(10), 1027. https://doi.org/10.3390/pathogens14101027

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Deep Mutational Scanning in Immunology: Techniques and Applications

Abstract

1. Introduction

2. Deep Mutational Scanning Methods and Process

2.1. Construction of the Mutational Library

2.2. Functional Screening

2.3. High-Throughput Sequencing and Data Analysis

3. Application of Deep Mutational Scanning in Immunology

3.1. Antibody Engineering

3.2. Antigen Epitope Identification

3.3. Recognition by T Cell Receptors

4. Conclusions and Future Perspectives

4.1. Challenges and Limitations

4.2. Future Perspectives

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI