Next Article in Journal
Association of Obesity-Related Genetic Variants with Android Fat Patterning and Cardiometabolic Risk in Women
Previous Article in Journal
Chloroplast Genome Evolution and Codon Usage In the Medicinal Plant Pothos chinensis (Araceae)
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Identification of Novel Gene Cluster Potentially Associated with Insecticide Resistance in Anopheles gambiae s.l.

by
Hyacinthe Dipina Ki
1,2,*,
Mahamadi Kientega
1,
Sabéré O. G. Yemien
1,2,
Hamidou Maiga
1,
Nouhoun Traoré
1,2,
Koama Bayili
1,
Moussa Namountougou
2 and
Abdoulaye Diabaté
1,*
1
Institut de Recherche en Sciences de la Santé (IRSS), Bobo-Dioulasso 01 BP 545, Burkina Faso
2
Unité de Formation et de Recherche en Sciences de la Vie et de la Terre, Université Nazi Boni, Bobo-Dioulasso 01 BP 1091, Burkina Faso
*
Authors to whom correspondence should be addressed.
Genes 2025, 16(9), 1018; https://doi.org/10.3390/genes16091018
Submission received: 16 July 2025 / Revised: 13 August 2025 / Accepted: 15 August 2025 / Published: 28 August 2025
(This article belongs to the Section Animal Genetics and Genomics)

Abstract

Background/Objectives: Despite the increasing emergence of resistance, insecticide-based tools remain the primary method for malaria vector control in Africa. To maintain the effectiveness of these interventions, continuous monitoring and identification of novel resistance mechanisms is essential. This study aimed to investigate potential new insecticide resistance genes in the Anopheles gambiae complex. Methods: We analyzed whole-genome sequencing data from the An. gambiae 1000 Genomes Project. A broad range of genomic analysis techniques and tools were used to identify and explore genetic variation in the candidate resistance genes. Results: High haplotype homozygosity values, indicative of positive selection, were detected in a 2L chromosomal region corresponding to an aldehyde oxidase gene cluster (AGAP006220, AGAP006221, AGAP006224, AGAP006225, AGAP006226). Single nucleotide polymorphisms (SNPs) have been identified in these genes with frequencies up to 100%, including 569, 691, 1433, 978, and 811 non-synonymous SNPs in AGAP006220, AGAP006221, AGAP006224, AGAP006225, and AGAP006226, respectively. Copy number variations (CNVs) such as deletions and amplifications were also identified at low frequencies (<12%). Population structure analyses revealed adaptive and geographic gene flow between An. gambiae and An. coluzzii. Conclusions: This study provides evidence that aldehyde oxidase genes may contribute to insecticide resistance in An. gambiae s.l. populations. These results highlight the importance of genomic surveillance for detecting novel resistance loci and guiding the development of improved vector control strategies under changing ecological and evolutionary conditions.

1. Introduction

Malaria remains one of the leading causes of morbidity and mortality in sub-Saharan Africa, especially among children under five years and pregnant women [1]. Despite recent advances in malaria vaccine development, insecticide-based interventions, including long-lasting insecticidal nets (LLINs) and indoor residual spraying (IRS), remain the most effective prevention methods [2]. The success of these interventions depends on a thorough understanding of vector population dynamics, especially their susceptibility to insecticides, to guide the optimal deployment of control tools. In mosquitoes, two major mechanisms drive insecticide resistance: target-site and metabolic resistance [3,4,5]. Target-site resistance involves genetic changes that alter the protein targeted by the insecticide, whereas metabolic resistance results from changes in the sequence or expression of detoxification enzymes and pathways that metabolize or sequester the insecticide [6]. In addition to these mechanisms, behavioral and cuticular resistance have also been described in malaria vectors [7]. The rise in resistance remains a major concern for malaria control and calls for continued resistance surveillance and rapid development of alternative control tools to accelerate malaria elimination.
Although target-site resistance has been extensively described in the African malaria vector Anopheles gambiae s.l., metabolic resistance remains poorly characterized [8], likely due to the number of gene families involved in detoxification, redundancy among their members, and the multiple mechanisms by which metabolic resistance can arise [9]. However, research on this mechanism is becoming increasingly critical, as recent studies have shown a growing contribution of detoxification genes to the resistance phenotype in An. gambiae s.l. populations [10,11,12]. This trend is likely driven by novel mutations in genes encoding detoxifying enzymes that are associated with resistance phenotypes in members of the An. gambiae complex [13]. Key detoxifying enzymes include cytochrome P450 monooxygenases, glutathione S-transferases, and carboxylesterases [14].
A central challenge in malaria vector control is the identification of the genomic basis of resistance in ways that can inform field-based control programs [15]. Traditionally, insecticide resistance has been monitored using phenotypic assays such as WHO tube tests or CDC bottle bioassays. While useful, these methods are labor-intensive, time-consuming, and require substantial logistical resources. Advances in genomics now provide powerful alternatives, enabling the detection and surveillance of molecular markers associated with resistance in natural mosquito populations [16]. To date, most genomic studies have focused on regions of the genome already linked to resistance. However, given the rapid and dynamic evolution of insecticide resistance, including the emergence of novel mechanisms, whole-genome analysis offers an optimal approach. Genome-wide scans for signatures of adaptive evolution, such as selective sweeps, provide a comprehensive framework for detecting resistance loci beyond known targets.
In this study, we investigated genetic variation within the genomic region containing aldehyde oxidase genes and its potential implications for insecticide resistance in An. gambiae s.l. populations.

2. Materials and Methods

2.1. Data Collection and Analysis

The genomic data analyzed in this study were obtained from the An. gambiae 1000 Genomes Project, phase 3 (Ag1000G; dataset Ag3.0), released in February 2021 [17]. Mosquitoes representing three Anopheles species, An. gambiae, An. coluzzii, and An. arabiensis, were collected between 2004 and 2015 in 11 countries across sub-Saharan Africa (Figure 1, Table S1). Detailed protocols for mosquito sampling, specimen storage, and genomic data management, including access rights, are available on the MalariaGEN website [18].
Library preparation and sequencing were performed at the Wellcome Sanger Institute. All specimens were sequenced to a target coverage of 30× using the Illumina HiSeq 2000 and the Illumina HiSeq X platforms [18].
Alignment, SNP calling, and sample quality control (QC) were carried out by the MalariaGEN Resource Centre Team. Sequencing reads were aligned to the AgamP4 reference genome using BWA version 0.7.15. Indel realignment was performed with the GATK version 3.7-0 RealignerTargetCreator and IndelRealigner, and SNPs were called using the GATK version 3.7-0 UnifiedGenotyper. Genotypes were called for each sample independently, considering all possible alleles at genomic sites where the reference base was not “N.” Following variant calling, both samples and variants underwent multiple QC analyses to ensure data integrity and accuracy [18].

2.2. Detection of Genome Regions Under Recent Positive Selection and Identification of New Candidate Genes

Genome-wide scans for selection (GWSS) were performed on An. gambiae s.l. populations to detect signals of recent selection using the Garud H12 statistic [16], a metric particularly sensitive to recent selective sweeps, such as those driven by insecticide pressure. Analyses were first performed at the country level and then stratified by both country and species. For each analysis, the window size, defined by a specific number of SNPs, was calibrated to balance resolution and signal detection. Genomic regions showing signatures of selective sweeps were screened for insecticide resistance genes. Potential novel candidates were identified through literature searches using resources including VectorBase, NCBI, and Google Scholar.

2.3. Genetic Variability

For each aldehyde oxidase gene, the corresponding transcript was determined. Single nucleotide polymorphisms (SNPs) were identified and categorized according to their predicted functional effects. SNP frequencies within populations were calculated, with further filtering to retain only high-frequency non-synonymous SNPs. Copy number variation (CNV) was assessed by computing the frequencies of gene amplifications and deletions in the aldehyde oxidase gene regions across populations.
Haplotype networks were constructed for aldehyde oxidase genes to investigate potential gene flow between countries and species. Haplotypes carrying the predominant high-frequency mutations were analyzed using a maximum genetic distance threshold of two SNPs.
Population structure was assessed using principal component analysis (PCA) and pairwise Fst estimates. Genetic diversity summary statistics were calculated from SNPs located within the aldehyde oxidase gene regions.

3. Results

3.1. Signals of Positive Selection–Identification of Potential New Insecticide Resistance Genes

We analyzed a total of 1908 individual genomes from An. gambiae s.s., An. coluzzii, and An. arabiensis. Genome-wide scans for selection were performed using the Garud H12 statistic in windows ranging from 1000 to 2000 SNPs, depending on the population. Several peaks of H12 values were detected across different chromosome arms. These peaks correspond to genomic regions containing the Vgsc, Ace1, Rdl, and GSTe3 genes, indicating selective sweeps likely driven by positive selection, as previously reported in An. gambiae populations (Figure 2).
In addition to these well-characterized resistance loci, a strong H12 peak was observed at a novel locus on chromosome 2L (28,510,000–28,590,000), indicating a recent selective sweep in this region. This genomic region corresponds to a cluster of genes encoding detoxification enzymes, including five aldehyde oxidases (AGAP006220, AGAP006221, AGAP006224, AGAP006225, AGAP006226), two glucosyl/glucuronosyl transferases (AGAP006222, AGAP006223), and two carboxylesterases (AGAP006227, AGAP006228). Population-specific analyses revealed that in An. gambiae s.s., the sweep occurred in Burkina Faso, Ghana, and Guinea. In An. arabiensis, it was detected in Tanzania and Malawi, though at lower frequencies. In contrast, the sweep was entirely absent in An. coluzzii (Figure 2).
We focused subsequent analyses on the aldehyde oxidases, as three of these genes were located directly under the peak of the selective sweep and, based on the literature evidence, appear to be strong candidates for involvement in insecticide resistance.

3.2. SNPs in the Aldehyde Oxidase Genes

Single nucleotide polymorphism (SNP) variation was analyzed in aldehyde oxidase genes across the five countries, Burkina Faso, Ghana, Guinea, Tanzania, and Malawi, where these genes showed evidence of positive selection. All five genes showed a high level of polymorphism across the different populations. Specifically, we identified 2202 SNPs (including 569 non-synonymous coding SNPs) in AGAP006220, 2661 SNPs (691 non-synonymous) in AGAP006221, 2698 SNPs (1433 non-synonymous) in AGAP006224, 2515 SNPs (978 non-synonymous) in AGAP006225, and 2104 SNPs (811 non-synonymous) in AGAP006226. The analyses identified a substantial number of non-synonymous coding SNPs at high frequencies across all five genes and in all three species, with some reaching 100% frequency in several cohorts. To focus on the most prevalent variants, the dataset was refined to include only those SNPs with minimum frequencies of 40, 50, or 75% in at least one sample cohort. The refinement showed 21 SNPs at frequencies higher than 40% in AGAP006220 and AGAP006225, and 56 SNPs above 75% in AGAP006221, AGAP006224, and AGAP006226 (Figure 3 and Figures S1–S4). These patterns indicate that these high-frequency variants may be subject to strong selective pressure and may play a significant role in the adaptation of malaria vectors to environmental challenges, including insecticide exposure.

3.3. CNVs in the Aldehyde Oxidase Genes

Copy number variation (CNV) frequencies were assessed for the five aldehyde oxidase genes across populations from Burkina Faso, Ghana, Guinea, Tanzania, and Malawi. The analysis revealed the presence of CNVs in all five genes. Deletions were detected only in AGAP006224, while amplifications were observed in all five genes. Overall, amplification frequencies were low (≤3%) across most of the genes and populations, except for An. arabiensis from Tanzania, where AGAP006225 and AGAP006226 exhibited amplification rates of 10.59% and 11.76%, respectively (Figure 4).

3.4. Gene Flow, Population Structure, and Genetic Diversity at the Aldehyde Oxidases Locus

To investigate gene flow at the aldehyde oxidases locus, the haplotype network technique was used to visualize the relationships between haplotypes from different species and countries and to show how variants are shared between populations. The results revealed strong signals of adaptive gene flow within An. gambiae s.l. populations in West Africa, as well as within populations in East Africa. In West Africa, the largest haplotype network comprised 163 identical haplotypes shared across all vector populations. A similar pattern was observed in East Africa, where 52 identical haplotypes were shared between An. gambiae s.l. populations from Tanzania and Uganda (Figure 5). Evidence of interspecific gene flow was also apparent, with a large haplotype cluster of 1001 identical sequences shared between An. gambiae (n = 609) and An. coluzzii (n = 392), indicating strong adaptive gene flow between these two species. In contrast, no shared haplotypes were detected between An. arabiensis and either An. gambiae or An. coluzzii (Figure 6).
These patterns were corroborated by population structure and genetic diversity analyses performed at the aldehyde oxidases locus across the 11 countries. Genetic differentiation (FST) between An. arabiensis and the two other species were generally higher than those observed between An. gambiae and An. coluzzii, indicating greater genetic divergence (Figure S5). Diversity indices also supported these trends, although some within-species variation was observed. Both measures of diversity (θπ and θw) showed similar results for An. gambiae and An. coluzzii compared to An. arabiensis (Figure 7). Principal component analysis (PCA) showed a distinct and homogeneous cluster of An. arabiensis, whereas An. gambiae and An. coluzzii clustered together, further supporting genetic exchange between these two species (Figure 8). Overall, these findings suggest that An. gambiae and An. coluzzii share a similar demographic history.

4. Discussion

Malaria control remains a major challenge in sub-Saharan Africa. Despite ongoing interventions, the region still experiences the highest global burden of the disease due to complicating factors such as insecticide resistance, changing vector behavior, and rapid urbanization [2,19]. Strengthening surveillance, particularly through genomic tools, offers new opportunities to enhance the effectiveness of vector control strategies [20]. In particular, identifying novel resistance-associated genes is essential for enabling early detection and monitoring within vector populations. These insights help guide and inform the selection and rotation of insecticides, reducing the spread of resistance variants [21]. Recent advances in genomics offer important opportunities to study the genomic architecture and evolution of malaria vectors, enabling high-resolution mapping of resistance genes, population structure, and adaptive evolution [21,22].

4.1. Aldehyde Oxidases Under Positive Selection

Garud H12 statistic revealed evidence of positive selection in aldehyde oxidase genes within An. gambiae and An. arabiensis populations across sub-Saharan Africa. The absence of such selection in An. coluzzii is consistent with recent findings on the carboxylesterases Coeae1f and Coeae2f, which, as previously mentioned, are located on the same locus [23]. Aldehyde oxidases are known for their broad substrate specificity and marked species differences [24].

4.2. Potential Role of Aldehyde Oxidases in Insecticide Resistance

Literature review indicates that aldehyde oxidase genes may play an important role in the adaptation of malaria vectors to environmental pressures, including those induced by insecticide-based control measures. Aldehyde oxidase is a cytosolic enzyme that belongs to the family of structurally related molybdo-flavoproteins like xanthine oxidase. These enzymes have originated from gene duplication events of an ancestral eukaryotic xanthine oxidoreductase gene first identified in the flatworm, Caenorhabditis elegans [24]. Aldehyde oxidases are common enzymes involved in detoxifying and breaking down many substances across various organisms. Although references to aldehyde oxidase and its role in metabolism date back to the 1950s [24], the significance of this enzyme in insecticide metabolism has only emerged in recent years [25]. Like cytochromes P450, aldehyde oxidases contribute to oxidation by carrying out nucleophilic attacks using oxygen derived from water [26]. In Culex mosquitoes, elevated aldehyde oxidase activity has been documented in resistant strains, indicating a potential selective advantage under insecticide exposure [27,28]. In An. gambiae s.l., aldehyde oxidases have been found co-amplified with esterases known to confer resistance to the organophosphate pirimiphos-methyl [23,27]. In addition, in vitro assays have demonstrated their ability to metabolize neonicotinoids including imidacloprid, thiamethoxam, clothianidin, and dinotefuran [29], indicating a potential role in resistance to this newer insecticide class. This is particularly relevant given the increasing interest in neonicotinoids as alternative tools for vector control in areas facing widespread multi-class resistance [30].

4.3. Genetic Mechanisms: SNPs and CNVs

In addition to positive selection, variant analyses revealed non-synonymous single nucleotide polymorphisms (SNPs) and copy number variations (CNVs), including gene amplifications, in the genomic region encompassing aldehyde oxidase genes. Both SNPs and CNVs are recognized as important drivers of insecticide resistance evolution [31,32,33,34] particularly in metabolic resistance, where they can increase gene expression [3,35]. Given the relatively low CNV frequencies detected in our results, we hypothesize that the putative resistance conferred by these detoxification genes may be more strongly associated with SNPs, which occurred at higher frequencies. This is consistent with previous evidence showing that SNPs are key determinants of variation in aldehyde oxidase expression and protein activity [24].

4.4. Gene Flow and Geographic Variation

The adaptive introgression detected between An. gambiae and An. coluzzii supports previous studies reporting extensive genetic exchange between these closely related species [22]. Although no significant selective sweep was detected in An. coluzzii, the ongoing and increasing gene flow suggests that aldehyde oxidase genes could come under selective pressure in this species as well. Based on the observed spatial patterns of gene flow, these putative resistance markers are expected to expand to large areas across the continent.

4.5. Implications for Vector Control

This study highlighted the ongoing evolution of the An. gambiae complex genome in response to environmental pressures and vector control interventions. Genomic changes, including mutations and gene duplications, are driven by factors such as exposure to insecticides, environmental changes and ecological shifts. These evolutionary processes can lead to increased resistance and altered behavior, which complicate malaria control efforts. Continuous genomic surveillance is therefore crucial to track and effectively respond to these changes.

5. Conclusions

This study highlights the importance of genomic data in malaria vector surveillance, particularly for detecting emerging mechanisms of insecticide resistance. Through whole genome analysis, we identified a previously unreported locus on chromosome 2L comprising detoxification genes encoding aldehyde oxidases (AGAP006220, AGAP006221, AGAP006224, AGAP006225, and AGAP006226) under recent positive selection. High-frequency non-synonymous SNPs, 569, 691, 1433, 978, and 811 in the respective genes were detected, with some reaching 100% frequency, alongside copy number variations (deletions and amplifications) at low frequencies (<12%). Population structure analysis revealed adaptive and geographic gene flow between An. gambiae and An. coluzzii, highlighting historical connectivity and potential gene exchange.
While aldehyde oxidases have previously been implicated in insecticide resistance in Culex mosquitoes, our findings provide the first genomic evidence supporting their potential involvement in insecticide resistance in An. gambiae s.l. populations. Further functional and gene expression studies will be necessary to validate this role. If confirmed, these genes could serve as novel molecular markers in resistance monitoring programs, enhancing the design of integrated and adaptive vector control strategies in Africa.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/genes16091018/s1. Table S1: Study sites, collection year, and species composition; Figure S1: SNP allele frequencies in the AGAP006220 gene in the different An. gambiae s.l. populations; Figure S2: SNP allele frequencies in the AGAP006221 gene in the different An. gambiae s.l. populations; Figure S3: SNP allele frequencies in the AGAP006224 gene in the different An. gambiae s.l. populations; Figure S4: SNP allele frequencies in the AGAP006225 gene in the different An. gambiae s.l. populations; Figure S5: Pairwise Fst estimates between different An. gambiae s.l. populations based on the aldehyde oxidase locus.

Author Contributions

Conceptualization, H.D.K. and M.K.; methodology, H.D.K. and M.K.; software, H.D.K. and M.K.; validation, M.K. and H.M.; formal analysis, H.D.K. and M.K.; investigation, H.D.K.; resources, A.D.; data curation, H.D.K. and M.K.; writing—original draft preparation, H.D.K.; writing—review and editing, H.D.K., M.K., S.O.G.Y., N.T., K.B. and H.M.; visualization, M.K.; supervision, M.K., N.T., K.B., H.M., M.N. and A.D.; project administration, A.D.; funding acquisition, A.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Pan-African Mosquito Control Association, grant number OPP1214408, the Bill & Melinda Gates Foundation, grant number INV-037164, and the Wellcome Trust, grant number UNS-128536.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Jupyter Notebooks and scripts to reproduce all the analyses, tables, and figures are available in the GitHub repository: https://github.com/HyacintheKi/aldehyde_data (accessed on 20 March 2025). The SNPs and haplotypes data are available on the homepage of MalariaGEN and can be accessed using the malariagen_data package. The raw sequences in FASTQ format and the aligned sequences in BAM format were stored in the European Nucleotide Archive (ENA, Study Accession n° PRJEB42254).

Acknowledgments

The authors gratefully acknowledge the Ag1000G Consortium for making the Phase 3 dataset publicly available. The authors are particularly grateful to the MalariaGEN Vector Observatory which is an international collaboration working to build capacity for malaria vector genomic research and surveillance. We benefited from training in data analysis for genomic surveillance, organized by MalariaGEN in collaboration with Wellcome Sanger Institute and Pan-African Mosquito Control Association. This training was decisive for the completion of this work.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Venkatesan, P. The 2023 WHO World Malaria Report. Lancet Microbe. 2024, 5, e214. [Google Scholar] [CrossRef]
  2. Wondji, C.S.; Hearn, J.; Irving, H.; Wondji, M.J.; Weedall, G. RNAseq-Based Gene Expression Profiling of the Anopheles funestus Pyrethroid-Resistant Strain FUMOZ Highlights the Predominant Role of the Duplicated CYP6P9a/b Cytochrome P450s. G3 Genes|Genomes|Genet. 2021, 12, jkab352. [Google Scholar] [CrossRef] [PubMed]
  3. Hemingway, J.; Hawkes, N.J.; McCarroll, L.; Ranson, H. The Molecular Basis of Insecticide Resistance in Mosquitoes. Insect Biochem. Mol. Biol. 2004, 34, 653–665. [Google Scholar] [CrossRef]
  4. Ranson, H.; N’Guessan, R.; Lines, J.; Moiroux, N.; Nkuni, Z.; Corbel, V. Pyrethroid Resistance in African Anopheline Mosquitoes: What Are the Implications for Malaria Control? Trends Parasitol. 2011, 27, 91–98. [Google Scholar] [CrossRef] [PubMed]
  5. Mouhamadou, C.S.; de Souza, S.S.; Fodjo, B.K.; Zoh, M.G.; Bli, N.K.; Koudou, B.G. Evidence of Insecticide Resistance Selection in Wild Anopheles coluzzii Mosquitoes Due to Agricultural Pesticide Use. Infect. Dis. Poverty 2019, 8, 64. [Google Scholar] [CrossRef] [PubMed]
  6. Kouamé, R.M.A.; Lynd, A.; Kouamé, J.K.I.; Vavassori, L.; Abo, K.; Donnelly, M.J.; Edi, C.; Lucas, E. Widespread Occurrence of Copy Number Variants and Fixation of Pyrethroid Target Site Resistance in Anopheles gambiae (s.l.) from Southern Côte d’Ivoire. Curr. Res. Parasitol. Vector Borne Dis. 2023, 3, 100117. [Google Scholar] [CrossRef]
  7. Suh, P.F.; Elanga-Ndille, E.; Tchouakui, M.; Sandeu, M.M.; Tagne, D.; Wondji, C.; Ndo, C. Impact of Insecticide Resistance on Malaria Vector Competence: A Literature Review. Malar. J. 2023, 22, 19. [Google Scholar] [CrossRef]
  8. Ibrahim, S.S.; Muhammad, A.; Hearn, J.; Weedall, G.D.; Nagi, S.C.; Mukhtar, M.M.; Fadel, A.N.; Mugenzi, L.J.; Patterson, E.I.; Irving, H.; et al. Molecular Drivers of Insecticide Resistance in the Sahelo-Sudanian Populations of a Major Malaria Vector Anopheles coluzzii. BMC Biol. 2023, 21, 125. [Google Scholar] [CrossRef]
  9. Li, X.; Schuler, M.A.; Berenbaum, M.R. Molecular Mechanisms of Metabolic Resistance to Synthetic and Natural Xenobiotics. Annu. Rev. Entomol. 2007, 52, 231–253. [Google Scholar] [CrossRef]
  10. Lynd, A.; Weetman, D.; Barbosa, S.; Egyir Yawson, A.; Mitchell, S.; Pinto, J.; Hastings, I.; Donnelly, M.J. Field, Genetic, and Modeling Approaches Show Strong Positive Selection Acting upon an Insecticide Resistance Mutation in Anopheles gambiae s.s. Mol. Biol. Evol. 2010, 27, 1117–1125. [Google Scholar] [CrossRef]
  11. Toé, K.H.; N’Falé, S.; Dabiré, R.K.; Ranson, H.; Jones, C.M. The Recent Escalation in Strength of Pyrethroid Resistance in Anopheles coluzzi in West Africa Is Linked to Increased Expression of Multiple Gene Families. BMC Genom. 2015, 16, 146. [Google Scholar] [CrossRef] [PubMed]
  12. Lucas, E.R.; Nagi, S.C.; Egyir-Yawson, A.; Essandoh, J.; Dadzie, S.; Chabi, J.; Djogbénou, L.S.; Medjigbodo, A.A.; Edi, C.V.; Kétoh, G.K.; et al. Genome-Wide Association Studies Reveal Novel Loci Associated with Pyrethroid and Organophosphate Resistance in Anopheles gambiae and Anopheles coluzzii. Nat. Commun. 2023, 14, 4946. [Google Scholar] [CrossRef] [PubMed]
  13. Hancock, P.A.; Ochomo, E.; Messenger, L.A. Genetic Surveillance of Insecticide Resistance in African Anopheles Populations to Inform Malaria Vector Control. Trends Parasitol. 2024, 40, 604–618. [Google Scholar] [CrossRef]
  14. Donnelly, M.J.; Isaacs, A.T.; Weetman, D. Identification, Validation and Application of Molecular Diagnostics for Insecticide Resistance in Malaria Vectors. Trends Parasitol. 2016, 32, 197–206. [Google Scholar] [CrossRef]
  15. Xue, A.T.; Schrider, D.R.; Kern, A.D.; Consortium, A. Discovery of Ongoing Selective Sweeps within Anopheles Mosquito Populations Using Deep Learning. Mol. Biol. Evol. 2021, 38, 1168–1183. [Google Scholar] [CrossRef]
  16. Garud, N.R.; Messer, P.W.; Buzbas, E.O.; Petrov, D.A. Recent Selective Sweeps in North American Drosophila melanogaster Show Signatures of Soft Sweeps. PLoS Genet. 2015, 11, e1005004. [Google Scholar] [CrossRef]
  17. The Anopheles gambiae 1000 Genomes Consortium. Genome Variation and Population Structure in Three African Malaria Vector Species within the Anopheles gambiae Complex; Manubot: 2021. Published online. Available online: https://malariagen.github.io/ag1000g-phase3-data-paper/ (accessed on 20 March 2025).
  18. Ag3.0 (Ag1000G Phase 3)—MalariaGEN Vector Data User Guide. Available online: https://malariagen.github.io/vector-data/ag3/ag3.0.html (accessed on 20 March 2025).
  19. Merga, H.; Degefa, T.; Birhanu, Z.; Tadele, A.; Lee, M.-C.; Yan, G.; Yewhalaw, D. Urban Malaria in Sub-Saharan Africa: A Scoping Review of Epidemiologic Studies. Malar. J. 2025, 24, 131. [Google Scholar] [CrossRef]
  20. Nsanzabana, C. Strengthening Surveillance Systems for Malaria Elimination by Integrating Molecular and Genomic Data. Trop. Med. Infect. Dis. 2019, 4, 139. [Google Scholar] [CrossRef]
  21. Clarkson, C.S.; Temple, H.J.; Miles, A. The Genomics of Insecticide Resistance: Insights from Recent Studies in African Malaria Vectors. Curr. Opin. Insect Sci. 2018, 27, 111–115. [Google Scholar] [CrossRef] [PubMed]
  22. Kientega, M.; Clarkson, C.S.; Traoré, N.; Hui, T.-Y.J.; O’Loughlin, S.; Millogo, A.-A.; Epopa, P.S.; Yao, F.A.; Belem, A.M.G.; Brenas, J.; et al. Whole-Genome Sequencing of Major Malaria Vectors Reveals the Evolution of New Insecticide Resistance Variants in a Longitudinal Study in Burkina Faso. Malar. J. 2024, 23, 280. [Google Scholar] [CrossRef]
  23. Nagi, S.C.; Lucas, E.R.; Egyir-Yawson, A.; Essandoh, J.; Dadzie, S.; Chabi, J.; Djogbénou, L.S.; Medjigbodo, A.A.; Edi, C.V.; Ketoh, G.K.; et al. Parallel Evolution in Mosquito Vectors—A Duplicated Esterase Locus Is Associated with Resistance to Pirimiphos-Methyl in An. gambiae. bioRxiv 2024. [Google Scholar] [CrossRef]
  24. Garattini, E.; Fratelli, M.; Terao, M. The Mammalian Aldehyde Oxidase Gene Family. Hum. Genom. 2009, 4, 119–130. [Google Scholar] [CrossRef]
  25. Zhang, Y.; Yang, Y.; Shen, G.; Mao, X.; Jiao, M.; Lin, Y. Identification and Characterization of Aldehyde Oxidase 5 in the Pheromone Gland of the Silkworm (Lepidoptera: Bombycidae). J. Insect Sci. 2020, 20, 31. [Google Scholar] [CrossRef]
  26. Godoy, R.; Mutis, A.; Carabajal Paladino, L.; Venthur, H. Genome-Wide Identification of Aldehyde Oxidase Genes in Moths and Butterflies Suggests New Insights Into Their Function as Odorant-Degrading Enzymes. Front. Ecol. Evol. 2022, 10, 823119. [Google Scholar] [CrossRef]
  27. Sanoh, S.; Tayama, Y.; Sugihara, K.; Kitamura, S.; Ohta, S. Significance of Aldehyde Oxidase during Drug Development: Effects on Drug Metabolism, Pharmacokinetics, Toxicity, and Efficacy. Drug Metab. Pharmacokinet. 2015, 30, 52–63. [Google Scholar] [CrossRef]
  28. Hemingway, J. The Molecular Basis of Two Contrasting Metabolic Mechanisms of Insecticide Resistance. Insect Biochem. Mol. Biol. 2000, 30, 1009–1015. [Google Scholar] [CrossRef]
  29. Coleman, M.; Vontas, J.G.; Hemingway, J. Molecular Characterization of the Amplified Aldehyde Oxidase from Insecticide Resistant Culex quinquefasciatus. Eur. J. Biochem. 2002, 269, 768–779. [Google Scholar] [CrossRef] [PubMed]
  30. Sexton, J.P.; Clemens, M.; Bell, N.; Hall, J.; Fyfe, V.; Hoffmann, A.A. Patterns and Effects of Gene Flow on Adaptation across Spatial Scales: Implications for Management. J. Evol. Biol. 2024, 37, 732–745. [Google Scholar] [CrossRef] [PubMed]
  31. Clarkson, C.S.; Miles, A.; Harding, N.J.; O’Reilly, A.O.; Weetman, D.; Kwiatkowski, D.; Donnelly, M.J. The Genetic Architecture of Target-site Resistance to Pyrethroid Insecticides in the African Malaria Vectors Anopheles gambiae and Anopheles coluzzii. Mol. Ecol. 2021, 30, 5303–5317. [Google Scholar] [CrossRef]
  32. Stica, C.; Jeffries, C.L.; Irish, S.R.; Barry, Y.; Camara, D.; Yansane, I.; Kristan, M.; Walker, T.; Messenger, L.A. Characterizing the Molecular and Metabolic Mechanisms of Insecticide Resistance in Anopheles gambiae in Faranah, Guinea. Malar. J. 2019, 18, 244. [Google Scholar] [CrossRef]
  33. Jones, C.M.; Liyanapathirana, M.; Agossa, F.R.; Weetman, D.; Ranson, H.; Donnelly, M.J.; Wilding, C.S. Footprints of Positive Selection Associated with a Mutation (N1575Y) in the Voltage-Gated Sodium Channel of Anopheles gambiae. Proc. Natl. Acad. Sci. USA 2012, 109, 6614–6619. [Google Scholar] [CrossRef] [PubMed]
  34. Fouet, C.; Ashu, F.A.; Ambadiang, M.M.; Tchapga, W.; Wondji, C.S.; Kamdem, C. Clothianidin-Resistant Anopheles gambiae Adult Mosquitoes from Yaoundé, Cameroon, Display Reduced Susceptibility to SumiShield® 50WG, a Neonicotinoid Formulation for Indoor Residual Spraying. BMC Infect. Dis. 2024, 24, 133. [Google Scholar] [CrossRef] [PubMed]
  35. Dick, R.A.; Kanne, D.B.; Casida, J.E. Substrate Specificity of Rabbit Aldehyde Oxidase for Nitroguanidine and Nitromethylene Neonicotinoid Insecticides. Chem. Res. Toxicol. 2006, 19, 38–43. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Countries and collection sites.
Figure 1. Countries and collection sites.
Genes 16 01018 g001
Figure 2. H12 statistic on the 2L chromosomal arm of mosquitoes from Burkina Faso (BFA), Ghana (GHA), Mali (MLI), Guinea (GIN), Malawi (MWI), and Tanzania (TZA). Selective sweeps are suggested by peaks. A cluster of five aldehyde oxidases was found under positive selection in Anopheles gambiae and An. arabiensis: AGAP006220 (2L: 28,512,602–28,517,680), AGAP006221 (2L: 28,518,055–28,523,900), AGAP006224 (2L: 28,528,758–28,533,199), AGAP006225 (2L: 28,534,732–28,539,416), and AGAP006226 (2L: 28,540,651–28,545,294).
Figure 2. H12 statistic on the 2L chromosomal arm of mosquitoes from Burkina Faso (BFA), Ghana (GHA), Mali (MLI), Guinea (GIN), Malawi (MWI), and Tanzania (TZA). Selective sweeps are suggested by peaks. A cluster of five aldehyde oxidases was found under positive selection in Anopheles gambiae and An. arabiensis: AGAP006220 (2L: 28,512,602–28,517,680), AGAP006221 (2L: 28,518,055–28,523,900), AGAP006224 (2L: 28,528,758–28,533,199), AGAP006225 (2L: 28,534,732–28,539,416), and AGAP006226 (2L: 28,540,651–28,545,294).
Genes 16 01018 g002
Figure 3. SNP allele frequencies in the AGAP006226 gene in the different An. gambiae s.l. populations. The X-axis shows the different cohorts (country, species, and sampling period). The Y-axis shows the SNPs positions in the gene and the corresponding amino acid change. The gradient color bar shows the distribution of the allelic frequencies.
Figure 3. SNP allele frequencies in the AGAP006226 gene in the different An. gambiae s.l. populations. The X-axis shows the different cohorts (country, species, and sampling period). The Y-axis shows the SNPs positions in the gene and the corresponding amino acid change. The gradient color bar shows the distribution of the allelic frequencies.
Genes 16 01018 g003
Figure 4. CNV frequencies at the aldehyde oxidases genes in the different An. gambiae s.l. populations. The X-axis shows the different cohorts (country, species, and sampling period). The Y-axis shows the genes ID and the CNV type (del: deletion or amp: amplification). The gradient color bar shows the distribution of the CNV frequencies.
Figure 4. CNV frequencies at the aldehyde oxidases genes in the different An. gambiae s.l. populations. The X-axis shows the different cohorts (country, species, and sampling period). The Y-axis shows the genes ID and the CNV type (del: deletion or amp: amplification). The gradient color bar shows the distribution of the CNV frequencies.
Genes 16 01018 g004
Figure 5. Haplotype networks in five aldehyde oxidases (AGAP006220, AGAP006221, AGAP006224, AGAP006225, and AGAP006226) showing gene flow between countries. Identical haplotypes are represented by nodes (circles). The larger the node, the greater the number of identical haplotypes. Maximum distance set to two, i.e., each node is separated by a maximum genetic distance of 2 SNPs. Color indicates the country.
Figure 5. Haplotype networks in five aldehyde oxidases (AGAP006220, AGAP006221, AGAP006224, AGAP006225, and AGAP006226) showing gene flow between countries. Identical haplotypes are represented by nodes (circles). The larger the node, the greater the number of identical haplotypes. Maximum distance set to two, i.e., each node is separated by a maximum genetic distance of 2 SNPs. Color indicates the country.
Genes 16 01018 g005
Figure 6. Haplotype networks in five aldehyde oxidases (AGAP006220, AGAP006221, AGAP006224, AGAP006225, and AGAP006226) showing gene flow between species. Identical haplotypes are represented by nodes (circles). The larger the node, the greater the number of identical haplotypes. Maximum distance set to two, i.e., each node is separated by a maximum genetic distance of 2 SNPs. Color indicates the species.
Figure 6. Haplotype networks in five aldehyde oxidases (AGAP006220, AGAP006221, AGAP006224, AGAP006225, and AGAP006226) showing gene flow between species. Identical haplotypes are represented by nodes (circles). The larger the node, the greater the number of identical haplotypes. Maximum distance set to two, i.e., each node is separated by a maximum genetic distance of 2 SNPs. Color indicates the species.
Genes 16 01018 g006
Figure 7. Genetic diversity statistics of An. gambiae s.l. populations at aldehyde oxidases locus. (A) Nucleotide diversity (θπ). (B) Watterson theta (θw). The X-axis shows the different cohorts (country/sampling period/species): AGO = Angola, BFA = Burkina Faso, CIV = Côte d’Ivoire, CMR = Cameroon, COD = Democratic Republic of Congo, GHA = Ghana, GIN = Guinea, MLI = Mali, MWI = Malawi, TZA = Tanzania, UGA = Uganda.
Figure 7. Genetic diversity statistics of An. gambiae s.l. populations at aldehyde oxidases locus. (A) Nucleotide diversity (θπ). (B) Watterson theta (θw). The X-axis shows the different cohorts (country/sampling period/species): AGO = Angola, BFA = Burkina Faso, CIV = Côte d’Ivoire, CMR = Cameroon, COD = Democratic Republic of Congo, GHA = Ghana, GIN = Guinea, MLI = Mali, MWI = Malawi, TZA = Tanzania, UGA = Uganda.
Genes 16 01018 g007
Figure 8. Principal component analysis of An. gambiae s.l. populations at aldehyde oxidases locus. X-axis shows the first principal component (PC1) and Y-axis the second principal component (PC2).
Figure 8. Principal component analysis of An. gambiae s.l. populations at aldehyde oxidases locus. X-axis shows the first principal component (PC1) and Y-axis the second principal component (PC2).
Genes 16 01018 g008
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ki, H.D.; Kientega, M.; Yemien, S.O.G.; Maiga, H.; Traoré, N.; Bayili, K.; Namountougou, M.; Diabaté, A. Identification of Novel Gene Cluster Potentially Associated with Insecticide Resistance in Anopheles gambiae s.l. Genes 2025, 16, 1018. https://doi.org/10.3390/genes16091018

AMA Style

Ki HD, Kientega M, Yemien SOG, Maiga H, Traoré N, Bayili K, Namountougou M, Diabaté A. Identification of Novel Gene Cluster Potentially Associated with Insecticide Resistance in Anopheles gambiae s.l. Genes. 2025; 16(9):1018. https://doi.org/10.3390/genes16091018

Chicago/Turabian Style

Ki, Hyacinthe Dipina, Mahamadi Kientega, Sabéré O. G. Yemien, Hamidou Maiga, Nouhoun Traoré, Koama Bayili, Moussa Namountougou, and Abdoulaye Diabaté. 2025. "Identification of Novel Gene Cluster Potentially Associated with Insecticide Resistance in Anopheles gambiae s.l." Genes 16, no. 9: 1018. https://doi.org/10.3390/genes16091018

APA Style

Ki, H. D., Kientega, M., Yemien, S. O. G., Maiga, H., Traoré, N., Bayili, K., Namountougou, M., & Diabaté, A. (2025). Identification of Novel Gene Cluster Potentially Associated with Insecticide Resistance in Anopheles gambiae s.l. Genes, 16(9), 1018. https://doi.org/10.3390/genes16091018

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop