Next Article in Journal
Biomass Allocation and Allometric Relationships Among Major Plant Formations in the Alpine Peat Swamp Wetlands of the Yellow River on the Gannon Plateau, Gansu Province, China
Previous Article in Journal
Ascorbic Acid Seed Priming Enhances Yield and Related Responses in Broccoli Under Water Deficit Stress
Previous Article in Special Issue
Hermaphroditism in Cannabis sativa L.: Impacts, Inducers, and Industry Implications
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Genetic Advances in Cannabis sativa L.: A Review of Recent Progress and Future Directions

by
Kasuni C. Daundasekara
1,*,
Kalpani P. Thennakoon
1,
Jivendra S. Wickramasinghe
1,
Selamawit Woldesenbet
1,
Christopher Delhom
2,
Suman Chandra
3,4 and
Aruna D. Weerasooriya
1,*
1
CAFNR Research, College of Agriculture, Food and Natural Resources, Prairie View A&M University, Prairie View, TX 77446, USA
2
Cotton Ginning Research Unit, USDA Agricultural Research Service, Stoneville, MS 38756, USA
3
National Center for Natural Products Research, School of Pharmacy, The University of Mississippi, University, MS 38677, USA
4
Department of Biomolecular Sciences, School of Pharmacy, The University of Mississippi, University, MS 38677, USA
*
Authors to whom correspondence should be addressed.
Plants 2026, 15(13), 2088; https://doi.org/10.3390/plants15132088 (registering DOI)
Submission received: 8 May 2026 / Revised: 30 June 2026 / Accepted: 2 July 2026 / Published: 4 July 2026
(This article belongs to the Special Issue Medicinal Cannabis: Phytochemistry and Biotechnological Advances)

Abstract

Cannabis sativa L. is an economically significant multi-use crop valued for fiber, seed, and phytochemical production. Compared with other crops, advancement in Cannabis sativa has been slow due to regulatory constraints and genetic resource limitations. Recent advances in technology have transformed the research landscape, supporting a deeper understanding of the genetic architecture underlying key agronomic traits. This review summarizes current progress in Cannabis sativa genetics and genomics, mainly focusing on structural genome organization, including chromosome-level assemblies and emerging pangenomic resources that capture species-wide diversity. We explore the molecular basis of key agronomic traits, including sex determination, cannabinoid biosynthesis, fiber quality, seed composition, disease resistance, and abiotic stress tolerance, highlighting their complex regulatory networks. Functional genomics tools including virus-induced gene silencing, transient expression systems, and CRISPR/Cas9 genome editing are reviewed as approaches enabling direct gene functional validation. We further review integration of these resources with molecular breeding strategies, including marker-assisted and genomic selection, to accelerate elite genotype development. Finally, we address persistent challenges such as genomic complexity, reference bias, and phenotyping limitations while outlining future research directions. Together, these advances position C. sativa as a compelling system for both fundamental plant biology and applied crop improvement.

1. Introduction

Cannabis sativa L. is a versatile, multi-purpose crop with considerable genetic diversity, broadly divided into distinct groups, such as industrial hemp, usually defined by containing ≤0.3% Δ9-tetrahydrocannabinol (THC) and narcotic marijuana genotypes characterized by higher THC content [1,2]. The significance of this crop has increased considerably due to its wide range of industrial applications and its uses in human nutrition and medicinal products [3,4,5]. The origin of the species trace back to eastern Asia [6,7]. For thousands of years, Cannabis sativa has been cultivated for its fiber, oil-rich seeds, and other valuable compounds [8,9]. In the United States, interest in industrial hemp cultivation increased following the 2018 legalization of low-THC varieties [2,10], and hemp is now recognized as a valuable agricultural commodity [11,12,13]. Compared with marijuana, industrial hemp contains smaller amounts of THC and is grown mainly for its cannabidiol (CBD) content, as well as for fiber and grain production [3,14]. This renewed interest coincides with rapid advances in plant genomics, molecular biology, and breeding technologies, making it a timely model for modern crop improvement systems.
Despite its long history of use, genetics and breeding research in industrial hemp has lagged behind that of other major crops. For most of the past century, legal restrictions on Cannabis in many regions curtailed scientific research and limited the exchange of genetic resources across borders [15,16]. As a result, assessments of genetic variation were historically confined to phenotypic selection, with little to no molecular data being generated [17,18]. The reclassification and legalization of industrial hemp in several regions, such as under the 2018 U.S. Farm Bill, have transformed the research landscape [2]. Advances in next-generation sequencing, coupled with international efforts to develop reference genomes, have provided unprecedented insights into the biology and evolution of industrial hemp [19,20,21,22], making it possible to dissect complex trait variations at very small scales, analyze genome-wide diversity, and accelerate the use of marker-assisted and genomic selection methods.
Recent progress in plant molecular biology, genomics, and biotechnology has greatly deepened our understanding of the complex genetic architecture underlying important agronomic traits in Cannabis sativa (Figure 1). This has allowed for increased precision in the improvement of those traits through both traditional breeding and emerging genetic engineering strategies [23,24,25]. However, industrial hemp has a number of distinctive biological challenges for genetic improvement (i.e., dioecy, high heterozygosity, broad levels of genetic diversity and limited self-pollination capacity) that can complicate controlled breeding, functional confirmation and development of stable genotypes [26,27,28].
In recent years, researchers have published several review articles on Cannabis sativa, exploring from how the plant is classified to how it is grown and used. The early reviews on Cannabis sativa formed a basis for research on botanical characteristics, classification, reproductive biology, systematics, and genetic diversity, which laid the foundation for understanding the origins and diversity of this plant [9,29]. Many researchers have also explored the industrial side of Cannabis sativa, focusing on its fiber and seed applications [30,31,32]. Industrial hemp fiber composition, structure, extraction, processing methods, and applications in textiles have been widely discussed. Industrial hemp seed proteins, essential fatty acids, fiber, and bioactive chemicals are being thoroughly studied because it is thought to be a very nutritious seed [4,33,34,35]. In addition, a few reviews have explored environmental applications such as phytoremediation and circular bioeconomy potential [36,37]. Research on industrial sustainability has looked at hemp as a renewable crop, looking at the economic, environmental, and social aspects [3,10,38,39].
Recent studies focus more on the genetic and genomic improvement of Cannabis sativa. These reviews address molecular markers, sequencing resources, genome assemblies, and functional genomics approaches [40]. They discuss how this helps advance our understanding of genetic diversity, cannabinoid biosynthesis, sex determination, and complex trait regulation [41,42,43]. Some researchers have also explored challenges of genome-editing technologies and breeding innovations in industrial hemp improvement [26,44,45].
Despite this growing body of work, existing reviews still vary considerably in their scope and depth of integration. Agronomic and industrial studies mostly concentrate on production and practical use, while genomic and molecular reviews highly lean toward gene discovery and sequencing resources, leaving a gap between their findings and how those findings guide breeding practices. This gap emphasizes the need for a more thorough discussion that brings together phenotypic characterization, multi-omics data, genetic mapping, genomic prediction, and breeding strategies under one framework.
This review addresses this gap by providing an integrated framework for Cannabis sativa improvement. We integrate phenotypic and molecular data with modern breeding approaches, emphasizing how these data streams contribute to breeding through functional understanding, predictive modeling, and the development of improved cultivars. We discuss recent advances in the genetics, genomics and multi-omics approaches of Cannabis sativa research, with a focus on genome-sequencing efforts, gene function characterization, molecular marker development, and genetic transformation methodologies. We discuss the new technologies that are changing the way we think about improving Cannabis, such as CRISPR/Cas gene editing, transient expression systems, and RNA interference technologies. By integrating these advances, we aim to provide a comprehensive overview of how modern genetic tools are transforming Cannabis research and enabling the development of elite genotypes, optimized for traits including fiber quality, seed composition, phytochemical profiles, and resilience to environmental stressors.

2. Genome Architecture and Evolution of Cannabis sativa

The latest findings in the comprehension and analysis of Cannabis sativa genomics have increased our understanding of how genome structure shapes evolution and phenotypic variation. The chromosomal arrangement and overall genomic organization of Cannabis provide valuable context for interpreting patterns of trait and population variation, as well as the genetic architecture of key agronomic traits.
Cannabis sativa L. is a diploid species (2n = 20) that has nine pairs of autosomes and one pair of heteromorphic sex chromosomes [19,46]. The species is predominantly dioecious (one sex per plant), though some plants can be monoecious (bearing both male and female flowers) (Figure 2) [47]. Sex determination is genetically controlled, with females carrying XX chromosomes and males carrying XY chromosomes [48,49]. Previous studies reported similar diploid genome sizes between sexes. Approximately 1636 (±7.2) Mb for females and 1683 (±13.9) Mb for males [27,50,51]. This difference can largely be attributed to the presence of the larger Y chromosome in males [51].
One of the most defining features of the Cannabis genome is having a high quantity of repetitive DNA. The majority of Cannabis’ repetitive DNA comes from long terminal repeat (LTR) retrotransposons, which contribute to more than 60% of the overall Cannabis genome [52,53,54]. These repetitive elements contribute to overall genome size expansion and offer insights into gene location and the evolutionary history of the Cannabis genome [52]. The accumulation, especially within heterochromatic regions and in the genomic regions surrounding the sex chromosomes, has been attributed to the increased size and structural variation observed between male and female Cannabis plants [55].
The presence of repetitive sequences in a genome challenges for accurate genome assembly when short-read sequencing methods are used [52]. However, advances in sequencing using long reads and chromatin conformation capture methods enabled assembly of genomic sequences (Table S1) that contain many repetitive sequences and clarify genome organization [19,56]. This structural knowledge is important when interpreting recombination patterns, improving the resolution of trait mapping, and understanding the structural variation associated with agronomic or phenotypic traits.

3. Reference Genomes, Assemblies, and Pangenomics

Over the past decade, genomic resources for Cannabis sativa have advanced considerably, yielding deeper insights into cannabinoid biosynthesis, chemotype determination, and trait evolution. The first draft genome of Cannabis sativa was published in 2011, based on a drug-type cultivar known as Purple Kush. This allowed for the completion of the first genome and transcriptome assemblies and the opportunity to conduct comparative analyses of traditional fiber-type genotypes while also establishing the foundation for molecular studies of cannabinoid biosynthesis [46]. Building on this initial draft genome assembly, researchers were able to create high-quality chromosome-level assemblies that clarified the complex organization of cannabinoid synthase loci, providing critical structural insights into how tetrahydrocannabinolic acid (THCA) and cannabidiolic acid (CBDA) are biosynthesized [57,58].
Following the Purple Kush assembly, additional assemblies of other high CBD and wild Cannabis genotypes further expanded the understanding of gene content, repetitive elements and genome organization, supporting both functional genomics and molecular breeding opportunities [19,56]. These genomic resources also identified regions of the Cannabis genome with variable and conserved gene sequences relating to cannabinoid production, fatty acid metabolism and vitamin E biosynthesis, serving as a broad-purpose reference for studying agronomic traits and chemotype diversity.
Haplotype-resolved assemblies and large-scale pangenomic analyses using numerous accessions demonstrated that Cannabis harbors extensive genotypic diversity, with much of it being driven by gene presence/absence variation and patterns of gene exchange among three major population groups, industrial hemp, drug-type, and basal [20,21]. The basal group includes Chinese landraces and feral accessions that diverged prior to modern industrial hemp and drug-type genotypes and retain ancestral genetic variation. These provide insights into the domestication and evolutionary history of Cannabis [20,59]. Stress and disease-resistant genes were found to be overrepresented in flexible genes, indicating the potential for adaptation and breeding applications [20]. Pangenomic frameworks therefore offer an improved basis for mapping traits, reducing reference bias and enhancing the development of marker for breeding.
In addition to genome-wide and pangenomic approaches, short tandem repeats (STRs) have been used to individualize Cannabis plants and determine their origin for forensic purposes [60]. STR profiling systems have undergone internal validations and have demonstrated the ability to differentiate among individual plants, associate plant material with seized samples, and detect clonal propagation [61,62].
More recently, haplotype-resolved genome assemblies of seed-type Cannabis sativa have added further dimension to the understanding of genome organization beyond the reference genome and pangenome paradigm [63]. This allows for the distinction between maternal and paternal alleles and revealing genetic differences that remained concealed before and enhancing structural accuracy in highly heterozygous genomic loci [63]. In summary, these genomic advances define a strong foundation for understanding cannabinoid pathways, genome evolution, and accelerating the enhancement of industrial hemp genotypes for use in fiber and seed production and secondary metabolites, while also supporting high-resolution genetic identification and forensic applications.

Plastid Genome Diversity in Cannabis sativa

Plastid genome analyses have provided an additional layer of genomic information in Cannabis sativa, complementing nuclear genome studies of diversity and evolution. The complete chloroplast genome sequencing of multiple genotypes, including Cannabis sativa var. Yunma 7, has enabled detailed characterization of plastome structure, gene content, and evolutionary conservation [64,65]. Comparative analyses extending to related taxa, such as Humulus lupulus, further highlight the high structural conservation of chloroplast genomes within the Cannabaceae family while still allowing for discrimination among distinct lineages [66].
In addition to whole-plastome sequencing, chloroplast-derived SNP markers have been developed and assessed for crop-type determination and biogeographical origin tracing in European Cannabis sativa samples, demonstrating the value of plastid variation for forensic and breeding applications [67]. Additionally, integrated genomic databases combining nuclear, chloroplast, and mitochondrial data from U.S. Cannabis accessions have expanded reference resources for multi-genome comparisons, enabling more robust phylogeographic and genotype-level analyses [68]. Comparative plastome studies consistently indicate that while the chloroplast genome is relatively conserved, it contains sufficient variation in SNPs and simple sequence repeats (SSRs) to support genotype identification, population structure inference, and germplasm traceability [69]. Together, these organelle genome resources complement nuclear genome-based approaches and strengthen the genomic toolkit available for industrial hemp breeding and evolutionary research.

4. Sex Determination and Reproductive Biology

Sex determination is one of the most significant aspects of Cannabis sativa biology, as it affects not only reproductive development but also many other traits with agronomy and commercial importance. To produce industrial hemp crops with controlled sexual expression, it is important to understand the genetic and molecular basis of sex differentiation for both evolutionary and applied breeding programs.
Unlike many crop species, Cannabis is predominantly dioecious, producing male and female flowers on separate individuals. Female plants are generally preferred for cannabinoid production, as unfertilized female inflorescences accumulate substantially greater concentrations of cannabinoids and other specialized metabolites. As a result, understanding the genetic mechanisms regulating sex determination is critical to both biological research and commercial cultivation. From a chromosomal perspective, Cannabis sativa follows an XY/XX sex determination system, with males carrying XY chromosomes and females carrying XX [70,71]. However, naturally occurring monoecious individuals are frequently employed in the breeding of fiber and seed industrial hemp [72,73].
Early cytogenetic studies with fluorescence in situ hybridization (FISH) using DAPI/C banding provided initial evidence of heteromorphic sex chromosomes in Cannabis, laying the groundwork for subsequent molecular investigations into sex determination [70]. In dioecious Cannabis sativa, the sex chromosomes are highly heteromorphic. The X chromosome is among the largest in the karyotype, while the Y chromosome is distinguished by dense accumulation of heterochromatin and repetitive DNA sequences. The lack of recombination outside the pseudo autosomal region suggests that these chromosomes represent an early stage of evolutionary divergence [70,71]. Generally, monoecious cultivars carry an XX karyotype, implying that the development of bisexual flowers arises from differential regulation of sexual expression rather than simply from the presence of a Y chromosome [47].
Although the chromosomal basis of sex determination is well established, the identification of the primary genetic regulators remains an ongoing area of active research. Recent genomic studies have uncovered an ancient locus on the X chromosome carrying three candidate genes responsible for sexual development in Cannabis sativa, offering new insights into the mechanisms of sex determination and substantiating the idea that sex determination in Cannabis is linked to ancestral elements on the X chromosome [74].
Sex determination in C. sativa extends well beyond simple chromosomal inheritance. It is shaped by a broader network of polygenic and regulatory interactions. Genome-wide association studies and resequencing analyses have mapped sex-associated quantitative trait loci (QTLs) across both sex chromosomes and autosomes, linking genes involved in auxin and gibberellin signaling, as well as transcription factors that guide floral development [75,76]. Some of these QTLs overlap with flowering time loci, suggesting that reproductive timing and sexual differentiation are likely co-regulated. Transcriptomic analyses have revealed extensive sex-biased gene expression occurring before visible floral differentiation, implicating both sex-linked and autosomal genes and highlighting early molecular divergence between male and female developmental programs [76]. Recent haplotype-resolved genome assemblies further strengthened existing theories about X Y divergence because they provide direct proof of the structural differences and repeat elements that have accumulated in the Y chromosome over time [63].
These genetic and molecular mechanisms have direct implications for reproductive biology. Sexual phenotype determines flower morphology, the timing of anthesis, and the production of pollen and seeds, all of which influence both natural population dynamics and cultivated breeding systems. Inflorescence architecture is an important aspect of Cannabis reproductive biology that shapes pollination efficiency, cannabinoid production, disease susceptibility, and seed retention [77,78,79,80]. Female inflorescences develop acropetally, with flowers initiating and maturing progressively from base to apex [80,81,82]. Drug-type cultivars typically form compact inflorescences with short internodes and dense floral clusters, while industrial hemp cultivars tend toward longer internodes and a more open structure [83,84]. The genetic mechanisms governing inflorescence development, floral density, and branching architecture remain poorly understood and represent promising targets for future research [78,79].
Environmental factors and phytohormones also play a role in modulating sex expression. For instance, exogenous application of ethylene inhibitors such as silver thiosulfate can induce male flower development on genetically female plants, a technique widely used to produce feminized seed [76]. These insights have translated into practical tools for early sex identification, a capability of particular value in cannabinoid-focused breeding, where female plants are commercially preferred. Y-linked PCR markers and multiplex assays have proven capable of reliably distinguishing male from female seedlings across a range of genotypes, though their accuracy can vary depending on germplasm background [85,86]. One of the most recent advances in this direction refers to CRISP, an affordable and scalable tool for early sex determination in Cannabis sativa. CRISP offers a rapid and cost-effective pipeline for sex identification, with the potential to meaningfully improve the efficiency of early sex determination in industrial hemp breeding programs [87]. Integrating these tools with an understanding of polygenic regulation and hormonal influences would give breeders meaningful control over sex expression, with direct effects on reproductive efficiency and cannabinoid biosynthesis.
Sex determination in C. sativa is a far more complex process than once thought, operating through a multilayered system that integrates chromosomal architecture, polygenic regulation, and environmental responsiveness. This integrated framework not only shapes reproductive phenotypes central to cultivation and breeding but also reflects the ongoing evolution of sex chromosomes and regulatory networks in the species. Despite significant progress, several fundamental questions regarding Cannabis sex determination remain unresolved. The primary genetic regulators of sex determination have yet to be definitively identified, the interactions between chromosomal and hormonal pathways remain poorly understood, and functional validation of candidate genes is limited. Moving forward, research that brings together cytogenetic, genomic, and transcriptomic approaches will be essential both to resolving the full molecular architecture of sex determination and to translating that knowledge into more effective breeding strategies and a better understanding of how sex chromosomes evolve in plants.

5. Genetics of Cannabinoid Biosynthesis

Cannabinoids are a chemically diverse class of terpenophenolic compounds unique to Cannabis sativa and have been extensively studied. They represent one of the most economically important trait groups in the species. Over one hundred cannabinoids have been identified, with Δ9-tetrahydrocannabinol (THC) and cannabidiol (CBD) being the most broadly studied due to their psychoactive and therapeutic properties, respectively [88,89]. The high variation in cannabinoid profiles across industrial hemp and drug-type cultivars arises from complex interactions among biosynthetic genes (Table 1), structural genomic variation, transcriptional regulation, and environmental influences. The blend of cannabinoids a plant produces defines its chemotype, and this chemical identity highlights the plant’s industrial, medicinal, and recreational applications [90,91,92].
The glandular trichomes of female inflorescence serve as the center of the production of cannabinoids, reflecting a highly spatially organized process (Figure 3), where it integrates metabolic flux from both the polyketide and plastidial methylerythritol phosphate (MEP) pathways. The biosynthetic pathway is initiated by the production of olivetolic acid through a type III polyketide synthase, followed by prenylation to form cannabigerolic acid (CBGA), from which the major cannabinoids are derived [93,104,105].
From CBGA, a set of specialized oxidocyclases catalyze the formation of the major cannabinoid acids, tetrahydrocannabinolic acid (THCA), cannabidiolic acid (CBDA), and cannabichromenic acid (CBCA). The relative activity of these enzymes is what largely determines a plant’s chemotype. These enzymes constitute the core genetic determinants of cannabinoid composition and have been major targets of both genetic and breeding investigations. In industrial hemp, CBDAS activity dominates over that of THCAS and CBCAS, steering the plant toward high CBDA accumulation [93,94,105].
The genome-wide inventory and functional analysis of cannabinoid oxidocyclases in Cannabis sativa have provided more details about the diversity within cannabinoid synthase gene families, including expansion, sequence divergence, and functional diversification of THCAS, CBDAS, and related enzymes. These findings have deepened our understanding of how chemotypes are determined and how cannabinoid biosynthetic gene families have evolved over time [106]. Recent genomic studies have demonstrated that cannabinoid synthase genes occur within highly dynamic genomic regions characterized by tandem duplications, deletions, and structural rearrangements [107]. Copy number variation within THCAS and CBDAS clusters has been associated with differences in cannabinoid profiles among cultivars, suggesting that structural variation contributes substantially to chemotype diversity [108,109].
Transcriptomic research has further revealed that the expression of pathway genes is carefully synchronized with trichome development and that this expression responds dynamically to environmental signals, allowing the plant to adjust cannabinoid output in response to both developmental stage and external stress [104,110,111,112]. Functional experiments, such as overexpression of CsCBDAS, demonstrate cross-talk between terminal synthases and genes further upstream in the pathway, highlighting multi-level regulation that fine-tunes cannabinoid biosynthesis [105].
Beyond enzyme level control, cannabinoid biosynthesis is strongly influenced by transcriptional and epigenetic regulation. The responsiveness of gene promoters to phytohormones and environmental stimuli, together with the activity of transcription factor families such as WRKY, AP2L, MYB, and COL, coordinates gene expression and precursor allocation [104,111,112,113]. Together, these mechanisms ensure robust CBDA accumulation and minimal THCA in industrial hemp genotypes while also identifying key regulatory points that can be targeted through metabolic engineering and precision breeding to tailor cannabinoid profiles for fiber, seed, and therapeutic end uses [93,105,113].
Genome-wide association studies and population structure analyses have identified specific genomic loci that underline chemical diversity in the species, revealing how population stratification shapes metabolite variation. This has strengthened the link between genotypic variation and the cannabinoid and terpene biosynthetic pathways in Cannabis sativa [22]. Cannabinoid biosynthesis in C. sativa is not a simple, linear pathway. It is an interconnected system in which metabolic flux, enzyme specialization, and transcriptional regulation function together. Understanding how these elements interact and how they respond to genetic and environmental variation will be essential to developing more predictable, targeted approaches to cannabinoid production, whether the goal is maximizing therapeutic compounds, meeting regulatory thresholds, or adapting performance across variable growing conditions. Such knowledge will provide a foundation for breeding and developing cultivars tailored to specific commercial and industrial applications.

6. Genetic Basis of Fiber Quality and Quantity in Cannabis sativa

Fiber production has historically been one of the primary agricultural uses of Cannabis sativa and yet the genetic mechanisms controlling fiber development remain considerably more poorly understood than those governing cannabinoid biosynthesis. Fiber quality and yield in Cannabis sativa are complex quantitative traits governed by the biosynthesis, organization, and remodeling of secondary cell wall components, primarily cellulose, hemicellulose, and lignin [14]. Advances in genomic, transcriptomic, and association-based research have begun to clarify the genetic foundation of these traits, revealing both conserved biosynthetic pathways and regulatory networks that shape the development of bast fiber (Table 1).
Bast fibers are derived from specialized cells within the phloem tissue of the outer stem cortex and have remarkable tensile strength, flexibility, and high cellulose content. The development of these fibers involves coordinated regulation of genes associated with cell division, elongation, and secondary wall deposition [114]. Transcriptomic studies have identified numerous genes involved in cellulose biosynthesis, including members of the cellulose synthase (CesA) gene family, as major contributors to fiber formation [115]. A genome-wide characterization of cellulose synthase (CESA) and cellulose synthase-like (CSL) gene families identified 29 CESA/CSL genes distributed across seven subfamilies in C. sativa [98]. The CsCESA genes are directly involved in cellulose microfibril formation, a key determinant of tensile strength, while CSL genes are likely to contribute to hemicellulose biosynthesis, influencing fiber flexibility and structural integrity. Tissue-specific expression patterns further indicate functional specialization of these genes in stem and fiber development, highlighting their potential as targets for genetic improvement [98].
Genome-wide association studies (GWASs) have provided additional insights into the genetic factors controlling fiber characteristics. Using more than 600,000 SNP markers, researchers identified 16 quantitative trait loci linked to fiber composition traits, including cellulose-derived sugars, lignin content, and bast fiber proportion that appeared consistently across multiple growing environments. The candidate genes found within these QTL regions are associated with carbohydrate metabolism, lignin biosynthesis, and cell wall organization, highlighting the importance of carbon allocation and polymer assembly pathways in determining fiber quality [116].
The influence of environmental conditions on fiber traits further reflects underlying gene regulatory networks. When plants are exposed to heavy-metal stress (Cd and Zn), differential expression of genes encoding cellulose synthases, fasciclin-like arabinogalactan proteins (FLAs), and class III peroxidases has been observed, indicating that cell wall biosynthesis and remodeling pathways are responsive to environmental cues [117]. While this work was conducted under stressed conditions, it points to a broader principle: the genetic networks governing fiber development are not fixed but actively tuned by signals from the surrounding environment.
Phenotypic studies conducted across a wide range of hemp germplasm have demonstrated substantial genetic variation and moderate-to-high heritability for fiber-related traits, including cellulose, hemicellulose, lignin content, and bast fiber yield [116]. Together, these findings suggest that both fiber quality and quantity are amenable to genetic improvement through marker-assisted and genomic selection strategies, supported by emerging genomic resources.
Collectively, progress in functional genomics and quantitative genetics has demonstrated that fiber quality and yield in Cannabis sativa are governed by a web of interconnected biosynthetic pathways and regulatory networks that integrate developmental programming with the ability to respond to environmental conditions. These findings have advanced our understanding of the genetic basis of fiber-related traits and strengthened the foundation for breeding cultivars with enhanced fiber quality, yield, and adaptability across diverse growing conditions.

7. Genetic Basis of Seed Quality, Oil Composition, and Yield in Cannabis sativa

Industrial hemp seeds are a high-value agricultural product due to their nutritional composition and expanding industrial applications. The traits that define this value are protein content, oil composition, fatty acid profile, seed size, and overall yield. These are quantitatively inherited and shaped by the interplay between genetic architecture and environmental conditions [118,119]. Recent genomic and transcriptomic work is beginning to clarify the molecular basis of these traits (Table 1), pointing to coordinated regulation across storage, metabolic, and developmental pathways.

7.1. Seed Storage Proteins and Nutritional Quality

The nutritional profile of industrial hemp seed is largely defined by its proteins. The primary storage protein found in industrial hemp seeds is edestin, a type of globulin protein encoded by a multigene family with distinct isoforms (e.g., CsEde1AD and CsEde2AC) [95]. These isoforms exhibit differential expression during seed development, with type 1 being generally more abundant and type 2 being enriched in sulfur-containing amino acids, which contributes to the seed’s overall nutritional value [95].
Additional storage proteins, including 2S albumins and 7S vicilin-like proteins, are encoded by multigene families with variable copy number and expression patterns. Transcriptomic analyses studies show that edestin and 2S albumin genes are among the highest expressed genes in seed development [96]. These findings suggest that seed protein quality is not dictated by individual genes but by the coordinated expression of multiple gene families.

7.2. Oil Biosynthesis and Fatty Acid Composition

In parallel with protein accumulation, industrial hemp seed oil is another key quality trait, particularly due to its high proportion of polyunsaturated fatty acids (PUFAs), including linoleic and α-linolenic acids, which vary significantly among genotypes [120]. This variation reflects genetic control over lipid biosynthetic pathways operating during seed development.
Key enzymes include stearoyl-ACP desaturase (SAD) and fatty acid desaturases (FAD2 and FAD3), which regulate desaturation steps that determine fatty acid composition. These genes are highly expressed in developing seeds and are critical to triacylglycerol (TAG) biosynthesis and PUFA accumulation [97]. Genome-wide analyses have identified approximately 36 genes involved in lipid metabolism pathways, providing a foundation for functional characterization and breeding [19].
Emerging genetic manipulation strategies, including the targeted editing of desaturase genes, suggest potential for modifying oil composition, as demonstrated in related oilseed crops [121,122,123]. However, the application of such techniques in industrial hemp remains limited and is still in the early stages of exploration.

7.3. Seed Size, Yield and Genetic Control

Beyond compositional traits, physical traits such as seed size and overall yield are important key determinants of overall productivity and oil output. QTL mapping studies have revealed a highly polygenic architecture underlying traits such as hundred-seed weight (HSW), seed length, width, and volume, with major loci (e.g., qHSW3, qSV1, qSL1, and qSW4) explaining substantial phenotypic variation [124]. These loci represent important targets for genetic improvement of seed yield.
In biparental mapping populations, numerous QTLs for seed and agronomic traits have been identified, with several loci explaining more than 10% of phenotypic variance [119,124]. These often cluster together in specific regions of the genome, suggesting that a relatively small number of genomic regions may control multiple yield-related traits. Importantly, many seed-related QTLs appear genetically independent from plant architecture traits, indicating the potential to improve seed yield without negatively affecting plant growth [124].

7.4. Seed Retention and Seed Shattering

One of the major challenges in the industrial hemp grain industry is seed shattering. This significantly reduces the marketable yield and hinders mechanical harvesting [78,125,126]. The genetic basis of seed shattering has been well studied in major cereal and oilseed crops [127], yet remarkably little is known about how Cannabis sativa controls seed retention and dispersal [78]. The trait most likely involves developmental pathways linked to seed attachment strength and inflorescence architecture [128]. Characterizing the genes and regulatory networks underlying seed retention will be critical to improving harvest efficiency and yield stability, particularly as hemp breeding increasingly targets grain-specific and dual-purpose genotypes [129,130]. Approaches combining genome-wide association studies, QTL mapping, and functional genomics offer a powerful framework for elucidating the genetic basis of this trait and supporting the development of genotypes with enhanced seed retention.

7.5. Genetic Variation and Breeding Potential

In studies examining diverse hemp germplasm collections, considerable phenotypic and genomic variation has been observed in seed protein composition, oil content, fatty acid profiles, and antinutritional factors, highlighting strong heritable control over seed quality traits [120]. The convergence of multigene family regulation, lipid biosynthesis networks, and polygenic yield loci provides a comprehensive genetic framework for seed trait improvement.
These advances reveal that seed quality and yield in C. sativa are governed by interconnected genetic systems spanning the regulation of storage proteins, lipid metabolism, and quantitative trait architecture. Building on this understanding, marker-assisted selection and genomic prediction represent promising approaches to developing industrial hemp genotypes optimized for seed, oil, and dual-purpose production systems. Combining detailed multi-omics profiling with functional validation will be crucial in the future to linking candidate genes to phenotypic stability and enhancing nutritional and yield-related traits across diverse environments.

8. Genetic Basis of Disease Resistance in Industrial Hemp

As the cultivation of industrial hemp expands across diverse agroecological regions, disease resistance has emerged as a critical target for genetic improvement [131,132]. Unlike major row crops, industrial hemp has historically lacked systematic disease resistance breeding due to regulatory constraints that limited germplasm exchange and genetic investigation.
Recent genomic work, however, is beginning to fill that gap, establishing a clear genetic basis for resistance to powdery mildew and laying the groundwork for incorporating resistance traits into modern breeding programs. In many crop species, resistance is mediated through resistance (R) genes that recognize pathogen-derived effectors and activate immune responses [133,134]. Emerging evidence suggests that similar mechanisms operate in C. sativa, particularly in resistance against powdery mildew, one of the most economically damaging diseases affecting industrial hemp production. A major advance was the identification of PM1, the first mapped resistance locus conferring complete resistance to powdery mildew in industrial hemp. Using high-density SNP mapping in segregating populations, PM1 was localized to a defined genomic region and demonstrated dominant inheritance of resistance. Importantly, PM1-mediated resistance was stable across environments and genetic backgrounds, indicating strong potential for breeding programs. The development of tightly linked molecular markers further enabled marker-assisted selection (MAS), representing the first practical genetic tool for disease resistance improvement in industrial hemp [135]. Beyond its applied value, PM1 established that canonical R gene-mediated immunity operates in C. sativa, shifting disease resistance research from phenotypic observation to mechanistic genetics [136]. This finding provided a conceptual framework for identifying additional resistance loci and applying comparative plant immunity models widely used in other crop systems.
A second resistance locus, PM2, has been identified, demonstrating that powdery mildew resistance in industrial hemp is genetically heterogeneous rather than monogenic. Unlike PM1, PM2 mediates resistance through suppression of fungal penetration and induction of a localized hypersensitive response, indicating a distinct defense mechanism. The coexistence of these loci suggests that multiple immune pathways contribute to resistance against G. ambrosiae [137]. Together, PM1 and PM2 constitute the first genetically defined framework for disease resistance in industrial hemp, one that is both tractable and mechanistically diverse. These loci enable marker-assisted breeding and offer a concrete entry point for integrating disease resistance into elite genotypes.
While powdery mildew represents a well-characterized disease resistance system in Cannabis, transcriptomic studies indicate that broader immune networks are activated in response to fungal pathogens. The dual RNA seq analysis of the interaction between Cannabis sativa plants and the pathogenic fungus Sclerotinia sclerotiorum has highlighted how infection leads to the complete reprogramming of the plant’s immune responses, defense mechanisms, and stress responses on a systems level [87]. This points to the intricate nature of the transcriptional responses involved in fungal resistance apart from just loci of resistance.
Despite these advances, resistance to many economically important pathogens remains largely unexplored. Fine mapping, gene cloning, functional validation, and the application of transcriptomic and genome-editing tools will be essential to converting these loci into durable, deployable resistance strategies and for building a broader understanding of how immunity operates in C. sativa as a species.

9. Genetic Basis of Abiotic Stress Tolerance in Industrial Hemp

As Cannabis sativa is increasingly cultivated across diverse environments characterized by drought, salinity, high temperature and alkaline soils, interest in the genetic basis of abiotic stress tolerance has grown considerably. Unlike traits, which can sometimes be linked to discrete loci, abiotic stress tolerance in industrial hemp is fundamentally a quantitative trait controlled by coordinated transcriptional, hormonal, metabolic responses and environmental conditions. Recent transcriptomic studies have started to map this molecular complexity in meaningful ways.
In industrial hemp, drought stress has been extensively studied at molecular level [100,138,139,140]. Genome-wide RNA sequencing under water deficit conditions has identified more than 1200 differentially expressed genes, including transcription factors from the NAC and B3 families, as well as genes involved in abscisic acid (ABA) signaling, cell wall modification, and reactive oxygen species (ROS) scavenging [100]. These results indicate that drought tolerance involves broad and coordinated transcriptional reprogramming that integrates hormonal signaling with cellular protection and water use regulation.
Studies integrating physiological and transcriptomic analyses have shown that drought response is not limited to stress signaling alone but also involves reconfiguration of primary metabolism. Rather than simply shutting down in response to stress, industrial hemp maintains growth stress balance by adjusting its carbohydrate metabolism, nitrogen utilization, and key hormone signaling pathways, including those involving ABA, auxin, and cytokinin [139]. This difference between active metabolic adjustment and passive stress suppression has important implications for how tolerance traits might be selected and improved.
Similar regulatory complexity is evident under saline and alkaline conditions. Under saline–alkaline conditions, genes involved in hormone signaling, phenylpropanoid biosynthesis, and carbon nitrogen metabolism are significantly enriched, indicating that ionic and pH stress tolerance requires coordinated metabolic and structural adaptation [141]. Suppression of photosynthetic processes combined with activation of antioxidant and detoxification pathways further highlights the dynamic nature of stress responses in Cannabis [142].
At a more targeted level, genome-wide analysis of the WRKY transcription factor family has helped identify key regulatory nodes within these stress networks. Multiple CsWRKY genes were found to be consistently induced by drought, salt, and cold stresses, with Group III WRKYs showing particularly broad and robust responsiveness across stress types [101]. Apart from the WRKY transcription factors, the genome-wide analysis of the GATA transcription factors in Cannabis sativa has been used for identifying lineage-specific expansion and response to stress, especially cold and salt stress during germination, indicating that these transcription factors may play regulatory roles in abiotic stress responses [143]. These transcription factors are strong candidates for central regulatory roles in coordinating abiotic stress-responsive gene expression in industrial hemp.
Moreover, the genome-wide identification and expression studies of the MYB transcription factor family indicate that MYB transcription factors exhibit dynamic regulatory activities in response to salt stress conditions in seed germination phases. Thus, MYB transcription factors also seem to contribute to abiotic stress tolerance at an early developmental phase [102]. Apart from the regulatory role of MYB transcription factors, genome-wide analysis of TIFY transcription factor genes in Cannabis sativa shows their involvement in alkali stress response during seed development. Additionally, there is a possibility of their involvement in cannabinoid metabolism. These findings suggest that TIFY transcription factors regulate the jasmonate signaling pathway and may act as a link between stress responses and secondary-metabolite synthesis [144].
Collectively, current evidence indicates that abiotic stress tolerance in C. sativa is governed by an interconnected regulatory system involving transcriptional reprogramming, hormone signaling, metabolic flexibility, and oxidative stress management. Even though the functional validation of candidate genes is limited, available genomic and transcriptomic resources have significantly improved our understanding of the genetic basis of stress adaptation. This provides a valuable framework for the genetic improvement of industrial hemp for enhanced environmental resilience.

10. Functional Genomics and Genome Editing

10.1. The Functional Validation Gap

Several candidate genes in Cannabis sativa associated with traits such as cannabinoid biosynthesis, sex determination, fiber development, seed quality, disease resistance, and abiotic stress tolerance have been identified through expanding genomic resources. However, the biological functions of most candidate genes are poorly characterized. For practical crop improvement in Cannabis sativa, functional validation serves as a bridge between genomic discovery and breeding applications. Over the past decade, research into the genetic editing and targeted engineering of Cannabis sativa has accelerated considerably (Table 2) but remains in its early stages relative to major crops such as rice or maize. The Cannabis sativa genome has long been challenging to genetically manipulate due to recalcitrance in tissue culture and transformation, limiting functional validation of genes and targeted trait improvement [145]. However, recent studies have begun to unlock both transient and stable gene manipulation systems that lay the groundwork for future breeding and trait engineering in industrial hemp [26,41,146].

10.2. Transient Manipulation Toolkits for Rapid Functional Screening

One of the recent breakthroughs in the field of functional genomics in Cannabis sativa is related to the creation of a transient reverse genetic technique based on virus-induced gene silencing (VIGS) [152]. Using a viral vector based on Cotton leaf crumple virus, successful suppression of reporter genes, including phytoene desaturase (PDS) and magnesium chelatase subunit I (ChlI) genes, was achieved. As a result, they observed a photobleaching phenotype and a reduction of approximately 70–73% in target gene transcripts [152]. This has been further expanded using Tobacco Rattle Virus (TRV) vectors as a rapid reverse genetics tool in Cannabis sativa, enabling efficient gene silencing in multiple tissues and allowing for the functional screening of candidate genes without stable transformation. TRV-mediated silencing successfully reduced transcript levels of target genes, providing a fast and reliable system for validating gene function in metabolic and developmental pathways [153].
Building on VIGS, Cannabis sativa transient expression systems were further optimized through development of an Agrobacterium-mediated agroinfiltration platform. This platform enables both transient gene expression and gene silencing across industrial hemp tissues. Using hairpin RNAi constructs targeting CsPDS, they observed efficient silencing and albino phenotypes in mature leaves and reproductive tissues, with transcript levels below 20% of controls. Such agroinfiltration systems are crucial to rapidly testing gene function, overexpression, and knockdown in different tissues without stable integration [155]. Transient overexpression has also successfully validated complex metabolic fluxes. A high-efficiency transient transformation system was developed in the Cheungsam genotype to overexpress CsCBDAS2, a key enzyme in the cannabinoid biosynthesis pathway. This approach resulted in coordinated upregulation of downstream cannabinoid biosynthetic genes and demonstrated the ability to functionally validate and manipulate metabolic pathways in Cannabis without stable transformation. Such transient overexpression systems complement VIGS, agroinfiltration, and TRV-mediated silencing by providing a flexible tool for the rapid functional analysis and metabolic engineering of target genes [105].
Although powerful for rapid gene characterization, these transient systems do not produce heritable modifications and are therefore limited to functional validation rather than direct cultivar development.

10.3. Stable Transformation and CRISPR-Based Genome Editing

The ultimate goal of functional genomics is the generation of stable and heritable genetic modifications. This was advanced by the first successful stable gene editing in industrial hemp using Agrobacterium-mediated transformation combined with CRISPR/Cas9. By optimizing explant choice and culture conditions, including the use of a developmental regulator chimera to enhance regeneration, the study generated transgenic C. sativa plants with targeted knockout of the phytoene desaturase gene (CsPDS1). Edited seedlings exhibited the expected albino phenotype, and T-DNA integration was confirmed in the Cannabis genome, validating both transformation and editing in a stable context. The work represents the first proof of concept for CRISPR/Cas9-based editing in industrial hemp and provides a foundation for future functional genomics and trait improvement [146]. Beyond gene knockouts, CRISPR-based technologies offer opportunities for engineering cannabinoid biosynthesis, modifying flowering behavior, manipulating sex expression, improving disease resistance, and enhancing stress tolerance.
Despite these advances, transformation and regeneration remain the major bottlenecks limiting widespread genome editing in Cannabis. Transformation efficiencies remain highly genotype-dependent, and many elite cultivars exhibit poor regeneration responses. An improved transformation protocol using a rapid Agrobacterium-mediated transformation method has been described: it produced stably transformed C. sativa plants from various explants (hypocotyls, cotyledons, and meristems) and achieved higher regeneration and transformation rates than earlier attempts. This methodology not only provides a route for transgene insertion but also enhances the potential for targeted editing using CRISPR/Cas systems by increasing the pool of transformable tissues [156].
In parallel, genotype-independent de novo regeneration systems are emerging, enhancing shoot organogenesis from explants across diverse industrial hemp cultivars without genotype limitations [157]. Scalable regeneration is arguably the rate-limiting step in industrial hemp transformation, and progress here has system-wide implications for how reliably genome editing can be applied across breeding germplasm.

10.4. Bottlenecks in Genome Editing and Functional Genomics

However, despite rapid advances in both transient and stable gene manipulation systems, Cannabis sativa still lags behind major model and crop species in the implementation of advanced genome engineering technologies. This gap is primarily due to a combination of biological and technical constraints. A key limitation is the strong genotype-dependent recalcitrance to tissue culture and regeneration, where many elite industrial hemp cultivars show poor callus induction and low shoot regeneration efficiency [158,159]. These limitations directly reduce the recovery of stable edited lines and hinder scalable genome-editing applications. In addition, low and inconsistent transformation efficiency, particularly in Agrobacterium-mediated systems [146,156], further restricts reproducibility across genotypes and laboratories, preventing the establishment of standardized editing pipelines comparable to those in model crops.
Another limiting factor is the incomplete functional annotation of Cannabis genomes and the lack of well-resolved gene regulatory networks for complex traits such as cannabinoid biosynthesis, sex determination, and stress adaptation [21,24]. Most candidate genes identified through these approaches remain experimentally unvalidated [147,160,161], creating a significant gap between genomic prediction and functional application.

10.5. Future Directions in Cannabis Genome Engineering

Current gene editing primarily relies on first-generation CRISPR/Cas9 approaches, while advanced technologies such as multiplex genome editing, base editing, prime editing, and transgene-free editing have not yet been widely implemented in Cannabis [162,163,164]. This limited implementation is largely constrained by regeneration inefficiency and transformation bottlenecks, as well as the absence of genotype-independent delivery systems. Further limitations are caused by the regulatory uncertainty surrounding genetically edited Cannabis [165].
Collectively, the development of VIGS, viral silencing systems, agroinfiltration platforms, transient overexpression tools, and CRISPR/Cas9-based genome editing represents a rapid evolution of functional genomics in C. sativa [146,152]. Through these advances, Cannabis research has been shifting from descriptive genomics toward mechanistic gene function analysis and targeted metabolic engineering. Together, they mark a transition from simple observing genomic patterns to functional and engineering-based biology. This opens new opportunities for the improvement in cannabinoid content, disease resistance, seed quality, and stress tolerance.
Looking ahead, the next phase of Cannabis functional genomics will require integration of next-generation genome engineering strategies with improved regeneration and transformation platforms. Future research should prioritize the adoption of multiplex genome-editing systems, enabling simultaneous manipulation of cannabinoid biosynthetic networks, regulatory transcription factors, and developmental genes [166,167]. Similarly, base editing [163,168] and prime editing [164,169] technologies offer the ability to introduce precise nucleotide changes without double-strand breaks. This is highly valuable for fine-tuning enzyme activity within cannabinoid and terpene pathways. These approaches would allow for allele-level engineering rather than complete gene disruption, enabling more predictable metabolic outcomes.
In addition, transgene-free genome-editing strategies, such as RNP-based delivery systems and DNA-free CRISPR approaches [162,170], will be essential to generating edited lines that are more acceptable for regulatory approval and commercial deployment. Integration of such systems with improved regeneration protocols could significantly accelerate the production of edited elite cultivars.
The future of Cannabis sativa improvement will depend on the integration of multi-omics-driven gene discovery with high-throughput functional validation and genome engineering platforms. Establishing genotype-independent transformation systems, improving regeneration efficiency, and developing standardized editing pipelines will represent a major step towards bridging the current gap between genomic knowledge and applied breeding. Such advances will enable a shift from gene discovery to programmable genome engineering, accelerating the development of industrial hemp cultivars with optimized cannabinoid profiles, enhanced stress resilience, and improved agronomic performance.

11. Molecular Breeding and Genetic Improvement Strategies

The integration of genomic resources into breeding pipelines has reshaped crop improvement across many species [171,172,173,174], and similar approaches are increasingly being applied to industrial hemp [175,176]. For much of its cultivated history, industrial hemp breeding operated on phenotypic selection alone, a process that is inherently slow, labor-intensive, and inefficient for complex traits influenced by numerous loci and strong genotype-by-environment interactions [177,178]. The emergence of high-quality reference genomes, pangenomic resources, genome-wide markers, and trait-mapping studies has created opportunities to transition from phenotype-driven selection toward predictive, genome-informed breeding strategies [21,27,58].
The modern molecular breeding pipeline begins with the identification of genomic regions associated with target traits through linkage mapping, genome-wide association studies (GWASs), transcriptomics, and pangenomic analyses [21,106,113]. These approaches generate candidate loci controlling economically important traits such as cannabinoid composition, flowering time, sex determination, disease resistance, fiber quality, seed oil composition, and abiotic stress tolerance [75,116,179]. Functional validation using transient expression systems, gene silencing technologies, and genome editing subsequently provides biological evidence linking genotype to phenotype [106,152]. Once validated, these loci can be converted into molecular markers that support selection within breeding populations, thereby creating a direct pathway from genomic discovery to cultivar development.
Marker-assisted selection (MAS) remains one of the earliest applications of molecular breeding in industrial hemp breeding. By using tightly linked markers to identify favorable alleles at early developmental stages, breeders can substantially reduce reliance on labor-intensive phenotyping and accelerate breeding cycles. Marker systems associated with cannabinoid profiles, sex determination, and powdery mildew resistance illustrate the potential of MAS for traits controlled by major-effect loci [72,175,180]. However, MAS is most effective for traits controlled by a small number of loci with large effects, and its utility is limited for highly polygenic traits [181]. Moreover, the development and validation of minimal SNP genotyping arrays in Cannabis sativa have enhanced the ability to differentiate among various cultivars and germplasms at relatively affordable costs [148]. The development of SNP-based genotyping platforms and Kompetitive Allele Specific PCR (KASP) markers has further enhanced the efficiency of germplasm characterization, cultivar identification, and marker-assisted breeding [182].
To overcome these limitations, genomic selection (GS) has emerged as a promising strategy for improving complex traits controlled by many small-effect loci [183,184]. Unlike MAS, GS uses genome-wide marker information to predict the breeding values of individuals, allowing selection without direct phenotyping. This approach captures both major and minor genetic effects and enables selection prior to phenotypic evaluation. In industrial hemp, GS holds significant promises for traits such as fiber yield, seed yield, and stress tolerance, where traditional selection is challenging. Because many commercially important industrial hemp traits exhibit complex genetic architectures and substantial environmental influence, GS can accelerate breeding far faster than traditional methods. While still emerging in industrial hemp, recent studies have demonstrated the feasibility of GS and related predictive approaches for traits such as flowering time, biomass accumulation, and cannabinoid composition [75,161]. Nevertheless, successful implementation requires large training populations, accurate phenotypic datasets, robust statistical models, and extensive marker coverage across diverse germplasm [185].
Developments in machine learning and artificial intelligence further expand the predictive capabilities of molecular breeding [186,187]. Multi-trait prediction models that integrate genomic, phenotypic, and environmental data have demonstrated improved accuracy compared with traditional statistical approaches [187]. As multi-omics datasets become increasingly available, integrating genomic, transcriptomic, metabolomic, and environmental information will enable more precise prediction of complex phenotypes and improve selection efficiency across diverse environments.
Molecular breeding strategies are also increasingly being integrated with speed breeding and controlled-environment agriculture [188,189,190]. By accelerating generation turnover, speed breeding enables more rapid population advancement and increases the efficiency of both MAS and GS [191,192]. When combined with genomic prediction and high-throughput phenotyping data, these methods create a self-improving breeding cycle where genomic data guide selection, phenotyping validates predictions, and new data continuously improve predictive models [193,194]. Such approaches have the potential to substantially reduce the time required to develop elite genotypes [189,193,194].
Before molecular breeding can be routinely implemented across industrial hemp breeding programs, several significant challenges still need to be overcome. Compared with major crops, Cannabis breeding has only recently gained access to high-quality reference genomes and large-scale genotyping resources [20,22,64,147,182]. Many economically important traits exhibit complex genetic architectures and strong genotype-by-environment interactions that are poorly understood [26]. In addition, limited access to diverse public germplasm collections, fragmented breeding efforts, inconsistent phenotyping protocols, and a lack of large multi-environment datasets have constrained the development and validation of robust molecular breeding tools [10,26]. As a result, the accuracy of marker-assisted and genomic selection approaches remains lower than that achieved in other major crop breeding systems.
Future progress will require the integration of pangenomics, functional genomics, high-throughput phenotyping, environmental data, and advanced predictive modeling into unified breeding frameworks (Figure 4). The development of large, well-phenotyped populations across diverse environments will be essential to improving genomic prediction accuracy and identifying stable marker–trait associations. Although still in its early stages in Cannabis sativa, controlled-environment cultivation systems and photoperiod manipulation strategies are already being applied to enable rapid generation cycling and accelerated breeding through speed breeding approaches [195,196]. These approaches have the potential to significantly shorten breeding cycles by enabling multiple generations per year, thereby increasing selection intensity and accelerating the fixation of favorable alleles. However, their efficiency in Cannabis remains genotype- and environment-dependent and requires further optimization across diverse industrial hemp germplasm [195]. Furthermore, combining genomic selection with speed breeding, controlled-environment agriculture, and genome-editing technologies will enable more rapid deployment of favorable alleles and accelerate genetic gain. Emerging machine learning approaches that integrate genomic, transcriptomic, metabolomic, and environmental datasets are expected to improve prediction of complex traits and support data-driven breeding decisions [197,198,199].
Looking ahead, the future of Cannabis molecular breeding will likely move toward integrated genome-to-phenome platforms that combine pangenome-informed allele discovery, functional validation of candidate genes, genomic prediction modeling, and targeted genome engineering. As advanced tools become available, including genotype-independent transformation systems, transgene-free genome editing, multiplex editing, and precision breeding technologies, direct manipulation of complex genetic networks that control cannabinoid biosynthesis, flowering time, stress adaptation, fiber quality, and seed composition will be achievable. Together, these integrated pipelines have the potential to transform Cannabis improvement from a largely selection-based method to a more predictive, engineering-driven discipline, accelerating the development of elite cultivars optimized for fiber, grain, cannabinoid, and resilience to changing climates.

12. Challenges in Industrial Hemp Genetic Research

Genetic research in Cannabis sativa faces a combination of challenges that is, in some respects, unique among crop species. Genomic complexity, an outcrossing reproductive system, decades of regulatory restriction, and the relative youth of organized breeding programs have all left their mark on the field. Where major crops benefit from deep genetic resources, well-validated marker platforms, and generations of systematic breeding support, industrial hemp genomics must contend with high repetitive DNA content, incomplete reference assemblies, complex recombination landscapes, and persistent difficulties in trait mapping [46,58,200]. Addressing these limitations is essential to translating genomic knowledge into improved fiber, grain, cannabinoid, and climate-resilient cultivars.
At the genomic level, one of the most persistent challenges is the complexity of the Cannabis genome itself. Even though recent long-read sequencing technologies have significantly improved assembly quality, the genome remains difficult to characterize because of extensive repetitive sequences. Early genomic analyses revealed that a large fraction of C. sativa DNA comprises retrotransposons, particularly LTR/Copia and LTR/Gypsy elements, along with low-complexity sequences, which vary greatly among genotypes and contribute to assembly fragmentation and structural variation across genotypes. These repetitive elements hinder short-read assembly algorithms and complicate the placement of markers in structurally complex regions [52]. In addition, pangenomic studies have demonstrated that a single reference genome captures only a fraction of the species-wide genetic diversity. Presence–absence variation, copy number variation, and large structural rearrangements contribute significantly to trait variation but are frequently absent from traditional reference assemblies. As a result, reliance on a single reference genome can limit the identification of important alleles associated with agronomic performance [20].
These genomic challenges are further amplified at the population level by the highly heterozygous, outcrossing nature of C. sativa [17,201]. High heterozygosity complicates both genome assembly and genetic mapping, and assembly algorithms may collapse divergent alleles into consensus sequences or incorrectly partition haplotypes into separate contigs, leading to errors in gene annotation and inaccurate representation of sequence variation [202,203]. At the population level, heterozygosity generates linkage disequilibrium patterns that sit awkwardly within standard GWAS and QTL analysis frameworks, adding statistical complexity to an already difficult mapping environment [27].
Unlike many crop species, industrial hemp is dioecious with heteromorphic sex chromosomes. Most reference genomes focus on female (XX) plants, leaving the male (XY) genome poorly represented. This asymmetric representation makes it difficult to identify male-specific sequences and sex-determining factors, exacerbating difficulties in mapping sex-linked traits and designing sex identification markers [76]. Transcriptomic studies suggest that gene regulatory networks controlling male and female floral development are substantially divergent, and the absence of robust Y chromosome assemblies limits the ability to test candidate sex determination genes across genetic backgrounds [76].
Beyond genomic structure, a major bottleneck lies in incomplete genotype-to-phenotype resolution. Genome-wide association studies, QTL analyses, transcriptomics, and pangenomic investigations have identified numerous candidate genes associated with cannabinoid production, sex determination, fiber quality, seed composition, disease resistance, and abiotic stress tolerance [93,116,204]. However, relatively few of these candidates have undergone functional validation [144]. The limited availability of efficient transformation and genome-editing systems has slowed the transition from gene discovery to mechanistic understanding, creating a significant bottleneck in Cannabis functional genomics.
From a breeding perspective, industrial hemp also lacks standardized, high-density marker platforms broadly validated across diverse germplasm. Despite the recent development of genotyping tools like the HASCH SNP panel, industrial hemp still suffers from a lack of standardized, high-density marker platforms that are broadly validated across diverse germplasm. HASCH provides a mid-density set of 1504 SNPs suitable for linkage mapping and QTL analysis, but trait architectures for many important characteristics, including disease resistance, stress tolerance, and yield components, remain poorly resolved, in part due to limited marker density and uneven coverage across complex genomic regions [25]. Many agronomically important traits, such as disease resistance, abiotic stress tolerance, and fiber quality, are polygenic and subject to strong genotype-and-environment interactions. This complexity demands large datasets and precise phenotyping to disentangle genetic effects from environmental noise, a costly and time-intensive requirement that has slowed progress in industrial hemp relative to major cereals and horticultural crops [200,205,206].
At the application level, transformation and regeneration remain among the most significant technical barriers facing Cannabis biotechnology. While recent advances have demonstrated successful stable transformation and CRISPR/Cas9-mediated genome editing, transformation efficiency remains highly genotype-dependent and often difficult to reproduce across laboratories [26,45,207]. Many commercially important cultivars remain recalcitrant to regeneration, restricting the application of genome-editing technologies to a limited number of genetic backgrounds [207,208]. The development of genotype-independent regeneration systems and more efficient transformation platforms therefore represents critical importance for future research [209].
Another important limitation is the relative scarcity of large-scale, well-characterized breeding populations. Modern breeding approaches such as genomic selection depend on extensive phenotypic and genotypic datasets that capture genetic variation across diverse environments [210]. However, many industrial hemp breeding programs still lack sufficiently large training populations and standardized phenotyping protocols [28]. In addition, genotype-by-environment interactions substantially influence key traits, including cannabinoid content, flowering time, biomass accumulation, and stress tolerance, reducing the predictive accuracy of genomic models and complicating selection decisions [210].
The integration of multi-omics datasets represents another emerging challenge. Although individual studies have identified genes, metabolites, and regulatory pathways associated with important traits in industrial hemp, relatively few investigations have combined these datasets to develop comprehensive systems-level understanding [211]. Consequently, many of the molecular mechanisms linking genetic variation to phenotypic outcomes remain unresolved.
Regulatory and germplasm-related constraints further complicate Cannabis research. Legal restrictions in many regions continue to limit access to diverse germplasm, impede international exchange of genetic resources, and restrict multi location field evaluations [212]. These limitations have slowed the development of global breeding networks and reduced opportunities for comparative studies across genetically diverse populations. The fragmented nature of available germplasm collections has hindered the establishment of standardized reference panels comparable to those available in major crop species [28].
These challenges highlight the need for a more integrated approach to Cannabis research. Bridging these gaps is fundamental to realizing the full agricultural and industrial capacity of Cannabis sativa.

13. Future Perspectives

The future of industrial hemp genetic research is moving in a clear direction away from the study of individual traits and toward a systems-level view of the plant. Unlocking the full potential of Cannabis sativa will depend on how effectively researchers can bring together diverse biological data and translate them into practical breeding outcomes. Rather than being treated as distinct fields, genomics, transcriptomics, proteomics, and metabolomics are increasingly seen as interconnected, which can be used to examine the same underlying biology. Used together, these tools can reveal the regulatory networks that control a wide range of traits, from cannabinoid production and fiber characteristics to stress tolerance. This shift toward integrated multi-omics represents an important transition from largely empirical breeding toward a more predictive and design-oriented framework, where complex traits can be understood and manipulated with greater precision.
Future gene discovery efforts are likely to move beyond single-reference-genome analyses toward pangenomic and graph-based genomic frameworks. Recent studies have demonstrated that structural variation, presence–absence variation, and genotype-specific genomic regions contribute substantially to trait diversity within C. sativa. As additional genomes become available, graph genomes and pangenomic resources will provide a more complete representation of species-wide genetic diversity and improve the identification of functional alleles controlling economically important traits. These resources will be particularly valuable for dissecting complex traits such as fiber quality, seed yield, stress tolerance, and cannabinoid production, which are influenced by multiple genes and regulatory pathways.
Integration of multi-omics technologies will become increasingly important for understanding the molecular basis of trait variation. While genomics identifies candidate genes, transcriptomics reveals gene regulation patterns, metabolomics characterizes biochemical outputs, and epigenomics identifies environmentally responsive modifications that influence gene expression. Incorporating these datasets together will build systems-level models that accurately describe the biological pathways linking genotype to phenotype. Such approaches have already transformed research across several major crops and are expected to play a similarly important role in industrial hemp.
Functional validation of identified candidate genes will remain central to this transition. The development of CRISPR-based genome editing, together with speed breeding systems that accelerate generation turnover, is opening new possibilities for rapid trait validation and genotype development. Future genome-editing approaches may extend beyond CRISPR/Cas9 systems to include multiplex editing, base editing, prime editing, and transgene-free editing strategies. These emerging technologies offer the potential to precisely modify cannabinoid biosynthetic pathways, improve stress resilience, alter flowering behavior, and enhance seed and fiber quality while minimizing unintended genomic changes.
Predictive approaches that integrate genomic information with advanced phenotyping technologies will play a central role in shaping the future of Cannabis sativa crop improvement. Genomic selection, machine learning, and artificial intelligence are expected to become important components of breeding programs, particularly for complex quantitative traits that are difficult to improve using conventional selection methods. Incorporating genomic, environmental, and phenotypic data into predictive models will assist breeders to identify superior genotypes earlier in the breeding cycle and accelerate genetic gain. Incorporating high-throughput phenotyping platforms, including drone-based imaging, hyperspectral sensing, and automated environmental monitoring systems, will significantly strengthen these predictive frameworks by generating large-scale phenotypic datasets across diverse environments. Combining genomic prediction, genome editing, and speed breeding into unified crop improvement pipelines has the potential to dramatically shorten breeding cycles and increase the efficiency with which new cultivars are developed. Similar approaches are already being implemented in several major crop species and represent a logical next step for industrial hemp improvement.
Looking further ahead, the systematic expansion and characterization of global industrial hemp germplasm will be essential. Landraces and feral populations represent reservoirs of genetic diversity that are poorly explored, and that diversity may hold solutions to challenges, particularly climate-related ones, that are not yet fully understood. Generating high-quality, long-read genome assemblies for these populations and linking them to detailed phenotypic data would provide an important resource for Cannabis research. High-throughput consistent phenotyping methods for evaluating complex agronomic traits such as fiber-processing efficiency, metabolite profiles, and root architecture are equally important, as they would allow data to be compared and combined across studies. Coordinated common garden experiments conducted across contrasting environments would further help separate genuine genetic effects from environmental noise, particularly for traits like flowering time and regulatory compliance, where genotype-by-environment interactions can be substantial.
Looking beyond individual technologies, the future of Cannabis sativa research lies in building integrated genome-to-phenotype-to-breeding pipelines. The convergence of pangenomics, multi-omics integration, genome editing, artificial intelligence, high-throughput phenotyping, and accelerated breeding strategies will redefine C. sativa improvement over the coming decade. Beyond scientific progress alone, regulatory constraints and fragmented international policies remain significant barriers to the advancement of industrial hemp research. Better organization between regulatory authorities and collaboration among scientists from different disciplines will be just as important. With sustained investment and genuine coordination, industrial hemp has the potential to move beyond the constraints that have historically limited it, and to establish itself as a meaningful contributor to a more sustainable, bio-based economy.

14. Conclusions

Cannabis sativa L. has rapidly emerged from being an understudied crop to a genomically characterized species of considerable economic importance in both agricultural and industrial sectors. Advances in DNA-sequencing technologies, genome assembly, and pangenomic analysis have improved our understanding of the genome architecture and revealed the high level of genetic diversity underlying key trait variation. Significant advances in molecular genetics have enabled scientists to determine the biological foundations of different traits, such as the production of cannabinoids, fiber, seed quality, disease resistance, and abiotic stress tolerance.
However, most of the traits associated with industrial hemp have been proven to be polygenic and are strongly influenced by environmental interactions. Therefore, improvements in genomics are bridging the gap between research and practical genotype development through the implementation of genomic selection and editing. Overall, continued integration of multi-omics data, expanded germplasm resources, and functional validation approaches will be essential to fully realizing the potential of Cannabis sativa in sustainable fiber, seed, and phytochemical production systems.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/plants15132088/s1. Table S1. Overview of Cannabis sativa Genome Assemblies and Key Assembly Metrics.

Author Contributions

Conceptualization, K.C.D. and A.D.W.; writing—original draft preparation, K.C.D.; writing—review and editing, A.D.W., K.P.T., J.S.W., S.W., C.D. and S.C.; supervision, A.D.W.; funding acquisition, A.D.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research study was funded by a USDA-NIFA 1890 Capacity Building Grant (2024-38821-42046) and the USDA Agricultural Research Service (USDA-ARS) (58-6066-3-056).

Data Availability Statement

No new data were created or analyzed in this study.

Acknowledgments

We sincerely thank the reviewers for their thoughtful and constructive comments, which have greatly improved the clarity and overall quality of this manuscript. We gratefully acknowledge support from the USDA Agricultural Research Service (USDA-ARS), the USDA-NIFA 1890 Capacity Building Grant, and the USDA Hemp Seed Bank. We also acknowledge TETRA HEMP COMPANY, LLC, for their support. During the preparation of this manuscript, the authors used DeepAI to assist in the creation of the illustrative images included in figures presented in Figure 1 and Figure 3, and the authors made use of an AI-assisted tool (Claude Sonnet 4.6) for the purposes of grammar correction and readability improvement only. The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
CBDCannabidiol
CBDASCannabidiolic Acid Synthase
CBDACannabidiolic Acid
CBGACannabigerolic Acid
GSGenomic Selection
GWASGenome-Wide Association Study
LTRLong Terminal Repeat
MASMarker-Assisted Selection
MbMegabase
QTLQuantitative Trait Locus
SNPSingle Nucleotide Polymorphism
SSRSimple Sequence Repeat
THCTetrahydrocannabinol
THCASTetrahydrocannabinolic Acid Synthase
THCATetrahydrocannabinolic Acid
TRVTobacco Rattle Virus
VIGSVirus-Induced Gene Silencing

References

  1. Cortes, J.G.; Ryu, B.R.; Pauli, C.; Barroso, L.R.; Park, S.-H. Industrial applications of hemp fiber in Europe and evolving regulatory landscape. J. Nat. Fibers 2024, 21, 2435047. [Google Scholar] [CrossRef]
  2. Abernethy, A. Hemp Production and the 2018 Farm Bill; US Food and Drug Administration: Silver Spring, MD, USA, 2019. [Google Scholar]
  3. Rehman, M.; Fahad, S.; Du, G.; Cheng, X.; Yang, Y.; Tang, K.; Liu, L.; Liu, F.-H.; Deng, G. Evaluation of hemp (Cannabis sativa L.) as an industrial crop: A review. Environ. Sci. Pollut. Res. 2021, 28, 52832–52843. [Google Scholar] [CrossRef]
  4. Farinon, B.; Molinari, R.; Costantini, L.; Merendino, N. The seed of industrial hemp (Cannabis sativa L.): Nutritional quality and potential functionality for human health and nutrition. Nutrients 2020, 12, 1935. [Google Scholar] [CrossRef] [PubMed]
  5. Naeem, M.Y.; Corbo, F.; Crupi, P.; Clodoveo, M.L. Hemp: An alternative source for various industries and an emerging tool for functional food and pharmaceutical sectors. Processes 2023, 11, 718. [Google Scholar] [CrossRef]
  6. Osterberger, E.; Lohwasser, U.; Jovanovic, D.; Ruzicka, J.; Novak, J. The origin of the genus Cannabis. Genet. Resour. Crop Evol. 2022, 69, 1439–1449. [Google Scholar]
  7. McPartland, J.M.; Hegman, W.; Long, T. Cannabis in Asia: Its center of origin and early cultivation, based on a synthesis of subfossil pollen and archaeobotanical studies. Veg. Hist. Archaeobotany 2019, 28, 691–702. [Google Scholar] [CrossRef]
  8. Small, E. Evolution and classification of Cannabis sativa (marijuana, hemp) in relation to human utilization. Bot. Rev. 2015, 81, 189–294. [Google Scholar] [CrossRef]
  9. Clarke, R.C.; Merlin, M.D. Cannabis domestication, breeding history, present-day genetic diversity, and future prospects. Crit. Rev. Plant Sci. 2016, 35, 293–327. [Google Scholar] [CrossRef]
  10. Visković, J.; Zheljazkov, V.D.; Sikora, V.; Noller, J.; Latković, D.; Ocamb, C.M.; Koren, A. Industrial hemp (Cannabis sativa L.) agronomy and utilization: A review. Agronomy 2023, 13, 931. [Google Scholar] [CrossRef]
  11. Rampold, S.; Brym, Z.; Kandzer, M.S.; Baker, L.M. Hemp there it is: Examining consumers’ attitudes toward the revitalization of hemp as an agricultural commodity. J. Appl. Commun. 2021, 105, 2. [Google Scholar] [CrossRef]
  12. Patalee, B.; Jeong, H.; Mark, T. What makes hemp economically attractive? A case of Kentucky hemp farmers. Res. World Agric. Econ. 2024, 5, 32–44. [Google Scholar] [CrossRef]
  13. Steiner, J. Making Hemp a 21st Century Commodity in Oregon—And Beyond. Crops Soils 2021, 54, 24–31. [Google Scholar] [CrossRef]
  14. Salentijn, E.M.; Zhang, Q.; Amaducci, S.; Yang, M.; Trindade, L.M. New developments in fiber hemp (Cannabis sativa L.) breeding. Ind. Crops Prod. 2015, 68, 32–41. [Google Scholar] [CrossRef]
  15. Mead, A. Legal and regulatory issues governing Cannabis and Cannabis-derived products in the United States. Front. Plant Sci. 2019, 10, 697. [Google Scholar] [CrossRef] [PubMed]
  16. Weisheit, R.A. Cannabis cultivation in the United States. In World Wide Weed: Global Trends in Cannabis Cultivation and Its Control; Routledge: London, UK, 2011; pp. 145–162. [Google Scholar]
  17. Hillig, K.W. Genetic evidence for speciation in Cannabis (Cannabaceae). Genet. Resour. Crop Evol. 2005, 52, 161–180. [Google Scholar] [CrossRef]
  18. Small, E.; Cronquist, A. A practical and natural taxonomy for Cannabis. Taxon 1976, 405–435. [Google Scholar] [CrossRef]
  19. Wei, H.; Yang, Z.; Niyitanga, S.; Tao, A.; Xu, J.; Fang, P.; Lin, L.; Zhang, L.; Qi, J.; Ming, R. The reference genome of seed hemp (Cannabis sativa) provides new insights into fatty acid and vitamin E synthesis. Plant Commun. 2024, 5, 100718. [Google Scholar] [PubMed]
  20. Wang, S.; Zhong, X.; Cheng, Y.; Yu, Y.; Wan, J.; Liu, Q.; Shu, Y.; Wu, X.; Li, Y. Pan-Genome Analysis of Cannabis sativa: Insights on Genomic Diversity, Evolution, and Environment Adaption. Int. J. Mol. Sci. 2025, 26, 8354. [Google Scholar] [CrossRef] [PubMed]
  21. Stack, G.M.; Quade, M.A.; Wilkerson, D.G.; Monserrate, L.A.; Bentz, P.C.; Carey, S.B.; Grimwood, J.; Toth, J.A.; Crawford, S.; Harkess, A. Comparison of recombination rate, reference bias, and unique pangenomic haplotypes in Cannabis sativa using seven de Novo genome assemblies. Int. J. Mol. Sci. 2025, 26, 1165. [Google Scholar] [CrossRef] [PubMed]
  22. Eržen, M.; Čerenak, A.; Cesar, T.; Jakše, J. Population Structure of Genotypes and Genome-Wide Association Studies of Cannabinoids and Terpenes Synthesis in Hemp (Cannabis sativa L.). Plants 2026, 15, 202. [Google Scholar] [CrossRef] [PubMed]
  23. Mostafaei Dehnavi, M.; Damerum, A.; Taheri, S.; Ebadi, A.; Panahi, S.; Hodgin, G.; Brandley, B.; Salami, S.A.; Taylor, G. Population genomics of a natural Cannabis sativa L. collection from Iran identifies novel genetic loci for flowering time, morphology, sex and chemotyping. BMC Plant Biol. 2025, 25, 80. [Google Scholar] [CrossRef] [PubMed]
  24. de Ronne, M.; Lapierre, É.; Torkamaneh, D. Genetic insights into agronomic and morphological traits of drug-type Cannabis revealed by genome-wide association studies. Sci. Rep. 2024, 14, 9162. [Google Scholar] [CrossRef] [PubMed]
  25. Mansueto, L.; Tandayu, E.; Mieog, J.; Garcia-de Heer, L.; Das, R.; Burn, A.; Mauleon, R.; Kretzschmar, T. HASCH-A high-throughput amplicon-based SNP-platform for medicinal Cannabis and industrial hemp genotyping applications. BMC Genom. 2024, 25, 818. [Google Scholar]
  26. Ingvardsen, C.R.; Brinch-Pedersen, H. Challenges and potentials of new breeding techniques in Cannabis sativa. Front. Plant Sci. 2023, 14, 1154332. [Google Scholar] [CrossRef] [PubMed]
  27. Trubanová, N.; Isobe, S.; Shirasawa, K.; Watanabe, A.; Kelesidis, G.; Melzer, R.; Schilling, S. Genome-specific association study (GSAS) for exploration of variability in hemp (Cannabis sativa). Sci. Rep. 2025, 15, 8371. [Google Scholar] [CrossRef] [PubMed]
  28. Barcaccia, G.; Palumbo, F.; Scariolo, F.; Vannozzi, A.; Borin, M.; Bona, S. Potentials and challenges of genomics for breeding Cannabis cultivars. Front. Plant Sci. 2020, 11, 573299. [Google Scholar] [CrossRef] [PubMed]
  29. Strzelczyk, M.; Lochynska, M.; Chudy, M. Systematics and botanical characteristics of industrial hemp Cannabis sativa L. J. Nat. Fibers 2022, 19, 5804–5826. [Google Scholar]
  30. Manaia, J.P.; Manaia, A.T.; Rodriges, L. Industrial hemp fibers: An overview. Fibers 2019, 7, 106. [Google Scholar] [CrossRef]
  31. Zimniewska, M. Hemp fibre properties and processing target textile: A review. Materials 2022, 15, 1901. [Google Scholar] [CrossRef] [PubMed]
  32. Malabadi, R.B.; Kolkar, K.P.; Chalannavar, R.K. Industrial Cannabis sativa: Role of hemp (fiber type) in textile industries. World J. Biol. Pharm. Health Sci. 2023, 16, 001–014. [Google Scholar] [CrossRef]
  33. Tănase Apetroaei, V.; Pricop, E.M.; Istrati, D.I.; Vizireanu, C. Hemp seeds (Cannabis sativa L.) as a valuable source of natural ingredients for functional foods—A review. Molecules 2024, 29, 2097. [Google Scholar] [CrossRef] [PubMed]
  34. Rupasinghe, H.V.; Davis, A.; Kumar, S.K.; Murray, B.; Zheljazkov, V.D. Industrial hemp (Cannabis sativa subsp. sativa) as an emerging source for value-added functional food ingredients and nutraceuticals. Molecules 2020, 25, 4078. [Google Scholar] [CrossRef] [PubMed]
  35. Mygdalia, A.; Panoras, I.; Vazanelli, E.; Tsaliki, E. Nutritional and Industrial Insights into Hemp Seed Oil: A Value-Added Product of Cannabis sativa L. Seeds 2025, 4, 5. [Google Scholar] [CrossRef]
  36. Placido, D.F.; Lee, C.C. Potential of industrial hemp for phytoremediation of heavy metals. Plants 2022, 11, 595. [Google Scholar] [CrossRef] [PubMed]
  37. Moscariello, C.; Matassa, S.; Esposito, G.; Papirio, S. From residue to resource: The multifaceted environmental and bioeconomy potential of industrial hemp (Cannabis sativa L.). Resour. Conserv. Recycl. 2021, 175, 105864. [Google Scholar] [CrossRef]
  38. Kaur, G.; Kander, R. The sustainability of industrial hemp: A literature review of its economic, environmental, and social sustainability. Sustainability 2023, 15, 6457. [Google Scholar] [CrossRef]
  39. Raihan, A.; Bijoy, T.R. A review of the industrial use and global sustainability of Cannabis sativa. Glob. Sustain. Res. 2023, 2, 1–29. [Google Scholar] [CrossRef]
  40. Vergara, D.; Baker, H.; Clancy, K.; Keepers, K.G.; Mendieta, J.P.; Pauli, C.S.; Tittes, S.B.; White, K.H.; Kane, N.C. Genetic and genomic tools for Cannabis sativa. Crit. Rev. Plant Sci. 2016, 35, 364–377. [Google Scholar] [CrossRef]
  41. Hurgobin, B.; Tamiru-Oli, M.; Welling, M.T.; Doblin, M.S.; Bacic, A.; Whelan, J.; Lewsey, M.G. Recent advances in Cannabis sativa genomics research. New Phytol. 2021, 230, 73–89. [Google Scholar] [CrossRef] [PubMed]
  42. Chen, S.; Tang, H.; Jie, S.; Wang, W.; Tang, L.; Ruan, G. Research Progress in Genome Sequencing and Functional Gene Mining of Cannabis. Genom. Appl. Biol. 2024, 15, 223–234. [Google Scholar] [CrossRef]
  43. Pancaldi, F.; Salentijn, E.M.; Trindade, L.M. From fibers to flowering to metabolites: Unlocking hemp (Cannabis sativa) potential with the guidance of novel discoveries and tools. J. Exp. Bot. 2025, 76, 109–123. [Google Scholar] [PubMed]
  44. Sokuma, A.; Chatsungnoen, T.; Susawaengsup, C.; Tongkoom, K.; Bhuyar, P. Integrating Genetic and Biotechnological Approaches to Enhance High-CBD Cannabis sativa L. Cultivation for Medicinal Purposes. Mol. Biotechnol. 2026, 68, 2989–3031. [Google Scholar]
  45. Shiels, D.; Prestwich, B.D.; Koo, O.; Kanchiswamy, C.N.; O’Halloran, R.; Badmi, R. Hemp genome editing—Challenges and opportunities. Front. Genome Ed. 2022, 4, 823486. [Google Scholar] [CrossRef] [PubMed]
  46. Van Bakel, H.; Stout, J.M.; Cote, A.G.; Tallon, C.M.; Sharpe, A.G.; Hughes, T.R.; Page, J.E. The draft genome and transcriptome of Cannabis sativa. Genome Biol. 2011, 12, R102. [Google Scholar] [CrossRef] [PubMed]
  47. Faux, A.-M.; Berhin, A.; Dauguet, N.; Bertin, P. Sex chromosomes and quantitative sex expression in monoecious hemp (Cannabis sativa L.). Euphytica 2014, 196, 183–197. [Google Scholar]
  48. Hirata, K. Cytological basis of the sex determination in Cannabis sativa L. Jpn. J. Genet. 1924, 4, 198–201. [Google Scholar]
  49. Sakamoto, K.; Shimomura, K.; Komeda, Y.; Kamada, H.; Satoh, S. A male-associated DNA sequence in a dioecious plant, Cannabis sativa L. Plant Cell Physiol. 1995, 36, 1549–1554. [Google Scholar] [CrossRef] [PubMed]
  50. Sakamoto, K.; Akiyama, Y.; Fukui, K.; Kamada, H.; Satoh, S. Characterization; genome sizes and morphology of sex chromosomes in hemp (Cannabis sativa L.). Cytologia 1998, 63, 459–464. [Google Scholar] [CrossRef]
  51. Braich, S.; Baillie, R.C.; Spangenberg, G.C.; Cogan, N.O. A new and improved genome sequence of Cannabis sativa. Gigabyte 2020, 2020, 1–13. [Google Scholar] [CrossRef]
  52. Pisupati, R.; Vergara, D.; Kane, N.C. Diversity and evolution of the repetitive genomic content in Cannabis sativa. BMC Genom. 2018, 19, 156. [Google Scholar] [CrossRef]
  53. Ryu, B.-R.; Gim, G.-J.; Shin, Y.-R.; Kang, M.-J.; Kim, M.-J.; Kwon, T.-H.; Lim, Y.-S.; Park, S.-H.; Lim, J.-D. Chromosome-level haploid assembly of Cannabis sativa L. cv. Pink Pepper. Sci. Data 2024, 11, 1442. [Google Scholar] [CrossRef] [PubMed]
  54. Vergara, D.; Huscher, E.L.; Keepers, K.G.; Pisupati, R.; Schwabe, A.L.; McGlaughlin, M.E.; Kane, N.C. Genomic evidence that governmentally produced Cannabis sativa poorly represents genetic variation available in state markets. Front. Plant Sci. 2021, 12, 668315. [Google Scholar] [CrossRef] [PubMed]
  55. Sakamoto, K.; Abe, T.; Matsuyama, T.; Yoshida, S.; Ohmido, N.; Fukui, K.; Satoh, S. RAPD markers encoding retrotransposable elements are linked to the male sex in Cannabis sativa L. Genome 2005, 48, 931–936. [Google Scholar] [CrossRef] [PubMed]
  56. Gao, S.; Wang, B.; Xie, S.; Xu, X.; Zhang, J.; Pei, L.; Yu, Y.; Yang, W.; Zhang, Y. A high-quality reference genome of wild Cannabis sativa. Hortic. Res. 2020, 7, 73. [Google Scholar] [CrossRef] [PubMed]
  57. Grassa, C.J.; Wenger, J.P.; Dabney, C.; Poplawski, S.G.; Motley, S.T.; Michael, T.P.; Schwartz, C.; Weiblen, G.D. A complete Cannabis chromosome assembly and adaptive admixture for elevated cannabidiol (CBD) content. bioRxiv 2018. [Google Scholar] [CrossRef]
  58. Grassa, C.J.; Weiblen, G.D.; Wenger, J.P.; Dabney, C.; Poplawski, S.G.; Timothy Motley, S.; Michael, T.P.; Schwartz, C. A new Cannabis genome assembly associates elevated cannabidiol (CBD) with hemp introgressed into marijuana. New Phytol. 2021, 230, 1665–1679. [Google Scholar] [CrossRef] [PubMed]
  59. Ren, G.; Zhang, X.; Li, Y.; Ridout, K.; Serrano-Serrano, M.L.; Yang, Y.; Liu, A.; Ravikanth, G.; Nawaz, M.A.; Mumtaz, A.S. Large-scale whole-genome resequencing unravels the domestication history of Cannabis sativa. Sci. Adv. 2021, 7, eabg2286. [Google Scholar] [CrossRef] [PubMed]
  60. Houston, R.; Birck, M.; Hughes-Stamm, S.; Gangitano, D. Evaluation of a 13-loci STR multiplex system for Cannabis sativa genetic identification. Int. J. Leg. Med. 2016, 130, 635–647. [Google Scholar]
  61. Di Nunzio, M.; Agostini, V.; Alessandrini, F.; Barrot-Feixat, C.; Berti, A.; Bini, C.; Bottinelli, M.; Carnevali, E.; Corradini, B.; Fabbri, M. A Ge. FI–ISFG European collaborative study on DNA identification of Cannabis sativa samples using a 13-locus multiplex STR method. Forensic Sci. Int. 2021, 329, 111053. [Google Scholar] [CrossRef] [PubMed]
  62. Di Nunzio, M.; Pieri, M.; Gangitano, D.; Di Nunzio, C.; Tinto, N.; Niola, M.; Barrot-Feixat, C. Leveraging genetics to support forensic toxicology analysis: Demonstrating concordance among marijuana samples. J. Appl. Res. Med. Aromat. Plants 2024, 43, 100598. [Google Scholar] [CrossRef]
  63. Wei, H.; Yang, Z.; Zhuang, L.; Pan, X.; Jia, H.; Jiang, S.; Li, Q.; Xu, J.; Tao, A.; Fang, P. A haplotype-resolved genome assembly of seed hemp (Cannabis sativa) and analysis of Y chromosome divergence from the X. Hortic. Res. 2026, 13, uhaf268. [Google Scholar] [CrossRef] [PubMed]
  64. Xu, J.; Kong, L.; Wu, W.; Li, C.; Ren, W.; Ma, W. Complete chloroplast genome sequence data of Cannabis sativa. Data Brief 2025, 60, 111632. [Google Scholar] [CrossRef] [PubMed]
  65. Deng, G.; Yang, M.; Zhao, K.; Yang, Y.; Huang, X.; Cheng, X. The complete chloroplast genome of Cannabis sativa variety Yunma 7. Mitochondrial DNA Part B 2021, 6, 531–532. [Google Scholar] [CrossRef] [PubMed]
  66. Vergara, D.; White, K.H.; Keepers, K.G.; Kane, N.C. The complete chloroplast genomes of Cannabis sativa and Humulus lupulus. Mitochondrial DNA Part A 2016, 27, 3793–3794. [Google Scholar]
  67. Di Nunzio, M.; Barrot-Feixat, C.; Gangitano, D. Characterization and evaluation of nine Cannabis sativa chloroplast SNP markers for crop type determination and biogeographical origin on European samples. Forensic Sci. Int. Genet. 2024, 68, 102971. [Google Scholar] [CrossRef] [PubMed]
  68. Houston, R.; Birck, M.; LaRue, B.; Hughes-Stamm, S.; Gangitano, D. Nuclear, chloroplast, and mitochondrial data of a US Cannabis DNA database. Int. J. Leg. Med. 2018, 132, 713–725. [Google Scholar] [CrossRef]
  69. Khabbazi, S.D. Comprehensive analysis of the chloroplast genome in Cannabis sativa L. reveals variations in simple sequence repeats among cultivars. Sci. Rep. 2026, 16, 206. [Google Scholar] [CrossRef] [PubMed]
  70. Divashuk, M.G.; Alexandrov, O.S.; Razumova, O.V.; Kirov, I.V.; Karlov, G.I. Molecular cytogenetic characterization of the dioecious Cannabis sativa with an XY chromosome sex determination system. PLoS ONE 2014, 9, e85118. [Google Scholar] [CrossRef] [PubMed]
  71. Razumova, O.V.; Divashuk, M.G.; Alexandrov, O.S.; Karlov, G.I. GISH painting of the Y chromosomes suggests advanced phases of sex chromosome evolution in three dioecious Cannabaceae species (Humulus lupulus, H. japonicus, and Cannabis sativa). Protoplasma 2023, 260, 249–256. [Google Scholar] [PubMed]
  72. Mandolino, G.; Carboni, A. Potential of marker-assisted selection in hemp genetic improvement. Euphytica 2004, 140, 107–120. [Google Scholar] [CrossRef]
  73. Faux, A.-M.; Draye, X.; Lambert, R.; d’Andrimont, R.; Raulier, P.; Bertin, P. The relationship of stem and seed yields to flowering phenology and sex expression in monoecious hemp (Cannabis sativa L.). Eur. J. Agron. 2013, 47, 11–22. [Google Scholar] [CrossRef]
  74. Toscani, M.; Malik, A.; Riera-Begue, A.; Dowling, C.; Rougemont, Q.; Rodríguez de la Vega, R.C.; Giraud, T.; Schilling, S.; Melzer, R. An ancient X chromosomal region harbours three genes potentially controlling sex determination in Cannabis sativa. bioRxiv 2025. [Google Scholar] [CrossRef]
  75. Petit, J.; Salentijn, E.M.; Paulo, M.-J.; Denneboom, C.; Trindade, L.M. Genetic architecture of flowering time and sex determination in hemp (Cannabis sativa L.): A genome-wide association study. Front. Plant Sci. 2020, 11, 569958. [Google Scholar] [CrossRef] [PubMed]
  76. Shi, J.; Toscani, M.; Dowling, C.A.; Schilling, S.; Melzer, R. Identification of genes associated with sex expression and sex determination in hemp (Cannabis sativa L.). J. Exp. Bot. 2025, 76, 175–190. [Google Scholar] [PubMed]
  77. Spitzer-Rimon, B.; Duchin, S.; Bernstein, N.; Kamenetsky, R. Architecture and florogenesis in female Cannabis sativa plants. Front. Plant Sci. 2019, 10, 447153. [Google Scholar] [CrossRef] [PubMed]
  78. Tamang, K.R.; Mawhinney, T.P.; Carson, C.B.; Asiamah, J.Y.; Mahdi, S.H.; Reed, E.; Sharma, S.; Koirala, P.; Patel, J.; Mensah, C.A. Phenotypic and genetic characterization of sixteen grain and dual-type industrial hemp varieties (Cannabis sativa L.) for agronomic and yield component traits. Front. Plant Sci. 2025, 16, 1632346. [Google Scholar] [CrossRef] [PubMed]
  79. Steel, L.; Welling, M.; Ristevski, N.; Johnson, K.; Gendall, A. Comparative genomics of flowering behavior in Cannabis sativa. Front. Plant Sci. 2023, 14, 1227898. [Google Scholar] [CrossRef] [PubMed]
  80. Naim-Feil, E.; Elkins, A.C.; Malmberg, M.M.; Ram, D.; Tran, J.; Spangenberg, G.C.; Rochfort, S.J.; Cogan, N.O. The Cannabis plant as a complex system: Interrelationships between cannabinoid compositions, morphological, physiological and phenological traits. Plants 2023, 12, 493. [Google Scholar] [CrossRef] [PubMed]
  81. Shi, J.; Schilling, S.; Melzer, R. Morphological and genetic analysis of inflorescence and flower development in hemp (Cannabis sativa L.). bioRxiv 2024. [Google Scholar] [CrossRef]
  82. Hesami, M.; Pepe, M.; Jones, A.M.P. Morphological characterization of Cannabis sativa L. throughout its complete life cycle. Plants 2023, 12, 3646. [Google Scholar] [CrossRef] [PubMed]
  83. Punja, Z.K.; Holmes, J.E. Hermaphroditism in Marijuana (Cannabis sativa L.) Inflorescences—Impact on Floral Morphology, Seed Formation, Progeny Sex Ratios, and Genetic Variation. Front. Plant Sci. 2020, 11, 718. [Google Scholar] [CrossRef] [PubMed]
  84. Hall, J.; Bhattarai, S.P.; Midmore, D.J. Review of Flowering Control in Industrial Hemp. J. Nat. Fibers 2012, 9, 23–36. [Google Scholar] [CrossRef]
  85. Prentout, D.; El Aoudati, S.; Mathis, F.; Marais, G.A.; Henri, H. Promising high fidelity genetic markers for sexing Cannabis sativa seedlings. G3 Genes Genomes Genet. 2025, 15, jkaf077. [Google Scholar] [CrossRef]
  86. Riera-Begue, A.; Toscani, M.; Malik, A.; Dowling, C.A.; Schilling, S.; Melzer, R. A simple and reliable PCR-based method to differentiate between XX and XY sex genotypes in Cannabis sativa. Planta 2025, 262, 87. [Google Scholar] [CrossRef] [PubMed]
  87. Cale, N.L.; Swiderek, R.E.; Walker, P.L.; Ziegler, D.J.; Castillo, B.; Robertson, S.M.; Wilkins, O.; Belmonte, M.F. Dual RNA sequencing reveals the transcriptomic and cellular response of Cannabis sativa to infection by the fungal pathogen Sclerotinia sclerotiorum. Sci. Rep. 2026, 16, 17739. [Google Scholar] [CrossRef] [PubMed]
  88. Hanuš, L.O.; Meyer, S.M.; Muñoz, E.; Taglialatela-Scafati, O.; Appendino, G. Phytocannabinoids: A unified critical inventory. Nat. Prod. Rep. 2016, 33, 1357–1392. [Google Scholar] [CrossRef] [PubMed]
  89. Taura, F.; Sirikantaramas, S.; Shoyama, Y.; Morimoto, S. Phytocannabinoids in Cannabis sativa: Recent studies on biosynthetic enzymes. In Cannabinoids in Nature and Medicine; Wiley-VHCA AG: Zurich, Switzerland, 2009; pp. 51–65. [Google Scholar]
  90. De Meijer, E.P.; Bagatta, M.; Carboni, A.; Crucitti, P.; Moliterni, V.C.; Ranalli, P.; Mandolino, G. The inheritance of chemical phenotype in Cannabis sativa L. Genetics 2003, 163, 335–346. [Google Scholar] [CrossRef] [PubMed]
  91. Johnson, M.S.; Wallace, J.G. Genomic and chemical diversity of commercially available industrial hemp accessions. bioRxiv 2021. [Google Scholar] [CrossRef]
  92. Padilla-González, G.F.; Rosselli, A.; Sadgrove, N.J.; Cui, M.; Simmonds, M.S. Mining the chemical diversity of the hemp seed (Cannabis sativa L.) metabolome: Discovery of a new molecular family widely distributed across hemp. Front. Plant Sci. 2023, 14, 1114398. [Google Scholar] [CrossRef] [PubMed]
  93. Sng, B.J.R.; Jeong, Y.J.; Leong, S.H.; Jeong, J.C.; Lee, J.; Rajani, S.; Kim, C.Y.; Jang, I.-C. Genome-wide identification of cannabinoid biosynthesis genes in non-drug type Cannabis (Cannabis sativa L.) cultivar. J. Cannabis Res. 2024, 6, 35. [Google Scholar] [CrossRef] [PubMed]
  94. Kim, A.L.; Yun, Y.J.; Choi, H.W.; Hong, C.-H.; Shim, H.J.; Lee, J.H.; Kim, Y.-C. Profiling cannabinoid contents and expression levels of corresponding biosynthetic genes in commercial Cannabis (Cannabis sativa L.) cultivars. Plants 2022, 11, 3088. [Google Scholar] [CrossRef] [PubMed]
  95. Docimo, T.; Caruso, I.; Ponzoni, E.; Mattana, M.; Galasso, I. Molecular characterization of edestin gene family in Cannabis sativa L. Plant Physiol. Biochem. 2014, 84, 142–148. [Google Scholar] [CrossRef] [PubMed]
  96. Ponzoni, E.; Brambilla, I.M.; Galasso, I. Genome-wide identification and organization of seed storage protein genes of Cannabis sativa. Biol. Plant. 2018, 62, 693–702. [Google Scholar] [CrossRef]
  97. Yan, B.; Chang, C.; Sui, Y.; Zheng, N.; Fang, Y.; Zhang, Y.; Zhang, M.; He, D.; Zhang, L. Transcriptomic and Lipidomic Analysis Reveals the Regulatory Network of Lipid Metabolism in Cannabis sativa. Foods 2025, 14, 2809. [Google Scholar] [CrossRef] [PubMed]
  98. Sipahi, H.; Haiden, S.; Berkowitz, G. Genome-wide analysis of cellulose synthase (CesA) and cellulose synthase-like (Csl) proteins in Cannabis sativa L. PeerJ 2024, 12, e17821. [Google Scholar] [CrossRef] [PubMed]
  99. Behr, M.; Sergeant, K.; Leclercq, C.C.; Planchon, S.; Guignard, C.; Lenouvel, A.; Renaut, J.; Hausman, J.-F.; Lutts, S.; Guerriero, G. Insights into the molecular regulation of monolignol-derived product biosynthesis in the growing hemp hypocotyl. BMC Plant Biol. 2018, 18. [Google Scholar] [CrossRef] [PubMed]
  100. Gao, C.; Cheng, C.; Zhao, L.; Yu, Y.; Tang, Q.; Xin, P.; Liu, T.; Yan, Z.; Guo, Y.; Zang, G. Genome-wide expression profiles of hemp (Cannabis sativa L.) in response to drought stress. Int. J. Genom. 2018, 2018, 3057272. [Google Scholar] [CrossRef]
  101. Yin, M.; Pan, G.; Tao, J.; Zhao, L.; Li, Z.; Jiang, H.; Tang, H.; Chang, L.; Li, J.; Chen, A. Identification of MYB genes reveals their potential functions in cadmium stress response and the regulation of cannabinoid biosynthesis in hemp. Ind. Crops Prod. 2022, 180, 114607. [Google Scholar] [CrossRef]
  102. Wang, D.; Liang, S.; Che, Y.; Qi, G.; Jiang, Z.; Yang, W.; Zhao, H.; Chen, J.; Zhu, A.; Gao, G. Genome-Wide Identification, Phylogenetic Analysis and Salt-Responsive Expression Profiling of the MYB Transcription Factor Family in Cannabis sativa L. During Seed Germination. Int. J. Mol. Sci. 2026, 27, 1087. [Google Scholar] [CrossRef] [PubMed]
  103. Sirangelo, T.M.; Ludlow, R.A.; Spadafora, N.D. Molecular Mechanisms Underlying Potential Pathogen Resistance in Cannabis sativa. Plants 2023, 12, 2764. [Google Scholar] [CrossRef] [PubMed]
  104. Apicella, P.V.; Sands, L.B.; Ma, Y.; Berkowitz, G.A. Delineating genetic regulation of cannabinoid biosynthesis during female flower development in Cannabis sativa. Plant Direct 2022, 6, e412. [Google Scholar] [CrossRef] [PubMed]
  105. Baek, S.-C.; Jeon, S.-Y.; Byun, B.-H.; Kim, D.-H.; Yu, G.-R.; Kim, H.; Lim, D.-W. CsCBDAS2-Driven Enhancement of Cannabinoid Biosynthetic Genes Using a High-Efficiency Transient Transformation System in Cannabis sativa ‘Cheungsam’. Plants 2025, 14, 1460. [Google Scholar] [CrossRef] [PubMed]
  106. Li, M.; Cai, S.; Bai, X.; Ouyang, Y.; Chen, L.; Zhang, Z.; Tang, W.; Xu, M.; Ao, D.; Li, Y. Genome-wide inventory and functional characterization of cannabinoid oxidocyclases (CBSs) in Cannabis sativa. Hortic. Res. 2026, 13, uhag023. [Google Scholar] [CrossRef] [PubMed]
  107. Laverty, K.U.; Stout, J.M.; Sullivan, M.J.; Shah, H.; Gill, N.; Holbrook, L.; Deikus, G.; Sebra, R.; Hughes, T.R.; Page, J.E.; et al. A physical and genetic map of Cannabis sativa identifies extensive rearrangements at the THC/CBD acid synthase loci. Genome Res. 2019, 29, 146–156. [Google Scholar] [CrossRef] [PubMed]
  108. McKernan, K.J.; Helbert, Y.; Kane, L.T.; Ebling, H.; Zhang, L.; Liu, B.; Eaton, Z.; McLaughlin, S.; Kingan, S.; Baybayan, P.; et al. Sequence and annotation of 42 Cannabis genomes reveals extensive copy number variation in cannabinoid synthesis and pathogen resistance genes. bioRxiv 2020. [Google Scholar] [CrossRef]
  109. Vergara, D.; Huscher, E.L.; Keepers, K.G.; Givens, R.M.; Cizek, C.G.; Torres, A.; Gaudino, R.; Kane, N.C. Gene copy number is associated with phytochemistry in Cannabis sativa. AoB Plants 2019, 11, plz074. [Google Scholar] [CrossRef] [PubMed]
  110. Lim, K.J.; Lim, Y.P.; Hartono, Y.D.; Go, M.K.; Fan, H.; Yew, W.S. Biosynthesis of nature-inspired unnatural cannabinoids. Molecules 2021, 26, 2914. [Google Scholar] [CrossRef] [PubMed]
  111. Sands, L.B.; Haiden, S.R.; Ma, Y.; Berkowitz, G.A. Hormonal control of promoter activities of Cannabis sativa prenyltransferase 1 and 4 and salicylic acid mediated regulation of cannabinoid biosynthesis. Sci. Rep. 2023, 13, 8620. [Google Scholar] [CrossRef] [PubMed]
  112. Fayaz, M.; Angmo, T.; Katoch, K.; Majeed, A.; Kundan, M.; Wajid, M.A.; Pal, K.; Misra, P. The promoter regions of CBDAS and PT genes of cannabinoid biosynthesis in Cannabis sativa respond to phytohormones and stress-related signals. Planta 2025, 261, 135. [Google Scholar] [CrossRef] [PubMed]
  113. Gao, M.; Chen, S.; Kong, L.; Wang, L.; Meng, X.; Xie, Z.; Xu, Z.; Mi, Y. Genome-wide identification and characterization of CONSTANS-like transcription factors reveal that three CsCOLs regulate the cannabinoid biosynthesis in Cannabis. Plant Physiol. Biochem. 2025, 224, 109942. [Google Scholar] [CrossRef] [PubMed]
  114. Guerriero, G.; Behr, M.; Legay, S.; Mangeot-Peter, L.; Zorzan, S.; Ghoniem, M.; Hausman, J.-F. Transcriptomic profiling of hemp bast fibres at different developmental stages. Sci. Rep. 2017, 7, 4961. [Google Scholar] [CrossRef] [PubMed]
  115. Zhong, R.; Cui, D.; Ye, Z.H. Secondary cell wall biosynthesis. New Phytol. 2019, 221, 1703–1723. [Google Scholar] [CrossRef] [PubMed]
  116. Petit, J.; Salentijn, E.M.; Paulo, M.-J.; Denneboom, C.; Van Loo, E.N.; Trindade, L.M. Elucidating the genetic architecture of fiber quality in hemp (Cannabis sativa L.) using a genome-wide association study. Front. Genet. 2020, 11, 566314. [Google Scholar] [CrossRef] [PubMed]
  117. Berni, R.; Hausman, J.-F.; Lutts, S.; Guerriero, G. Histochemical and gene expression changes in Cannabis sativa hypocotyls exposed to increasing concentrations of cadmium and zinc. Plant Stress 2024, 14, 100668. [Google Scholar] [CrossRef]
  118. Irakli, M.; Tsaliki, E.; Kalivas, A.; Kleisiaris, F.; Sarrou, E.; Cook, C.M. Effect οf genotype and growing year on the nutritional, phytochemical, and antioxidant properties of industrial hemp (Cannabis sativa L.) seeds. Antioxidants 2019, 8, 491. [Google Scholar] [CrossRef] [PubMed]
  119. Woods, P.; Campbell, B.J.; Nicodemus, T.J.; Cahoon, E.B.; Mullen, J.L.; McKay, J.K. Quantitative trait loci controlling agronomic and biochemical traits in Cannabis sativa. Genetics 2021, 219, iyab099. [Google Scholar] [CrossRef] [PubMed]
  120. Galasso, I.; Russo, R.; Mapelli, S.; Ponzoni, E.; Brambilla, I.M.; Battelli, G.; Reggiani, R. Variability in seed traits in a collection of Cannabis sativa L. genotypes. Front. Plant Sci. 2016, 7, 688. [Google Scholar] [CrossRef] [PubMed]
  121. Huang, H.; Cui, T.; Zhang, L.; Yang, Q.; Yang, Y.; Xie, K.; Fan, C.; Zhou, Y. Modifications of fatty acid profile through targeted mutation at BnaFAD2 gene with CRISPR/Cas9-mediated gene editing in Brassica napus. Theor. Appl. Genet. 2020, 133, 2401–2411. [Google Scholar] [CrossRef] [PubMed]
  122. Park, M.-E.; Yun, J.-Y.; Kim, H.U. C-to-G base editing enhances oleic acid production by generating novel alleles of FATTY ACID DESATURASE 2 in plants. Front. Plant Sci. 2021, 12, 748529. [Google Scholar] [CrossRef] [PubMed]
  123. Du, H.; Huang, M.; Hu, J.; Li, J. Modification of the fatty acid composition in Arabidopsis and maize seeds using a stearoyl-acyl carrier protein desaturase-1 (ZmSAD1) gene. BMC Plant Biol. 2016, 16, 137. [Google Scholar] [CrossRef] [PubMed]
  124. Manansala-Siazon, S.E.; Siazon, P.M.; Tandayu, E.; Garcia-de Heer, L.; Burn, A.; Guo, Q.; Mieog, J.C.; Kretzschmar, T. Seed the Difference: QTL Mapping Reveals Several Major Loci for Seed Size in Cannabis sativa L. Plants 2025, 14, 3853. [Google Scholar] [CrossRef] [PubMed]
  125. Schluttenhofer, C.; Yuan, L. Challenges towards revitalizing hemp: A multifaceted crop. Trends Plant Sci. 2017, 22, 917–929. [Google Scholar] [CrossRef] [PubMed]
  126. Fike, J.H.; Darby, H.; Johnson, B.L.; Smart, L.; Williams, D.W. Industrial hemp in the USA: A brief synopsis. In Sustainable Agriculture Reviews 42: Hemp Production and Applications; Springer: Berlin, Germany, 2020; pp. 89–109. [Google Scholar]
  127. Maity, A.; Lamichaney, A.; Joshi, D.C.; Bajwa, A.; Subramanian, N.; Walsh, M.; Bagavathiannan, M. Seed shattering: A trait of evolutionary importance in plants. Front. Plant Sci. 2021, 12, 657773. [Google Scholar] [CrossRef] [PubMed]
  128. Adamczyk, F.; Sieracka, D.; Zaborowicz, M. Review of Seed Hemp (Cannabis sativa L.) Harvesting Techniques and the Challenges of Harvesting Technologies for This Crop. Agronomy 2026, 16, 677. [Google Scholar] [CrossRef]
  129. Schultz, C.J.; Lim, W.L.; Khor, S.F.; Neumann, K.A.; Schulz, J.M.; Ansari, O.; Skewes, M.A.; Burton, R.A. Consumer and health-related traits of seed from selected commercial and breeding lines of industrial hemp, Cannabis sativa L. J. Agric. Food Res. 2020, 2, 100025. [Google Scholar] [CrossRef]
  130. Smart, L.B.; Toth, J.A.; Stack, G.M.; Monserrate, L.A.; Smart, C.D. Breeding of hemp (Cannabis sativa). Plant Breed. Rev. 2022, 46, 239–288. [Google Scholar] [CrossRef]
  131. Szarka, D.; Tymon, L.; Amsden, B.; Dixon, E.; Judy, J.; Gauthier, N. First report of powdery mildew caused by Golovinomyces spadiceus on industrial hemp (Cannabis sativa) in Kentucky. Plant Dis. 2019, 103, 1773. [Google Scholar] [CrossRef]
  132. Farinas, C.; Peduto Hand, F. First report of Golovinomyces spadiceus causing powdery mildew on industrial hemp (Cannabis sativa) in Ohio. Plant Dis. 2020, 104, 2727. [Google Scholar] [CrossRef]
  133. Chen, R.; Gajendiran, K.; Wulff, B.B.H. R we there yet? Advances in cloning resistance genes for engineering immunity in crop plants. Curr. Opin. Plant Biol. 2024, 77, 102489. [Google Scholar] [CrossRef] [PubMed]
  134. Dinglasan, E.; Periyannan, S.; Hickey, L.T. Harnessing adult-plant resistance genes to deploy durable disease resistance in crops. Essays Biochem. 2022, 66, 571–580. [Google Scholar] [CrossRef] [PubMed]
  135. Mihalyov, P.; Garfinkel, A. Discovery and genetic mapping of PM1, a powdery mildew resistance gene in Cannabis sativa L. Front. Agron. 2021, 3, 720215. [Google Scholar] [CrossRef]
  136. Jones, J.D.; Dangl, J.L. The plant immune system. Nature 2006, 444, 323–329. [Google Scholar] [CrossRef] [PubMed]
  137. Seifi, S.; Leckie, K.M.; Giles, I.; O’Brien, T.; MacKenzie, J.O.; Todesco, M.; Rieseberg, L.H.; Baute, G.J.; Celedon, J.M. Mapping and characterization of a novel powdery mildew resistance locus (PM2) in Cannabis sativa L. Front. Plant Sci. 2025, 16, 1543229. [Google Scholar] [CrossRef]
  138. Gill, A.R.; Loveys, B.R.; Cavagnaro, T.R.; Burton, R.A. The potential of industrial hemp (Cannabis sativa L.) as an emerging drought resistant fibre crop. Plant Soil 2023, 493, 7–16. [Google Scholar] [CrossRef]
  139. Jiang, Y.; Sun, Y.; Zheng, D.; Han, C.; Cao, K.; Xu, L.; Liu, S.; Cao, Y.; Feng, N. Physiological and transcriptome analyses for assessing the effects of exogenous uniconazole on drought tolerance in hemp (Cannabis sativa L.). Sci. Rep. 2021, 11, 14476. [Google Scholar] [CrossRef] [PubMed]
  140. Shao, Q.; Wang, X.; Wang, X.; Yang, S.; Cao, J.; Zhao, N.; Wang, Q.; Xu, H.; Liu, L. Identification and Expression Analysis of the Betv1 Gene Family in Industrial Hemp. Trop. Plant Biol. 2026, 19, 4. [Google Scholar] [CrossRef]
  141. Cao, K.; Sun, Y.; Han, C.; Zhang, X.; Zhao, Y.; Jiang, Y.; Jiang, Y.; Sun, X.; Guo, Y.; Wang, X. The transcriptome of saline-alkaline resistant industrial hemp (Cannabis sativa L.) exposed to NaHCO3 stress. Ind. Crops Prod. 2021, 170, 113766. [Google Scholar] [CrossRef]
  142. Jiang, Z.; Wang, D.; Che, Y.; Amanullah, S.; Zhang, L.; Jie, S.; Yang, W.; Wang, M.; Wang, L.; Qi, G. Transcriptomic sequencing and expression verification of identified genes modulating the alkali stress tolerance and endogenous photosynthetic activities of industrial hemp plant. PLoS ONE 2025, 20, e0326434. [Google Scholar] [CrossRef] [PubMed]
  143. Simiyu, E.S.; Che, Y.; Qia, G.C.; Jiang, Z.; Zhang, L.; Yang, W.; Wanjala, B.; Abubakar, A.S.; Gao, G.; Zhu, A. Whole-genome analysis reveals the tandem duplication of Cannabis sativa L. GATA and their stress-responsive expression during seed germination. BMC Plant Biol. 2006, 26, 38. [Google Scholar]
  144. Zhang, Y.; Zhang, M.; Fang, Y.; Zheng, N.; Yan, B.; Sui, Y.; Zhang, L. Genome-Wide Identification of the TIFY Family in Cannabis sativa L. and Its Potential Functional Analysis in Response to Alkaline Stress and in Cannabinoid Metabolism. Int. J. Mol. Sci. 2025, 26, 8171. [Google Scholar] [CrossRef] [PubMed]
  145. Talei, D.; Shams, A.; Khayam Nekouei, M. A Comprehensive Review of Cannabis as a Crucial Pharmaceutical Plant and Its Efficient Propagation Methods. ACS Agric. Sci. Technol. 2025, 5, 1191–1214. [Google Scholar] [CrossRef]
  146. Zhang, X.; Xu, G.; Cheng, C.; Lei, L.; Sun, J.; Xu, Y.; Deng, C.; Dai, Z.; Yang, Z.; Chen, X. Establishment of an Agrobacterium-mediated genetic transformation and CRISPR/Cas9-mediated targeted mutagenesis in Hemp (Cannabis Sativa L.). Plant Biotechnol. J. 2021, 19, 1979–1987. [Google Scholar] [CrossRef] [PubMed]
  147. Yang, Z.; Wei, H.; Pan, X.; Chen, J.; Kang, H.; Lin, Y.; Jiang, S.; Feng, X.; Tao, A.; Qi, J. Genome-Wide Identification and Expression Analysis of Genes Related to Cannabinoid Biosynthesis Pathways in Seed Hemp. Trop. Plant Biol. 2025, 18, 1–13. [Google Scholar] [CrossRef]
  148. Cull, A.; Joly, D.L. Development and validation of a minimal SNP genotyping panel for the differentiation of Cannabis sativa cultivars. BMC Genom. 2025, 26, 83. [Google Scholar] [CrossRef]
  149. Prentout, D.; Razumova, O.; Rhoné, B.; Badouin, H.; Henri, H.; Feng, C.; Käfer, J.; Karlov, G.; Marais, G.A.B. An efficient RNA-seq-based segregation analysis identifies the sex chromosomes of Cannabis sativa. Genome Res. 2020, 30, 164–172. [Google Scholar] [CrossRef] [PubMed]
  150. Adal, A.M.; Doshi, K.; Holbrook, L.; Mahmoud, S.S. Comparative RNA-Seq analysis reveals genes associated with masculinization in female Cannabis sativa. Planta 2021, 253, 17. [Google Scholar] [CrossRef] [PubMed]
  151. Stelmach-Wityk, K.; Szymonik, K.; Jones, A.M.P.; Grzebelus, E. A novel protocol for protoplast isolation, transfection, and culture in Cannabis sativa L. BMC Plant Biol. 2025, 25, 1629. [Google Scholar] [CrossRef] [PubMed]
  152. Schachtsiek, J.; Hussain, T.; Azzouhri, K.; Kayser, O.; Stehle, F. Virus-induced gene silencing (VIGS) in Cannabis sativa L. Plant Methods 2019, 15, 157. [Google Scholar] [CrossRef] [PubMed]
  153. Alter, H.; Peer, R.; Dombrovsky, A.; Flaishman, M.; Spitzer-Rimon, B. Tobacco Rattle Virus as a Tool for Rapid Reverse-Genetics Screens and Analysis of Gene Function in Cannabis sativa L. Plants 2022, 11, 327. [Google Scholar] [CrossRef] [PubMed]
  154. Holmes, J.E.; Punja, Z.K. Agrobacterium mediated transformation of THC-containing Cannabis sativa L. yields a high frequency of transgenic calli expressing bialaphos resistance and non-expressor of PR1 (NPR1) genes. Botany 2023, 101, 498–512. [Google Scholar] [CrossRef]
  155. Deguchi, M.; Bogush, D.; Weeden, H.; Spuhler, Z.; Potlakayala, S.; Kondo, T.; Zhang, Z.J.; Rudrabhatla, S. Establishment and optimization of a hemp (Cannabis sativa L.) agroinfiltration system for gene expression and silencing studies. Sci. Rep. 2020, 10, 3504. [Google Scholar] [CrossRef] [PubMed]
  156. Galán-Ávila, A.; Gramazio, P.; Ron, M.; Prohens, J.; Herraiz, F.J. A novel and rapid method for Agrobacterium-mediated production of stably transformed Cannabis sativa L. plants. Ind. Crops Prod. 2021, 170, 113691. [Google Scholar] [CrossRef]
  157. Bennur, P.L.; O’Brien, M.; Fernando, S.C.; Doblin, M.S. Genotype-independent de novo regeneration protocol in Cannabis sativa L. through direct organogenesis from cotyledonary nodes. Plant Methods 2025, 21, 145. [Google Scholar] [CrossRef] [PubMed]
  158. Sorokin, A.; Kovalchuk, I. Development of efficient and scalable regeneration tissue culture method for Cannabis sativa. Plant Sci. 2025, 350, 112296. [Google Scholar] [CrossRef] [PubMed]
  159. Monthony, A.S.; Page, S.R.; Hesami, M.; Jones, A.M.P. The past, present and future of Cannabis sativa tissue culture. Plants 2021, 10, 185. [Google Scholar] [CrossRef] [PubMed]
  160. Barbosa-Xavier, K.; Pedrosa-Silva, F.; Almeida-Silva, F.; Venancio, T.M. Cannabis Expression Atlas: A comprehensive resource for integrative analysis of Cannabis sativa L. gene expression. Physiol. Plant. 2024, 176, e70010. [Google Scholar] [CrossRef] [PubMed]
  161. Welling, M.T.; Liu, L.; Kretzschmar, T.; Mauleon, R.; Ansari, O.; King, G.J. An extreme-phenotype genome-wide association study identifies candidate cannabinoid pathway genes in Cannabis. Sci. Rep. 2020, 10, 18643. [Google Scholar] [CrossRef] [PubMed]
  162. Sahab, S.; Runa, F.; Ponnampalam, M.; Kay, P.T.; Jaya, E.; Viduka, K.; Panter, S.; Tibbits, J.; Hayden, M.J. Efficient multi-allelic genome editing via CRISPR–Cas9 ribonucleoprotein-based delivery to Brassica napus mesophyll protoplasts. Front. Plant Sci. 2024, 15, 1397632. [Google Scholar] [CrossRef] [PubMed]
  163. Hua, K.; Tao, X.; Yuan, F.; Wang, D.; Zhu, J.-K. Precise A·T to G·C base editing in the rice genome. Mol. Plant 2018, 11, 627–630. [Google Scholar] [CrossRef] [PubMed]
  164. Lin, Q.; Zong, Y.; Xue, C.; Wang, S.; Jin, S.; Zhu, Z.; Wang, Y.; Anzalone, A.V.; Raguram, A.; Doman, J.L. Prime genome editing in rice and wheat. Nat. Biotechnol. 2020, 38, 582–585. [Google Scholar] [CrossRef] [PubMed]
  165. Simiyu, D.C.; Jang, J.H.; Lee, O.R. Understanding Cannabis sativa L.: Current status of propagation, use, legalization, and haploid-inducer-mediated genetic engineering. Plants 2022, 11, 1236. [Google Scholar] [CrossRef] [PubMed]
  166. Xing, H.-L.; Dong, L.; Wang, Z.-P.; Zhang, H.-Y.; Han, C.-Y.; Liu, B.; Wang, X.-C.; Chen, Q.-J. A CRISPR/Cas9 toolkit for multiplex genome editing in plants. BMC Plant Biol. 2014, 14, 327. [Google Scholar] [CrossRef] [PubMed]
  167. Wang, M.; Mao, Y.; Lu, Y.; Tao, X.; Zhu, J.-k. Multiplex gene editing in rice using the CRISPR-Cpf1 system. Mol. Plant 2017, 10, 1011–1013. [Google Scholar] [CrossRef] [PubMed]
  168. Kang, B.-C.; Yun, J.-Y.; Kim, S.-T.; Shin, Y.; Ryu, J.; Choi, M.; Woo, J.W.; Kim, J.-S. Precision genome engineering through adenine base editing in plants. Nat. Plants 2018, 4, 427–431. [Google Scholar] [CrossRef] [PubMed]
  169. Vats, S.; Kumar, J.; Sonah, H.; Zhang, F.; Deshmukh, R. Prime editing in plants: Prospects and challenges. J. Exp. Bot. 2024, 75, 5344–5356. [Google Scholar] [CrossRef] [PubMed]
  170. Park, J.; Choi, S.; Park, S.; Yoon, J.; Park, A.Y.; Choe, S. DNA-free genome editing via ribonucleoprotein (RNP) delivery of CRISPR/Cas in lettuce. In Plant Genome Editing with CRISPR Systems: Methods and Protocols; Springer: Berlin, Germany, 2019; pp. 337–354. [Google Scholar]
  171. Varshney, R.K.; Graner, A.; Sorrells, M.E. Genomics-assisted breeding for crop improvement. Trends Plant Sci. 2005, 10, 621–630. [Google Scholar] [CrossRef] [PubMed]
  172. Krishna, T.P.A.; Veeramuthu, D.; Maharajan, T.; Soosaimanickam, M. The era of plant breeding: Conventional breeding to genomics-assisted breeding for crop improvement. Curr. Genom. 2023, 24, 24–35. [Google Scholar] [CrossRef]
  173. Tanaka, T.; Kobayashi, F.; Ishikawa, G. Current status and future perspective of genomics-assisted breeding in wheat (Triticum aestivum L.). Breed. Sci. 2026, 76, 48–62. [Google Scholar] [CrossRef]
  174. Tyagi, A.; Mir, Z.A.; Almalki, M.A.; Deshmukh, R.; Ali, S. Genomics-assisted breeding: A powerful breeding approach for improving plant growth and stress resilience. Agronomy 2024, 14, 1128. [Google Scholar] [CrossRef]
  175. Alsaleh, A.; Yılmaz, G.; Yazici, L. Using Molecular Markers as a Powerful Tool for Accelerating Hemp Breeding Programs. Kenevir Ve Biyoteknoloji Araştırmaları Derg. 2025, 1, 5–11. [Google Scholar]
  176. Petit, J.; Monfort, A.; Argyris, J. Industrial Hemp (Cannabis sativa L.) Breeding. In Breeding and Biotechnology of Grass and Bast Fiber Crops; Springer: Cham, Switzelrand, 2025; pp. 507–561. [Google Scholar]
  177. Tilkat, E.; Hoşer, A.; Tilkat, E.A.; Süzerer, V.; Çiftçi, Y.Ö. Production of industrial hemp: Breeding strategies, limitations, economic expectations, and potential applications. Türk Bilimsel Derlemeler Derg. 2023, 16, 54–74. [Google Scholar]
  178. Ranalli, P. Current status and future scenarios of hemp breeding. Euphytica 2004, 140, 121–131. [Google Scholar] [CrossRef]
  179. Faux, A.-M.; Draye, X.; Flamand, M.-C.; Occre, A.; Bertin, P. Identification of QTLs for sex expression in dioecious and monoecious hemp (Cannabis sativa L.). Euphytica 2016, 209, 357–376. [Google Scholar] [CrossRef]
  180. Borin, M.; Palumbo, F.; Vannozzi, A.; Scariolo, F.; Sacilotto, G.B.; Gazzola, M.; Barcaccia, G. Developing and testing molecular markers in Cannabis sativa (Hemp) for their use in variety and dioecy assessments. Plants 2021, 10, 2174. [Google Scholar] [CrossRef] [PubMed]
  181. Collard, B.C.; Mackill, D.J. Marker-assisted selection: An approach for precision plant breeding in the twenty-first century. Philos. Trans. R. Soc. B Biol. Sci. 2008, 363, 557–572. [Google Scholar]
  182. Zhao, Y.; Huang, J.; Jei, V.; Mohamed-Hussein, Z.-A.; Xiao, X.; Wang, Y.; Wang, X.; Zhang, H. Development of KASP markers for DNA fingerprinting in fiber-type hemp (Cannabis sativa L.) germplasms. Ind. Crops Prod. 2025, 237, 122320. [Google Scholar] [CrossRef]
  183. Bhat, J.A.; Ali, S.; Salgotra, R.K.; Mir, Z.A.; Dutta, S.; Jadon, V.; Tyagi, A.; Mushtaq, M.; Jain, N.; Singh, P.K. Genomic selection in the era of next generation sequencing for complex traits in plant breeding. Front. Genet. 2016, 7, 221. [Google Scholar] [CrossRef] [PubMed]
  184. Alemu, A.; Åstrand, J.; Montesinos-Lopez, O.A.; y Sanchez, J.I.; Fernandez-Gonzalez, J.; Tadesse, W.; Vetukuri, R.R.; Carlsson, A.S.; Ceplitis, A.; Crossa, J. Genomic selection in plant breeding: Key factors shaping two decades of progress. Mol. Plant 2024, 17, 552–578. [Google Scholar] [CrossRef] [PubMed]
  185. Zhao, Y.; Gowda, M.; Liu, W.; Würschum, T.; Maurer, H.P.; Longin, F.H.; Ranc, N.; Reif, J.C. Accuracy of genomic selection in European maize elite breeding populations. Theor. Appl. Genet. 2012, 124, 769–776. [Google Scholar] [PubMed]
  186. Farooq, M.A.; Gao, S.; Hassan, M.A.; Huang, Z.; Rasheed, A.; Hearne, S.; Prasanna, B.; Li, X.; Li, H. Artificial intelligence in plant breeding. Trends Genet. 2024, 40, 891–908. [Google Scholar] [CrossRef] [PubMed]
  187. Yoosefzadeh Najafabadi, M.; Torkamaneh, D. Machine learning-enhanced multi-trait genomic prediction for optimizing cannabinoid profiles in Cannabis. Plant J. 2025, 121, e17164. [Google Scholar] [PubMed]
  188. Ahmar, S.; Gill, R.A.; Jung, K.-H.; Faheem, A.; Qasim, M.U.; Mubeen, M.; Zhou, W. Conventional and molecular techniques from simple breeding to speed breeding in crop plants: Recent advances and future outlook. Int. J. Mol. Sci. 2020, 21, 2590. [Google Scholar] [CrossRef] [PubMed]
  189. Rai, K.K. Integrating speed breeding with artificial intelligence for developing climate-smart crops. Mol. Biol. Rep. 2022, 49, 11385–11402. [Google Scholar] [CrossRef] [PubMed]
  190. Al-Ashkar, I.; Al-Doss, A.; Ullah, N. Accelerating crop improvement through speed breeding. In Climate-Resilient Agriculture; Springer: Berlin, Germany, 2023; Vol 1: Crop Responses and Agroecological Perspectives; pp. 821–847. [Google Scholar]
  191. Pandey, S.; Singh, A.; Parida, S.K.; Prasad, M. Combining speed breeding with traditional and genomics-assisted breeding for crop improvement. Plant Breed. 2022, 141, 301–313. [Google Scholar] [CrossRef]
  192. Gudi, S.; Kumar, P.; Singh, S.; Tanin, M.J.; Sharma, A. Strategies for accelerating genetic gains in crop plants: Special focus on speed breeding. Physiol. Mol. Biol. Plants 2022, 28, 1921–1938. [Google Scholar] [CrossRef] [PubMed]
  193. Zulkiffal, M.; Ahsan, A.; Ahmed, J.; Mehboob, S.; Ajmal, S.; Hafeez, F.; Khokhar, M.I.; Farooq, M.U.; Nadeem, M.; Abdullah, M. Next-Generation Strategies for Developing and Commercializing Rust-Resistant Wheat through High-Throughput Phenotyping and Genomic Innovations. Plant Biotechnol. 2025, 3, 371–384. [Google Scholar] [CrossRef]
  194. Ghorbanzadeh, Z.; Panahi, B.; Purhang, L.; Hossein Panahi, Z.; Zeinalabedini, M.; Mardi, M.; Hamid, R.; Ghaffari, M.R. Integrative Genomics and Precision Breeding for Stress-Resilient Cotton: Recent Advances and Prospects. Agronomy 2025, 15, 2393. [Google Scholar] [CrossRef]
  195. Schilling, S.; Melzer, R.; Dowling, C.A.; Shi, J.; Muldoon, S.; McCabe, P.F. A protocol for rapid generation cycling (speed breeding) of hemp (Cannabis sativa) for research and agriculture. Plant J. 2023, 113, 437–445. [Google Scholar] [PubMed]
  196. Somody, G.; Molnár, Z. Flowering synchronization using artificial light control for crossbreeding hemp (Cannabis sativa L.) with varied flowering times. Plants 2025, 14, 594. [Google Scholar] [CrossRef] [PubMed]
  197. Amin, A.; Zaman, W.; Park, S. Harnessing multi-omics and predictive modeling for climate-resilient crop breeding: From genomes to fields. Genes 2025, 16, 809. [Google Scholar] [CrossRef] [PubMed]
  198. Yan, J.; Wang, X. Machine learning bridges omics sciences and plant breeding. Trends Plant Sci. 2023, 28, 199–210. [Google Scholar] [CrossRef] [PubMed]
  199. Singh, T. AI-assisted integretomics approaches in plant stress research. J. Plant Biochem. Biotechnol. 2026, 35, 443–448. [Google Scholar] [CrossRef]
  200. Benkirane, C.; Charif, M.; Mueller, C.; Taaifi, Y.; Mansouri, F.; Addi, M.; Bellaoui, M.; Serghini-Caid, H.; Elamrani, A.; Abid, M. Population structure and genetic diversity of Moroccan cannabis (Cannabis sativa L.) germplasm through simple sequence repeat (SSR) analysis. Genet. Resour. Crop Evol. 2024, 17, 2037–2051. [Google Scholar]
  201. Datwyler, S.L.; Weiblen, G.D. Genetic variation in hemp and marijuana (Cannabis sativa L.) according to amplified fragment length polymorphisms. J. Forensic Sci. 2006, 51, 371–375. [Google Scholar] [CrossRef] [PubMed]
  202. Kajitani, R.; Toshimoto, K.; Noguchi, H.; Toyoda, A.; Ogura, Y.; Okuno, M.; Yabana, M.; Harada, M.; Nagayasu, E.; Maruyama, H. Efficient de novo assembly of highly heterozygous genomes from whole-genome shotgun short reads. Genome Res. 2014, 24, 1384–1395. [Google Scholar] [CrossRef] [PubMed]
  203. Pryszcz, L.P.; Gabaldón, T. Redundans: An assembly pipeline for highly heterozygous genomes. Nucleic Acids Res. 2016, 44, e113. [Google Scholar] [CrossRef] [PubMed]
  204. Sun, J.; Chen, J.; Zhang, X.; Xu, G.; Yu, Y.; Dai, Z.; Su, J. Genome-wide association study of salt tolerance at the germination stage in hemp. Euphytica 2023, 219, 5. [Google Scholar]
  205. Van der Werf, H.M. Life cycle analysis of field production of fibre hemp, the effect of production practices on environmental impacts. Euphytica 2004, 140, 13–23. [Google Scholar] [CrossRef]
  206. Kim, S. Genetic and Environmental Factors Shaping Cannabis Phenotypes A Study on Temperature Effects and Genetic Regulation of Anthocyanin Accumulation in Cannabis sativa. Master’s Thesis, University of Wisconsin-Madison, Madison, WI, USA, 2024. [Google Scholar]
  207. Monthony, A.S.; Kyne, S.T.; Grainger, C.M.; Jones, A.M.P. Recalcitrance of Cannabis sativa to de novo regeneration; a multi-genotype replication study. PLoS ONE 2021, 16, e0235525. [Google Scholar] [CrossRef] [PubMed]
  208. Piunno, K.F.; Golenia, G.; Boudko, E.A.; Downey, C.; Jones, A.M.P. Regeneration of shoots from immature and mature inflorescences of Cannabis sativa. Can. J. Plant Sci. 2019, 99, 556–559. [Google Scholar] [CrossRef]
  209. Dey, M.; Bakshi, S.; Galiba, G.; Sahoo, L.; Panda, S.K. Development of a genotype independent and transformation amenable regeneration system from shoot apex in rice (Oryza sativa spp. indica) using TDZ. 3 Biotech 2012, 2, 233–240. [Google Scholar] [CrossRef]
  210. Crossa, J.; Pérez-Rodríguez, P.; Cuevas, J.; Montesinos-López, O.; Jarquín, D.; De Los Campos, G.; Burgueño, J.; González-Camacho, J.M.; Pérez-Elizalde, S.; Beyene, Y. Genomic selection in plant breeding: Methods, models, and perspectives. Trends Plant Sci. 2017, 22, 961–975. [Google Scholar] [CrossRef] [PubMed]
  211. Sirangelo, T.M.; Ludlow, R.A.; Spadafora, N.D. Multi-omics approaches to study molecular mechanisms in Cannabis sativa. Plants 2022, 11, 2182. [Google Scholar] [CrossRef] [PubMed]
  212. Klassen, M.; Anthony, B.P. Legalization of Cannabis and agricultural frontier expansion. Environ. Manag. 2022, 69, 333–352. [Google Scholar]
Figure 1. Historical progression of genetic and genomic advances in Cannabis sativa L. breeding and research. Schematic timeline showing major milestones in Cannabis sativa genetic improvement, from early cultivation to modern precision breeding. The upper panel summarizes key breeding objectives and genetic advances across successive eras, including the differentiation of low-THC industrial hemp (<0.3% THC) from drug-type Cannabis, the adoption of marker-assisted selection (MAS), and the development of climate-resilient cultivars. The lower panel illustrates the progression of breeding technologies, from cytogenetics and next-generation sequencing (NGS) to pangenomic analysis, multi-omics integration, CRISPR/Cas-based genome editing, and AI/ML-assisted genomic prediction. The figure was assembled in Canva, with selected illustrations generated using the DeepAI image generation tool.
Figure 1. Historical progression of genetic and genomic advances in Cannabis sativa L. breeding and research. Schematic timeline showing major milestones in Cannabis sativa genetic improvement, from early cultivation to modern precision breeding. The upper panel summarizes key breeding objectives and genetic advances across successive eras, including the differentiation of low-THC industrial hemp (<0.3% THC) from drug-type Cannabis, the adoption of marker-assisted selection (MAS), and the development of climate-resilient cultivars. The lower panel illustrates the progression of breeding technologies, from cytogenetics and next-generation sequencing (NGS) to pangenomic analysis, multi-omics integration, CRISPR/Cas-based genome editing, and AI/ML-assisted genomic prediction. The figure was assembled in Canva, with selected illustrations generated using the DeepAI image generation tool.
Plants 15 02088 g001
Figure 2. Floral morphology and sex expression in industrial hemp (Cannabis sativa L.). (A) Male plant showing early staminate inflorescences with developing pollen sacs along the nodes. (B) Female plant at the flowering stage with compact pistillate inflorescences and trichome-rich floral bracts. (C) Monoecious plant bearing both male (♂) and female (♀) flowers on the same individual. Scale bar = 1 cm. Photographs were taken by the authors, and the figure was assembled using Adobe Photoshop.
Figure 2. Floral morphology and sex expression in industrial hemp (Cannabis sativa L.). (A) Male plant showing early staminate inflorescences with developing pollen sacs along the nodes. (B) Female plant at the flowering stage with compact pistillate inflorescences and trichome-rich floral bracts. (C) Monoecious plant bearing both male (♂) and female (♀) flowers on the same individual. Scale bar = 1 cm. Photographs were taken by the authors, and the figure was assembled using Adobe Photoshop.
Plants 15 02088 g002
Figure 3. Biosynthetic pathway and regulatory network of cannabinoids in Cannabis sativa L. Cannabinoid biosynthesis occurs in the secretory cells of glandular trichomes on female flowers. The pathway begins with two parallel routes: the cytosolic polyketide pathway, in which hexanoyl-CoA and malonyl-CoA are converted to olivetolic acid (OLA) by olivetol synthase (CsOLS), and the plastidial MEP pathway, which converts pyruvate and glyceraldehyde-3-phosphate into isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP). These intermediates are condensed by geranyl diphosphate synthase (CsGPPS) to produce geranyl diphosphate (GPP). OLA and GPP are then combined by the prenyltransferase CsGOT to form cannabigerolic acid (CBGA), the common precursor of major cannabinoids. CBGA is subsequently converted into tetrahydrocannabinolic acid (THCA), cannabidiolic acid (CBDA), and cannabichromenic acid (CBCA) by CsTHCAS, CsCBDAS, and CsCBCAS, respectively. These acidic cannabinoids are decarboxylated by heat or light to produce the neutral cannabinoids THC, CBD, and CBC. Cannabinoid biosynthesis is regulated by genetic variation, environmental conditions, and transcription factor families, including WRKY, MYB, and AP2/ERF. Scale bar = 5 mm. The figure was designed and assembled in Canva (v1.122.0.0), and the chemical structures were generated using the RCSB Chemical Component Sketch Tool.
Figure 3. Biosynthetic pathway and regulatory network of cannabinoids in Cannabis sativa L. Cannabinoid biosynthesis occurs in the secretory cells of glandular trichomes on female flowers. The pathway begins with two parallel routes: the cytosolic polyketide pathway, in which hexanoyl-CoA and malonyl-CoA are converted to olivetolic acid (OLA) by olivetol synthase (CsOLS), and the plastidial MEP pathway, which converts pyruvate and glyceraldehyde-3-phosphate into isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP). These intermediates are condensed by geranyl diphosphate synthase (CsGPPS) to produce geranyl diphosphate (GPP). OLA and GPP are then combined by the prenyltransferase CsGOT to form cannabigerolic acid (CBGA), the common precursor of major cannabinoids. CBGA is subsequently converted into tetrahydrocannabinolic acid (THCA), cannabidiolic acid (CBDA), and cannabichromenic acid (CBCA) by CsTHCAS, CsCBDAS, and CsCBCAS, respectively. These acidic cannabinoids are decarboxylated by heat or light to produce the neutral cannabinoids THC, CBD, and CBC. Cannabinoid biosynthesis is regulated by genetic variation, environmental conditions, and transcription factor families, including WRKY, MYB, and AP2/ERF. Scale bar = 5 mm. The figure was designed and assembled in Canva (v1.122.0.0), and the chemical structures were generated using the RCSB Chemical Component Sketch Tool.
Plants 15 02088 g003
Figure 4. Comprehensive multi-omics integration and genomic selection framework for industrial hemp (Cannabis sativa L.) improvement. The workflow is organized into five sequential phases, progressing from germplasm characterization to elite cultivar development. (A) Phenotypic data acquisition: Evaluation of diverse germplasm, including landraces, wild/feral populations, and European and Asian genotypes, with measurement of key agronomic traits such as plant height, biomass yield, flowering time, and cannabinoid content using field trials and remote sensing approaches. (B) Multi-omics data generation: Integration of genomic (sequencing and genotyping), transcriptomic (gene expression), proteomic (protein profiling), and metabolomic (chemical profiling) datasets. (C) Genetic mapping and prediction: Identification of trait-associated loci through genome-wide association studies (GWASs) and QTL mapping, combined with genomic prediction models to support selection decisions. (D) Breeding and validation: Application of marker-assisted selection, speed breeding, and multi-environment field trials to validate target traits. (E) Elite genotype development: Development of improved cultivars with enhanced fiber quality, abiotic stress tolerance, disease and pest resistance, and improved nutritional properties. The figure was assembled in Canva with select illustrative elements generated using the DeepAI tool.
Figure 4. Comprehensive multi-omics integration and genomic selection framework for industrial hemp (Cannabis sativa L.) improvement. The workflow is organized into five sequential phases, progressing from germplasm characterization to elite cultivar development. (A) Phenotypic data acquisition: Evaluation of diverse germplasm, including landraces, wild/feral populations, and European and Asian genotypes, with measurement of key agronomic traits such as plant height, biomass yield, flowering time, and cannabinoid content using field trials and remote sensing approaches. (B) Multi-omics data generation: Integration of genomic (sequencing and genotyping), transcriptomic (gene expression), proteomic (protein profiling), and metabolomic (chemical profiling) datasets. (C) Genetic mapping and prediction: Identification of trait-associated loci through genome-wide association studies (GWASs) and QTL mapping, combined with genomic prediction models to support selection decisions. (D) Breeding and validation: Application of marker-assisted selection, speed breeding, and multi-environment field trials to validate target traits. (E) Elite genotype development: Development of improved cultivars with enhanced fiber quality, abiotic stress tolerance, disease and pest resistance, and improved nutritional properties. The figure was assembled in Canva with select illustrative elements generated using the DeepAI tool.
Plants 15 02088 g004
Table 1. Candidate genes associated with major traits.
Table 1. Candidate genes associated with major traits.
TraitCandidate GenesMechanistic Biological FunctionBreeding Targets and Applications
Cannabinoid BiosynthesisDXS, GPPS, OLS, CBDAS, THCAS, and NBCS [93,94]CBGA biosynthesis and enzymatic conversion into major cannabinoids (THC, CBD, and CBC).Development of compliant low-THC cultivars and high-yield cannabinoid medicinal cultivars.
Seed Protein QualityCsEde1 (A–D) and CsEde2 (A–C) [95,96]Edestin storage protein synthesis and regulation of seed amino acid composition.Improvement of protein quality, digestibility, and functional food applications.
Seed Oil ProfilesFAD2, FAD3, KCS, and SAD [19,97]Fatty acid desaturation and elongation controlling PUFA composition and oil quality.Optimization of nutritional oil profiles and enhanced oxidative stability.
Secondary Cell Wall Deposition and Tensile StrengthCesA4, CesA7, CesA8, CsCESA/CSL gene families, and IRX [98]Cellulose and hemicellulose biosynthesis governing fiber strength, flexibility, and cell wall architecture.Improvement of fiber yield, tensile strength, and textile processing quality.
Fiber Quality and LignificationC4H, CCR, and CAD [99]Lignin biosynthesis and deposition within secondary cell walls.Reduction in lignin content and improved fiber-processing efficiency.
Flowering TimeFT, CO, and FLD [75]Photoperiod-mediated regulation of floral transition and reproductive development.Adaptation to diverse environments and development of regionally optimized cultivars.
Stress ToleranceDREB, NAC, WRKY, and MYB [100,101,102]Regulation of drought, salinity, and heat stress responses through transcriptional and cellular defense pathways.Development of climate-resilient cultivars with stable productivity under abiotic stress.
Defense TraitsNBS-LRR, WRKY, and PR genes [103]Pathogen recognition and activation of immune signaling pathways.Enhancement of resistance to fungal and bacterial diseases and reduction in crop losses.
Table 2. Functional genomics and genome-editing tools in Cannabis.
Table 2. Functional genomics and genome-editing tools in Cannabis.
MethodApplicationExample of Cannabis-Specific Target AdvantagesLimitations
QTL MappingPopulation-based; links traits to chromosomes.Plant height, biomass, flowering time, CBD/THC variation [119] and seed size [124].Reliable for major-effect loci; foundational for markers.Low resolution; slow due to outcrossing nature.
GWAS (Genome-Wide Association Study)Population-based; maps traits across diverse germplasm.Flowering time, architecture, cannabinoids, and stress tolerance loci (WRKY/NAC/MYB clusters) [100,147].Captures natural variation; high-resolution mapping.Requires large populations; confounded by population structure.
SNP Genotyping and Marker DevelopmentHigh-throughput screening; marker development (MAS).Cultivar identification, sexing, and chemotype panels (1.5K HASCH, 20-SNP panels) [25,148].Early seedling sexing/chemotype prediction; cost-effectiveness.Requires prior discovery; poor for complex polygenic traits.
RNA-seq (Transcriptomics)Expression profiling across tissues or abiotic/biotic stressors.Sex-linked/male-biased genes; transcripts for environmental sex plasticity [149,150].Global expression profiles; identifies candidate pathway genes.Correlation only; highly tissue-, time-, and condition-specific.
Protoplast Transient Expression AssaysTransient; rapid subcellular and promoter analysis.PEG-mediated system in isolated hemp cells for functional validation [151].Fast feedback (24–48 h); bypasses whole-plant regeneration.Short expression window; highly sensitive to tissue age/genotype.
Virus-Induced Gene Silencing (VIGS)Transient; rapid gene knockdown.Silencing of PDS and ChlI for validation; cannabinoid/pigment pathways [152,153].Fast pipeline; eliminates tissue culture/regeneration needs.Temporary effects; variable knockdown; uneven systemic spread.
Agrobacterium-Mediated TransformationStable; gene introduction and trait engineering.Transgenic calli expressing AtNPR1 (disease) and bar (herbicide) genes [154].Heritable genome integration; permanent transgenic lines.Very time-consuming; bottlenecked by poor regeneration.
CRISPR/Cas9 Genome EditingStable; precise gene knockout or knockin.Knockout of CsPDS1 (albino phenotype proof of concept) [146].High precision; definitive causal validation of gene function.Off-target risks; limited by recalcitrant tissue regeneration.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Daundasekara, K.C.; Thennakoon, K.P.; Wickramasinghe, J.S.; Woldesenbet, S.; Delhom, C.; Chandra, S.; Weerasooriya, A.D. Genetic Advances in Cannabis sativa L.: A Review of Recent Progress and Future Directions. Plants 2026, 15, 2088. https://doi.org/10.3390/plants15132088

AMA Style

Daundasekara KC, Thennakoon KP, Wickramasinghe JS, Woldesenbet S, Delhom C, Chandra S, Weerasooriya AD. Genetic Advances in Cannabis sativa L.: A Review of Recent Progress and Future Directions. Plants. 2026; 15(13):2088. https://doi.org/10.3390/plants15132088

Chicago/Turabian Style

Daundasekara, Kasuni C., Kalpani P. Thennakoon, Jivendra S. Wickramasinghe, Selamawit Woldesenbet, Christopher Delhom, Suman Chandra, and Aruna D. Weerasooriya. 2026. "Genetic Advances in Cannabis sativa L.: A Review of Recent Progress and Future Directions" Plants 15, no. 13: 2088. https://doi.org/10.3390/plants15132088

APA Style

Daundasekara, K. C., Thennakoon, K. P., Wickramasinghe, J. S., Woldesenbet, S., Delhom, C., Chandra, S., & Weerasooriya, A. D. (2026). Genetic Advances in Cannabis sativa L.: A Review of Recent Progress and Future Directions. Plants, 15(13), 2088. https://doi.org/10.3390/plants15132088

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop