Next Article in Journal
Social Diversification Driven by Mobile Genetic Elements
Previous Article in Journal
SDHA Germline Mutations in SDH-Deficient GISTs: A Current Update
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Structural and Functional Classification of G-Quadruplex Families within the Human Genome

1
School of Graduate and Interdisciplinary Studies, University of Louisville, Louisville, KY 40292, USA
2
Department of Neuroscience Training, University of Louisville, Louisville, KY 40292, USA
3
Kentucky IDeA Network of Biomedical Research Excellence (KY INBRE) Bioinformatics Core, University of Louisville, Louisville, KY 40292, USA
4
Department of Biochemistry and Molecular Genetics, University of Louisville, Louisville, KY 40292, USA
*
Author to whom correspondence should be addressed.
Genes 2023, 14(3), 645; https://doi.org/10.3390/genes14030645
Submission received: 13 February 2023 / Revised: 22 February 2023 / Accepted: 2 March 2023 / Published: 4 March 2023
(This article belongs to the Section Bioinformatics)

Abstract

G-quadruplexes (G4s) are short secondary DNA structures located throughout genomic DNA and transcribed RNA. Although G4 structures have been shown to form in vivo, no current search tools that examine these structures based on previously identified G-quadruplexes and filter them based on similar sequence, structure, and thermodynamic properties are known to exist. We present a framework for clustering G-quadruplex sequences into families using the CD-HIT, MeShClust, and DNACLUST methods along with a combination of Starcode and BLAST. Utilizing this framework to filter and annotate clusters, 95 families of G-quadruplex sequences were identified within the human genome. Profiles for each family were created using hidden Markov models to allow for the identification of additional family members and generate homology probability scores. The thermodynamic folding energy properties, functional annotation of genes associated with the sequences, scores from different prediction algorithms, and transcription factor binding motifs within a family were used to annotate and compare the diversity within and across clusters. The resulting set of G-quadruplex families can be used to further understand how different regions of the genome are regulated by factors targeting specific structures common to members of a specific cluster.
Keywords: G-quadruplex; G4; clustering; hidden Markov models; DNA structures G-quadruplex; G4; clustering; hidden Markov models; DNA structures

Share and Cite

MDPI and ACS Style

Neupane, A.; Chariker, J.H.; Rouchka, E.C. Structural and Functional Classification of G-Quadruplex Families within the Human Genome. Genes 2023, 14, 645. https://doi.org/10.3390/genes14030645

AMA Style

Neupane A, Chariker JH, Rouchka EC. Structural and Functional Classification of G-Quadruplex Families within the Human Genome. Genes. 2023; 14(3):645. https://doi.org/10.3390/genes14030645

Chicago/Turabian Style

Neupane, Aryan, Julia H. Chariker, and Eric C. Rouchka. 2023. "Structural and Functional Classification of G-Quadruplex Families within the Human Genome" Genes 14, no. 3: 645. https://doi.org/10.3390/genes14030645

APA Style

Neupane, A., Chariker, J. H., & Rouchka, E. C. (2023). Structural and Functional Classification of G-Quadruplex Families within the Human Genome. Genes, 14(3), 645. https://doi.org/10.3390/genes14030645

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop