A Multi-Layered Study on Harmonic Oscillations in Mammalian Genomics and Proteomics

Cellular, organ, and whole animal physiology show temporal variation predominantly featuring 24-h (circadian) periodicity. Time-course mRNA gene expression profiling in mouse liver showed two subsets of genes oscillating at the second (12-h) and third (8-h) harmonic of the prime (24-h) frequency. The aim of our study was to identify specific genomic, proteomic, and functional properties of ultradian and circadian subsets. We found hallmarks of the three oscillating gene subsets, including different (i) functional annotation, (ii) proteomic and electrochemical features, and (iii) transcription factor binding motifs in upstream regions of 8-h and 12-h oscillating genes that seemingly allow the link of the ultradian gene sets to a known circadian network. Our multifaceted bioinformatics analysis of circadian and ultradian genes suggests that the different rhythmicity of gene expression impacts physiological outcomes and may be related to transcriptional, translational and post-translational dynamics, as well as to phylogenetic and evolutionary components.


Introduction
Tissue and cellular functions underlying physiology of living organisms show time-dependent variations predominantly featured by 24-h (circadian) periodicity and driven by molecular clockworks operated by rhythmically expressed genes and proteins hardwiring transcriptional/translational feedback loops (TTFL) [1][2][3][4]. Circadian expressed genes include a handful of core-clock genes that in turn drive thousands of downstream clock-controlled genes [5,6]. Experiments performed in animal models showed that approximately half of the transcriptome shows 24-h oscillations that manage crucial biological processes such as the cell cycle, proliferation, metabolism, DNA damage repair, apoptosis and autophagy [7][8][9][10].
The interacting positive and negative limbs of the TTFL regulate gene transcription through sequential cycles of transcriptional activation of the expression of clock genes followed by transcriptional suppression by their protein products [11,12]. The positive limb is operated by CLOCK and BMAL1 that heterodimerize and activate the transcription of cryptochrome genes (Cry1 and Cry2) and period genes (Per1, Per2 and Per3), which operate the negative limb encoding repressors hindering gene transcription. Conversely, Bmal1 rhythmic expression is driven by the nuclear receptors REV-ERBα and RORα through competitive binding at its promoter region [13,14].
Gene expression profiling performed by means of high-throughput measurements with DNA microarrays and quantitative PCR in mouse liver specimens collected at regular time intervals showed that two groups of genes oscillate at the second (12-h) and third (8-h) harmonic of the fundamental (24-h) frequency [15].
The aim of our study was to characterize genomic and proteomic features of the clusters of genes oscillating with harmonics of circadian periodicity. We exploited bioinformatics tools for functional prediction to identify the biological functions and enriched signalling pathways and to perform comparative qualitative proteomic analysis. Finally, we implemented several computational strategies in order to detect the presence of significant de-novo regulatory motifs and known transcription factor binding sites in the promoter region of clock genes, with respect to the whole mouse genome.
We investigated the following working hypotheses: (i) Circadian genes and genes oscillating with harmonic frequencies show dissimilar biological facets and encode different proteome profiles; (ii) canonical and non-canonical DNA structures are found within the upstream regions of the oscillating genes subsets; (iii) ultradian genes connect to an identified circadian network through distinctive upstream short nucleotide sequences and DNA binding sites. Our results show that the three subsets of oscillating genes are hallmarked by very different functional annotation and proteomic features, as well as peculiar transcription factor binding motives, in addition to canonical binding sites. These are found within the upstream regions of rhythmically expressed target genes and seemingly allow for the link of the ultradian gene sets to a known circadian network.

Results
To characterize particular features of the gene sets with ultradian and circadian periodicity (8-h, 12-h, 24-h gene sets), we used a variety of computational and bioinformatics methods including a comprehensive analysis at the gene expression level namely: a sequence analysis for known transcription factor binding sites, multiple sequence alignment and phylogenetic analysis, enrichment analysis of the three gene sets, as well as the analysis of epigenetic and non-epigenetic regulation of oscillating gene expression. We further carried out an analysis at the protein level and investigated the electrochemical properties of oscillating proteins and completed our analysis by generating chromosomal co-localization networks created upon homology mapping of oscillating genes.

Known Transcription Factor Binding Sites Are Enriched in the Promoter Regions of the 8-h, 12-h, and 24-h Rhytmically Expressed Genes
To characterize a putative differential functionality of ultradian genes, we searched for enriched transcription factor (TF) binding sites in the promoter regions of 8-h and 12-h gene sets as compared to 24-h rhythmically expressed gene set, using the MEME SUITE AME tool [23]. We found several significantly enriched binding sites (p > 0.05), Tables S1-S6. The top 5 enriched TF binding sites for the  (Table S1),

119
In addition, using the AME tool of the MEME suite we searched for E-boxes and D-boxes, 120 conserved motifs known to be the present in the promoter region of clock-controlled genes, and 121 bound by core-clock elements. Both E-boxes and D-boxes were detected in the upstream promoter 122 region of the 8-h gene set and of the 12-h gene set (p < 0.05). However, other motifs are more 123 significantly enriched (Figure 1). We detected E-boxes in the 8-h gene set in 38.3% of the upstream 124 promoter sequences (adj. p = 4.99e-2), and in the 12-h gene set in 6.9% of the upstream promoter 125 sequences (adj. p = 3.75e-8). In particular, we detected CLOCK (adj. p = 1.25e-16) and BMAL1 (adj. p 126 = 5.83e-06) binding motifs in the upstream promoter regions of the 12-h gene set (Table S3). For the 127 24-h gene set we detected E-boxes in 26.7% of the upstream promoter sequences (adj. p = 3.04e-48).
In addition, using the AME tool of the MEME suite we searched for E-boxes and D-boxes, conserved motifs known to be the present in the promoter region of clock-controlled genes, and bound by core-clock elements. Both E-boxes and D-boxes were detected in the upstream promoter region of the 8-h gene set and of the 12-h gene set (p < 0.05). However, other motifs are more significantly enriched ( Figure 1). We detected E-boxes in the 8-h gene set in 38.3% of the upstream promoter sequences (adj. p = 4.99e-2), and in the 12-h gene set in 6.9% of the upstream promoter sequences (adj. p = 3.75e-8).

Phylogenetic Analysis Shows Similarity within the Promoter Regions of the 8 h and 12 h Gene Sets
The significantly enriched motifs found point to a common regulatory system for the 8-h and 12-h gene sets, hence we hypothesized the existence of an evolutionary connection between the promoter regions of both gene sets as the key mechanism of activation is most likely evolutionary ancient and well conserved (as the clock itself). To further investigate this hypothesis we generated phylogenetic trees, as an output visualization of the multiple sequence alignments of the 3500 bp upstream promoter region of the 8-h and 12-h gene sets ( Figure 2). First, we produced a multiple sequence alignment of the promoter region sequences together with 10 control sequences of non-oscillating genes. Second, we created phylogenetic trees from the resulting alignment using the Felsenstein (F84) (Figure 2A) and Jukes-Cantor (JC) ( Figure 2B) nucleotide substitution models [24]. We tagged the 8-h oscillating genes with red markers in the resulting trees and the 10 control genes with green markers. When utilizing 10 non-oscillating genes as control sequences, the 8-h and 12-h gene sets do not show a clustering pointing at strong evolutionary conservation of the entire sequence regardless of the substitution model used. Hence, while individual binding sites for TFs are highly enriched and conserved, the promoter sequence itself varies greatly and may allow for the fine-tuning of expression for individual genes.  (Table S6). Interestingly, CLOCK (adj. p = 130 0.02) and BMAL1 (adj. p = 3.28e-05) binding motif are also present in the downstream promoter region of the 24-h gene set (Table S5).

133
The significantly enriched motifs found point to a common regulatory system for the 8-h and     To search for specific functions of the individual gene sets, we investigated a possible enrichment of Gene Ontology and Reactome Pathways terms for the 8-h, 12-h and 24-h rhythmically expressed genes. In all cases, significant enrichments, generated with ConsensusPathDB, are present (p < 0.01). While the 8-h gene set showed an enrichment of terms related to metabolism, the 12-h set showed an enrichment of terms related to endoplasmic reticulum (ER)-related processes, splicing, translation and gene expression regulation. The 24-h rhythmically expressed genes showed an enrichment of terms related to meiosis, and splicing ( Figure S3), which is in line with our previous findings [25][26][27][28].
We further explored the putative connection of the ultradian gene sets to a known circadian (approximately 24 h rhythmically expressed elements) network (NCRG-network of circadian regulated genes [10]) for that, we performed a series of simulations based on randomized protein-protein interaction networks. The random network generation is based on the IntAct database contained in iRefIndex. We quantified the number of interactions between the elements of the 8-h gene set, and 100 random networks of the same size as the NCRG. In addition, we also quantified the number of interactions between the 8-h gene set and the NCRG. While the average interactions between the 8-h gene set and the random networks was 3.99 ± 2.91 (mean and SD), the number of interactions between the same gene set and the NCRG was 12. We applied the same procedure to the 12-h gene set and obtained 19 ± 9.25 (mean and SD) connections to the random networks, while the number of connections between the NCRG and the 12 h gene set was 87. Both sets therefore exhibit a connectivity to the NCRG that was higher than the connectivity displayed by the random gene sets. Thus, the randomized network analysis points to a connection between the ultradian rhythmically expressed genes, the core-clock and clock-controlled genes.

Epigenetic and Non-epigenetic Regulation of Oscillating Gene Expression
We performed an enrichment analysis for histone modifications associated with the 8-h, 12-h and 24-h gene sets available in the public Encode 2015 project data [29]. The enriched histone methylation pattern of H3K79me2 associated with the 12-h and 24-h gene sets is tissue-specific for the liver in agreement with the original data, generated from liver cells. This points to a tissue specificity of the methylation pattern and the corresponding expression of rhythmically expressed genes. The methylation pattern of the 8-h gene set is associated with a different cell line (M. Musculus MEL, p = 3.555e-02).
The H3K79me2 histone modification is associated with the function of the RNA polymerase II ( Figure S4). RNA Polymerase II plays a major role in the transcription regulation of the 12-h and 24-h rhythmically expressed genes based on ChIPSeq data from the ENCODE project and as suggested by the previous histone modification data. However, GABPA (GA Binding Protein Transcription Factor Subunit Alpha, p = 4.315e-08) TF scores higher in the 12-h oscillating gene set than RNA Polymerase II. GABPA is related to the mitochondrial gene expression pathway, thus pointing again at the potential metabolic role of the genes with ultradian oscillations. For the 8-h gene set we detected an enrichment of TCF12 (Transcription Factor 12) binding. This transcription factor recognizes E-boxes and is involved in the formation of lineage-specific gene expression. This enrichment illustrates the important role of the E-boxes, which even though not being the most enriched motif seem to attract the strongest TF activity ( Figure S4).
Moreover, we investigated the protein-protein interactions of the transcription factors potentially influencing the gene sets according to the ENRICHR database [30]. This enrichment showed interesting candidates-POLE (DNA polymerase Epsilon, Catalytic Subunit, p = 1.026e-02) for the 8-h gene set and ESR1 (Estrogen receptor 1) for the 12-h (p = 1.137e-07) and 24-h (p = 7.235e-13) gene sets. Figure S4. We further investigated the role of POLE in a potential cancer context. From publicly available data the high expression of POLE is an unfavourable marker in renal cancer and melanoma [31] ( Figure S5).
Altogether, the enrichment information points towards very specific processes that govern the regulation and output of the ultradian oscillating genes. Often a single microRNA (such as miRNA-1295 for the 8-h gene set) or a single gene such as POLR2A in the 24-h gene set are predicted to have the most significant results in terms of interaction with other genes, regulation of transcription or computationally predicted targets in the genomic sequence.

Electrochemical Properties of Oscillating Proteins
Next, we analyzed the electrochemical features of the proteins encoded by circadian and ultradian genes as compared to a randomly sorted set of proteins encoded by non-oscillating genes ( Table 1,  Table S7). The overall stability as predicted by the FoldX algorithm [32] was higher in oscillating proteins when compared to non-oscillating proteins (although with a barely detectable statistically significant difference), whereas the terms represented by interresidue Van der Waals' clashes and electrostatic interaction (Table S8) between molecules in the precomplex were significantly different. A correlation analysis showed negative correlation of these terms with free energy values in oscillating proteins and positive correlation in non-oscillating proteins, suggesting a different contribution to the overall protein stability.
Among the oscillating proteins subsets, 8-h oscillating proteins showed statistically significant differences in respect to 12-h and 24-h oscillating proteins, with lower average number of residues and with higher free energy (lower energy of unfolding), suggesting lower overall stability. In this regard, the components that were different in a statistically significant way were represented by solvation of polar and hydrophobic atoms, water binding, Van der Waals energy, steric clashes, hydrogen bonds, electrostatic interactions (Table 1). Gibbs free energy was negatively correlated with these statistically significant variables in the 8-h oscillating proteins, while an inverse correlation was found in the set of 12-h oscillating proteins, hinting at a diverse involvement in the net equilibrium of forces settling on unfolded or folded protein state. On the other hand, similar correlations were found for 8-h and 24-h oscillating proteins, except for the contribution of hydrophobic groups to free energy difference (Figure 3 and Figure S6).

Chromosome Mapping of Oscillating Genes
All 8-h subset (56) and nearly all 12-h subset of mouse genes (202 out of 205) were mapped to human homologs, while only 1826 out of 2054 mouse circadian genes were suitably mapped. The genes of the three classes were distributed along all chromosomes, with no chromosome left uncovered and no homolog and paralog gene localized on the same chromosome both in human and in mouse (Figure 4). Only a few oscillating genes mapped to chromosome Y, precisely one circadian gene in the mouse set, and one ultradian and one circadian gene in human set (Table S9). The intersection of mouse and human co-localization networks created upon homology mapping of oscillating genes revealed high localization conservation for the 8-h gene sets between both species (65%), a moderate conservation for 12-h gene sets (23%) and poor conservation of chromosomal localization for circadian genes (6%) ( Table 2).

Discussion
Frequency multiplication is a common occurrence in rhythmic phenomena observed in multifaceted systems of interest for a variety of scientific disciplines, for instance physics, chemistry, biology, astronomy. In natural and life sciences, harmonics of circadian frequency have been initially reported prior to the foundation of chronobiology as a separate area of scientific research addressing rhythmic phenomena in living beings. Nonetheless, the scientific literature on the multiplication of circadian periodicity in biological processes remains limited at the present time.
The comprehensive bioinformatics analyses performed on transcriptomics and proteomics data in mammalian genes expressed with 24-h periodicity and with harmonics of circadian rhythmicity allowed us to highlight a number of interesting differences among the subsets of oscillating genes: (i) circadian genes and genes oscillating at the second and third harmonic of 24-h periodicity show divergent functional annotation and proteomic characteristics; (ii) within their upstream regions unusual transcription factor binding motives other than canonical binding sites are found; (iii) genes oscillating at the second and third harmonics are connected by specific regulatory motifs and transcription factor binding sites to a recognized circadian network.
In particular, concerning shared enriched transcription factor binding sites in the promoter regions of the circadian and ultradian genes suggest equivalent transcriptional control of time-dependent gene expression. In the upstream promoter region of ultradian genes, in addition to other motifs more significantly enriched, we identified E-boxes and D-boxes, which were not found in their downstream promoter regions. Moreover, the phylogenetic analysis of the promoter regions of the ultradian gene sets showed variability of the entire promoter sequence, which could eventually allow the accurate regulation of expression of the different genes. Furthermore, a randomized network analysis suggested a possible connection between the ultradian genes subset and the circadian clock circuitry. The subsequent enrichment analysis showed that the 8-h oscillating genes were enriched in terms related to metabolism, the 12-h oscillating genes in terms related to ER-related processes, splicing, translation and gene expression regulation, while the 24-h oscillating genes in terms related to meiosis, and splicing. This is in agreement with previous results [25,26,33].

Oscillating Proteins Are Hallmarked by Higher Overall Stability when Compared to Non-Oscillating Proteins
Bioinformatics analysis of the electrochemical properties of non-oscillating and oscillating proteins showed that the oscillating proteins are hallmarked by higher overall stability when compared to non-oscillating proteins, mainly in relation to significant dissimilarity of two components of free energy calculation in the FoldX protein design algorithm, one related to inter-residue close contacts and the other represented by electrostatic contribution of interactions at interfaces, which differently contributed to the free energy value in the two subsets. Considering the three oscillating proteins subsets, 8-h oscillating proteins showed lower mean residue number and lower overall stability, mainly in relation to different polar and hydrophobic desolvation, water binding, Van der Waals energy, steric clashes, hydrogen bonds, electrostatic interactions, interestingly with opposite correlations when matched up to the other ultradian subset of proteins. Protein folding allows free volume to decrease and considerably impacts protein conformational/binding equilibrium and ultimately physiological function in conditions of macromolecular crowding, such as those hallmarking cellular and sub-cellular volume-restricted compartments [34,35]. The spatio-temporal gathering of oscillating proteins may impact the effects of macromolecular crowding on equilibrium stability of proteins with different folds, cofactors and mechanisms. Protein folding and unfolding kinetics are influenced by crowding, with stabilizing effects whose degree will hinge on intrinsic stability and protein fold [34,35]. In this context, the molecular clockwork could manage the phase relation between subcellular oscillation patterns of folded, intermediate, and unfolded proteins, as well as of molecular chaperones that assist these transitions, especially considering that macromolecular crowding accelerates folding, but over a given limit the folding process will be hindered [34,35].

Specific Enriched Processes Govern the Regulation and Output of the Ultradian Oscillating Genes
The enrichment analysis for histone modifications showed association with the 12-h and 24-h oscillating genes of H3K79me2, involved in RNA polymerase II function, whereas 8-h oscillating genes showed binding enrichment for TCF12, a transcription factor capable of binding to E-boxes. Altogether, the enrichment information points towards very specific processes that govern the regulation and output of the ultradian oscillating genes. Often a single microRNA (such as miRNA-1295 for the 8 h gene set) or a single gene such as POLR2A in the 24-h gene set are predicted to have the most significant result in terms of targets. The enrichment analysis for computationally predicted miRNA targets pinpointed to the 8-h gene set as targets for miRNA-1295, to the 12-h gene set for miRNA 344, miRNA 344c, miRNA 1244 and miRNA 499, whereas the 24-h oscillating genes appeared as targets for miRNA-4637. Furthermore, the protein-protein interactions of the transcription factors potentially influencing the oscillating gene sets identified as major candidates POLE for the 8-h gene set and ESR1 for the 12-h and 24-h gene sets. Interestingly, elevated POLE expression predicts poorer outcome marker in renal cancer and melanoma patients.

Homology Mapping of Oscillating Genes Revealed Different Degree of Localization Conservation for the Three Gene Sets
Mapping of the 8-h, 12-h and 24-h oscillating genes in mouse and human chromosomes revealed scattering of the three classes along all chromosomes, with no chromosome left uncovered and homologs and paralogs of core-clock genes and clock-controlled genes never localized on the same chromosome. Nevertheless, in both species only a few oscillating genes mapped to chromosome Y, probably in relation to the peculiar role played by this allosome in male fertility and sex determination in mammals. In addition, we found high localization conservation for the 8-h genes (65%) between both species, a moderate conservation for 12-h genes (23%) and a poor conservation of localization for circadian genes (6%).

Primary Dataset
Bioinformatics analyses were performed on publicly available genomic data (GSE11923). Briefly, liver samples were collected every hour for 48 h from n = 3-5, 6-week-old male C57BL/6J mice (Jackson) per time point, the specimens were pooled, and high-temporal resolution profiling was performed using Affymetrix arrays to detect cycling genes. Fisher's G-test at a false-discovery rate of < 0.05 and COSOPT were jointly exploited to recognize rhythmic transcripts, which were classified, depending on the length of the oscillation period, as circadian (24 ± 4 h) and ultradian (12 ± 2 h and 8 ± 1 h) [15]. Array probe IDs/nucleotide sequences of 8-h, 12-h and 24-h oscillating genes were registered and by using BioDBnet (https://biodbnet-abcc.ncifcrf.gov/db/db2db.php) 56, 202 and 2396 Ensembl Transcript IDs were recovered from the primary dataset, respectively.

Sequence Analysis for Known Transcription Factor Binding Sites
For the initial data acquisition, we performed an analysis on pre-selected data sets corresponding to the above-mentioned gene-probes with 8-h, 12-h and 24-h rhythmic oscillations. To perform the sequence analysis, we extracted and analyzed the 3500 bp flanking sequences upstream and the 300 bp flanking sequences downstream of the complete corresponding genes. The mapping and sequence selection were carried out with Ensembl biomaRt (Ensembl revision 84). We searched for enriched known motifs and specific acceptance for gapped motifs with the MEME SUITE software (http://meme-suite.org/) [36]. The length of the motif correlates with its statistical significance. MEME defines the most statistically significant motif based on its E-value (low E-value). The E-value of a motif is based on its log likelihood ratio, width, sites, the background letter frequencies, and the size of the training set. The E-value is an estimate of the expected number of motifs with the given log likelihood ratio (or higher), and with the same width and site count, as found in a similarly sized set of random sequences. We used the AME tool and the HOCOMOCOv11 [37,38] database as motif sources. The AME tool specifically searches for enrichments of known motifs from the database selected. The 3500 bp upstream promoter region was scanned, as well as the 300 bp downstream motif region.

Multiple Sequence Alignment and Phylogenetic Analysis
Multiple sequence alignments of the promoter regions of the 8-h and 12-h gene sets were created with MUSCLE (http://www.drive5.com/muscle/). The resulting alignments were used for further phylogenetic analysis of the promoter regions of the 8-h and 12-h gene sets. Phylogenetic tree creation was performed with PHYLIP's neighbor joining method with F84 and Jukes-Cantor substitution models [24] (http://evolution.genetics.washington.edu/phylip.html).

Enrichment Analysis of the 8-h, 12-h and 24-h Gene Sets
The enrichment analysis for the mouse gene sets was performed with ConsensusPathDB (http://consensuspathdb.org) [39]. An analysis for Enriched Reactome and GO terms was performed. The p-value cut-off was set to 0.01 and the GO terms were set to level 4. Each of the 8-h, 12-h and 24-h gene sets was analyzed individually.

Randomized Network Analysis on Ultradian and Circadian Genes
A network analysis to explore the connection between the ultradian genes, and the core-clock and clock-controlled genes was performed through a series of simulations based on randomized protein-protein interaction networks. For the network creation the probes were mapped to Uniprot and Entrez ID. The network was generated from IntAct data contained in the iRefIndex database (snapshot from 2015) (http://irefindex.org) which summarizes protein-protein interaction data from different sources. The computations were performed using the iRefR R package with 100 random sets of genes of matching size.

Impact of Protein Expression on Survival
Survival data associated with protein expression was retrieved from the Protein Atlas database [31].

Electrochemical Properties of Oscillating Proteins
To predict the electrochemical properties of proteins encoded by ultradian and circadian genes we used the corresponding three-dimensional structural data stored in the protein data bank (PDB; http://www.rcsb.org/pdb/) [42]. All complex analyses were performed with FoldX, which is one of the best stability predictors and is easily implementable in a pipeline [32]. FoldX is an empirical force field that was developed or the rapid evaluation of the stability, folding of proteins and nucleic acids. It is composed of a solvation term, a van der Waals term, H-bond, and electrostatic terms and entropic terms for the backbone and side chains. In the case of protein complexes, an extra term related to the electrostatic contribution is also considered. The software package FoldX includes subroutines, e.g., RepairPDB. The way it operates is the following: first it looks for all Asparagine, Glutamine and Histidine residues and flips them by 180 degrees. This is done to prevent incorrect rotamer assignment in the structure due to the fact that the electron density of Asparagine and Glutamine carboxamide groups is almost symmetrical and the correct placement can only be discerned by calculating the interactions with the surrounding atoms. The same applies to Histidine. It does a small optimization of the side chains to eliminate small VanderWaals' clashes. This way it prevents moving side chains in the final step. "RepairPDB" identifies the residues that have very bad energies and mutates them and their neighbors' to themselves exploring different rotamer combinations to find new energy minima. Correlations between the parameters were investigated by pairwise correlation analysis (Spearman correlation; R package PerformanceAnalytics). The statistical analysis for electrochemical features of the ultradian and circadian gene sets was conducted using the energy values as from the FOLD-X [32] energy function and performing a Kruskal-Wallis one-way analysis of variance with Residue Number as covariate and Dunn's post hoc test with false discovery rate (FDR) correction.

Chromosome Mapping of Oscillating Genes
H. sapiens homologs for 8-h, 12-h and 24-h M. musculus oscillating genes (GSE11923) were retrieved using biomaRt. Genes not matching between mouse and humans were sought manually using the latest version of Ensembl web portal. In case of multiple homologs (one-to-many or many-to-many relationships), the following scores were considered, in this order of priority: confidence score; gene order conservation (GOC) score; target %ID, which refers to the percentage of the sequence in the target species (human) that matches to the query sequence (mouse); query % ID, which refers to the percentage of the sequence in the query species that matches to the homologue; dN/dS ratio. The number of oscillating genes divided by the total number of genes in each chromosome was represented by bar plots. The extent of gene co-localization overlap was assessed by using networks. Genes of the three subsets, i.e., 8-h, 12-h and 24-h oscillating genes, were represented as networks, where the genes, symbolized as nodes, were linked by edges if they were located on the same chromosomes. For each mouse and human subset of genes networks were built and then intersected by means of Pyntacle (http://pyntacle.css-mendel.it/). An intersection network was built considering only nodes and edges in common between the two original networks. Pairs of intersecting genes were considered to be on the same chromosome in mouse and human, even if the chromosome were not the same between the two species.

Conclusions
High-throughput analysis over time-series microarray expression data unveils harmonics in oscillation patterns of omics that, intermingling with spatial hierarchical branching networks lapsing in size-invariant units, could endow a fourth temporal dimension at least complementary to the fourth spatial dimension blueprinted by fractal-like networks broadly pervasive in nature. Our wide-ranging characterization of genomic, proteomic and functional properties of oscillating genes and proteins suggests that ultradian and circadian rhythmicity in omics could subtend or alternatively be related to specific mechanisms underlying the functioning of various and complex biological phenomena crucial to make life possible.
Supplementary Materials: Supplementary materials can be found at http://www.mdpi.com/1422-0067/20/18/4585/s1. Figure S1. Enrichment analysis for Gene Ontology and Reactome Pathways terms in the transcription factors for which enriched binding sites were detected in the promoter regions of the 8-h, 12-h and 24-h gene sets. Figure  S2. High-resolution format of Figure 2. Figure S3. Pathways and GO terms enrichment analysis points at the specific functions for each gene set. Figure S4. Enrichment analysis of histone modifications. Figure S5. High POLE expression is associated with negative outcome for renal cancer and melanoma. Figure S6. Correlation matrix for parameters of the 8-hours, 12-hours and 24-hours gene sets. Figure S7. Cytoscape files for the network in Figure 4. Table S1. Enriched binding sites for known transcription factors in the upstream regions of the promoters of the 8-h gene set. Table S2. Enriched binding sites for known transcription factors in the downstream regions of the promoters of the 8-h gene set. Table S3. Enriched binding sites for known transcription factors in the upstream regions of the promoters of the 12-h gene set. Table S4. Enriched binding sites for known transcription factors in the downstream regions of the promoters of the 12-h gene set. Table S5. Enriched binding sites for known transcription factors in the downstream regions of the promoters of the 24-h gene set. Table S6. Enriched binding sites for known transcription factors in the upstream regions of the promoters of the 24-h gene set. Table S7. Descriptive statistics for the data in Table 1. Table S8. Free energy (∆G) terms included in the core function of FoldX, the empirical force field algorithm aiming to calculate the change of ∆G in kcalmol −1 . Table S9