1. Introduction
Tailed bacteriophages have historically served as model systems for mechanistic investigations of powerful molecular motors that use ATP hydrolysis to translocate double-stranded DNA into pre-formed virus protein capsids [
1,
2,
3,
4]. Most large single-stranded DNA (ssDNA) viruses and dsDNA viruses—such as herpes virus, pox virus, adenovirus, and all the tailed dsDNA bacteriophages—assemble an empty virus shell, and then actively translocate DNA into this procapsid. There are considerable entropic and enthalpic costs associated with compacting DNA to near-crystalline densities inside virus shells [
3,
5,
6]. DNA must be bent significantly and repeatedly, and it must be condensed without any knots, tangles, or other topological anomalies that would prevent genome ejection during subsequent infection events. To package DNA efficiently, viruses code for homomeric ring additional strand conserved glutamate (ASCE) ATPases that tightly couple ATP hydrolysis and DNA translocation [
1,
3,
4,
7,
8]. This ancient ATPase superfamily is ubiquitous across all three domains of life, and its members are typically involved in polymer manipulation tasks (e.g., protein degradation, chromosome segregation, DNA recombination, DNA strand separation, and conjugation) or in molecular segregation, such as proton movement by the F1-ATP synthase [
9,
10,
11,
12]. Thus, understanding the mechanistic principles underlying viral DNA packaging will also illuminate the mechano-chemistry of a broad class of molecular motors responsible for basic macromolecular partitioning processes. Within the ASCE ATPase superfamily, virally encoded dsDNA packaging motors are especially powerful, capable of producing forces in excess of 50 pN [
13,
14]. Hence the virally encoded ATPases that drive dsDNA encapsidation provide a unique window into how maximum mechanical force can be extracted from the chemical hydrolysis of ATP. Despite the general ubiquity of these motors in nature, the manner in which they are coupled to genome encapsidation in dsDNA viruses is unique; hence, these motors are attractive targets for therapeutics that inhibit viral packaging machinery [
15].
Given the importance of viral dsDNA packaging motors in both applied and basic biomedical research, considerable effort has been exerted to understand the molecular basis of genome packaging. As a result, several different bacteriophage dsDNA packaging systems have been developed to experimentally interrogate the encapsidation process, each of which have relative advantages and disadvantages [
1,
2,
3]. Common to all systems, the packaging motors include a portal/connector protein that provides the portal for DNA entry and exit, and an ATPase that powers DNA translocation through the portal and into the capsid. While the packaging motors in each system essentially perform the same task, use similar molecular machinery, and likely share a common underlying mechano-chemistry, there are important differences that must be considered. To provide a simple conceptual framework for categorizing packaging motors, we distinguish between two different types: (1) those that must cut their DNA at the beginning and end of packaging to encapsidate concatenated genomes; and (2) those that simply package a fixed unit-length genome.
Two of the most well-known examples of genome-cutting packaging systems are bacteriophages lambda and T4 [
1,
2]. In these phages, a concatenated genome is produced upon replication, and thus the motor is not simply a translocating machine. It first must find the genome start site on the concatenated multi-genome, make a cut in the genome, and only then begin packaging the DNA into the capsid. Further, as the capsid fills with DNA, the motor must somehow sense that the packaging is complete and cut the genome again such that the motor–DNA complex can detach from the filled head and re-attach to another empty procapsid. In bacteriophage lambda a specific DNA sequence known as a “cos” site is recognized as the cut site at the beginning and end of packaging. Bacteriophage T4 uses a “headful” mechanism, wherein the phage somehow recognizes that slightly more than one genome-length of DNA has been packaged, thus filling the procapsid.
To accomplish these additional tasks, genome-cutting motors require additional protein components to identify the genome and locate the appropriate start site for packaging, as well as a functional nuclease to cut the genome at the beginning and the end of packaging. Hence, genome-cutting phages also code for a fully functional RNaseH-like nuclease that is fused to the ATPase as a C-terminal domain and which cuts DNA at the beginning and end of packaging. Due to this nucleolytic function, these ATPases are often referred to as large terminases. Genome-cutting systems also code for a so-called small terminase protein that is likely involved in genome recognition. Despite adopting the “terminase” nomenclature, no enzymatic functions are associated with the small terminase. Neither the exact function of the small terminase, nor how it is incorporated in the motor complex, is well understood.
Linear fixed-length packaging systems are somewhat simpler. In these types of phages, single unit-length genomes are produced rather than the concatenated multi-genomes utilized by genome-cutting phages. Because the polymerases in these phages use a protein primer to initiate replication, the genomes of the fixed-length phages have a protein covalently attached to the 5′-end of each strand of genomic DNA. The ATPase assembles at the portal vertex, engages its DNA substrate by recognizing and binding the terminal protein at a genomic 5′-end, packages until the entire nucleic acid is encapsulated, and then detaches, possibly to assemble at the portal vertex of another empty capsid. The most thoroughly characterized fixed-length packaging system is the bacteriophage φ29 system. Since φ29 packages a unit-length genome, its packaging motor is accordingly simpler. There is no need for a nuclease function to cut the DNA, and thus the φ29 ATPase lacks a fully functional nuclease domain and is only ~ 60% the size of ATPases/large terminases in genome-cutting packaging systems [
16]. Similarly, the portal protein is only ~60% of the size of the portals in the cutting systems [
17], possibly because it does not have or need any structural components to sense that the capsid is full and/or transmit this signal to the rest of the motor. Further, the 5′-terminal protein provides a unique identifying feature of genomic DNA, and thus these phages do not use a small terminase for genome recognition. Thus, φ29 is in some sense a stripped-down version of the genome-cutting packaging motors. However, φ29 is by no means simple; structural and single-molecule results indicate that the φ29 packaging motor operates in a highly coordinated manner to regulate the activity of its subunits and to efficiently generate force to translocate DNA [
18,
19,
20,
21]. Additionally, φ29 is unusual in that it requires a virally encoded structural RNA molecule (pRNA) that acts as a scaffold for assembly of a functional ATPase ring, and which is not present in headful packaging systems [
17,
22,
23,
24,
25]. Thus, neither the genome-cutting nor fixed genome-length systems represent the universal essential minimum necessary for operation of a packaging motor.
An additional complicating factor in all current model systems is that in both genome-cutting and unit-length packaging motors, the ATPases form functional rings only by virtue of assembling on the capsid (as in T4) [
26], on the small terminase (as in lambda) [
1,
27], or on the pRNA (as in φ29) [
17,
25]. As a result, rigorous biophysical studies to interrogate inter-subunit interactions are challenging, and structural information regarding the functional ATPase assembly has thus far been limited to fitting X-ray structures of various motor components into low-resolution cryoEM maps of motor complexes assembled on procapsids [
26,
28]. Understanding the molecular basis of subunit coordination would thus be facilitated by the availability of stable ATPase ring assemblies that can be interrogated biophysically and via atomic resolution structural information to learn how the ATPase subunits interact to efficiently translocate DNA.
It would thus be useful to develop a packaging system that is like φ29 in that it has fewer, smaller, and simpler motor components, but which does not need a pRNA to assemble a functional ATPase ring. Such a motor would reflect the most experimentally useful aspects of both types of systems and likely represent the bare minimum necessary to translocate dsDNA. Toward this end, we have characterized the DNA packaging ATPase from the
Lactococcus lactis phage asccφ28 [
29]. Like φ29, asccφ28 packages a unit-length genome that has a terminal protein covalently attached to each 5′-end of its genomic DNA. Hence, asccφ28 also codes for smaller, simpler versions of the portal protein and the ATPase, similar to φ29. However, there is no evidence for a pRNA in its genome sequence. Since a primary function of the pRNA in φ29 is to act as a scaffold for assembly of a functional ATPase ring [
3,
17,
23,
28], we hypothesized that the ATPase from asccφ28 might have evolved to form functional rings on its own. If so, the ATPase from φ28 would be an excellent model for rigorous biophysical and high-resolution structural analysis of functional ATPase ring motors that drive viral dsDNA genome packaging.
Indeed, using a combination of size-exclusion chromatography (SEC), analytical ultra-centrifugation (AUC), small X-ray scattering (SAXS), negative stain transmission electron microscopy (TEM), and preliminary X-ray crystallographic diffraction, we show that recombinantly expressed gene product 11 (gp11) from asccφ28 is a highly soluble and stable decameric assembly, consisting of two pentameric rings related by D5 point group symmetry. Additional kinetic characterization of ATPase binding and hydrolysis shows that the assembly binds and hydrolyzes ATP similarly to the ATPases in φ29 and other phages, but only once the ATPases in these other systems have assembled functional rings on the procapsid. These results suggest that the pentameric rings in the D5 decamer reflect the biological assembly of the gp11 packaging ATPase on asccφ28 procapsids. Hence, the dsDNA packaging motor gp11 from bacteriophage asccφ28 provides a unique opportunity to examine a phage dsDNA-packaging motor apart from other packaging components. Studying an isolated, functional ATPase ring motor will facilitate rigorous biophysical analysis, more straightforward kinetic analysis, and high-resolution structure determination, expediting a comprehensive understanding of viral dsDNA packaging motors and the mechano-chemistry common to the larger superfamily of ASCE ATPases to which they belong.
2. Materials and Methods
2.1. Cloning
A BLAST search identified phage asccφ28
gene product 11 (
gp11) as orthologous to the bacteriophage φ29 packaging ATPase, with 45% sequence similarity and 28% amino acid identity [
30]. A recombinant, codon-optimized gene for
gp11 was synthesized by DNA 2.0 (now ATUM, Newark, CA, US) based on the published sequence of phage asccφ28 (GenBank ascension number EU438902) [
29]. The synthesized gene was inserted into a pET-30a(+) vector (Novagen, Darmstadt, DE) with kanamycin resistance using NdeI and XhoI restriction sites, resulting in a final construct with an additional C-terminal, -LEHHHHHH tag to facilitate purification by metal-affinity chromatography [
31]. DNA sequencing confirmed that the final construct matched the published amino acid sequence of asccφ28 gp11.
2.2. Protein Expression and Purification
E. coli cells transformed with the synthetic construct were grown in Luria–Bertani medium with 30 μg/mL of kanamycin at 37 °C until an optical density at 600 nm was reached ~0.6 (1 cm path). The BL21 (DE3) strain of E. coli (Novagen) was predominantly used, but other strains also show good expression. Maximum yield of soluble protein was obtained by induction with isopropyl-β-D-1-thiogalactopyranoside (IPTG) at a final concentration of 0.1 mM and expression overnight at 18 °C. Cells were harvested by centrifugation at 4000× g for 20 min at 4 °C. Cell pellets were then either stored at −20 °C or immediately resuspended in a lysis/extraction buffer containing 50 mM sodium phosphate, pH 8.1, 400 mM sodium chloride, 5 mM β-mercaptoethanol, and 5 mM imidazole. The typical volume of lysis/extraction buffer used was 25 mL for a pellet harvested from 1 L of cell culture. Resuspended cells were kept on ice and lysed by sonication with the Microson XL200 Ultrasonic Cell Disruptor (Misonix, Farmingdale, NY, US). The sonication regime involved 30 s pulse treatments with 30 s pauses between pulses until the mixture appeared completely homogenous (typically after ~15–20 pulse treatments). The soluble protein fraction was then separated from cell lysate solids by centrifugation at 17,000× g for 30 min at 4 °C. Immobilized metal affinity chromatography was used for the first purification step. The supernatant containing the soluble, 6xHis-tagged protein fraction was incubated for thirty minutes at 4 °C with Talon resin (Clontech Laboratories/Takara Bio Kusatsu, Shiga, JP) that had been previously equilibrated in lysis/extraction buffer, typically employing ~1 mL of resin for ~25 mL of supernatant. The resin-bound protein was then transferred to a chromatography column and eluted in 1 mL fractions with a linear gradient going from 5 mM to 150 mM imidazole. Talon column fractions that contained gp11 were then pooled, concentrated by ultrafiltration, and further purified by size exclusion chromatography on a HiLoad Superdex 200 gel filtration column (GE Healthcare, Chicago, IL, USA). The pooled protein remained in the Talon column elution buffer, while the gel filtration column was equilibrated and run with a buffer containing 50 mM sodium phosphate, pH 8.1, 400 mM sodium chloride, and 1 mM dithiothreitol. Fractions from the single elution peak were assessed by SDS-PAGE and combined for subsequent characterization.
2.3. Size Estimation by Gel Filtration Chromatography
Gel filtration chromatography was used to estimate the protein molecular weight by comparing gp11′s elution profile with protein standards of known molecular weight. A commercial standard containing 5 different molecular weight markers (Biorad, Hercules, CA, USA) was run on the same HiLoad Superdex 200 16/60 column, in buffer containing 400 mM NaCl, 25 mM Tris pH 8.1, and 1 mM DTT. The standards in the mixture were thyroglobulin (670 kDa), γ-globulin (158 kDa), ovalbumin (44 kDa), myoglobin (17 kDa), and vitamin B12 (1.35 kDa). The elution volume for the peak of each standard was used to plot a standard curve, and the elution volume of gp11 was plotted on this curve to estimate the molecular weight.
2.4. Analytical Ultracentrifugation
For this and subsequent procedures, all protein samples were purified through the two-column process and any necessary buffer exchange was done by overnight dialysis at 4 °C. Purified protein was dialyzed into a buffer containing 20 mM Tris-HCl, pH 8.1 at 20 °C, 400 mM NaCl, and 5 mM β-ME, and subsequently centrifuged at 27,000×
g for 20 min at 4 °C. Concentration was determined spectrophotometrically based on Edelhoch’s method, with a predicted extinction coefficient ε280 = 42,400 cm
−1 M
−1 for the 44.6 kDa monomer. [
32,
33]. Analytical ultracentrifugation experiments were performed with an Optima XL-A analytical ultracentrifuge (Beckman Inc., Palo Alto, CA, USA), equipped with absorbance optics and An60Ti rotor; all experiments were carried out at 20° C. Absorbance data were collected by scanning the sample cells at wavelength intervals of 0.001 cm in the step mode with 5 averages per step. Each experiment was conducted at two or three rotor speeds, starting with the lowest and finishing with the highest rotor speed. The sedimentation was assumed to be at equilibrium when consecutive scans, separated by intervals of 8 h, did not indicate any change.
2.5. Solution Small-Angle X-ray Scattering
SAXS experiments were performed at the ALS beamline 12.3.1 (SIBYLS; Lawrence Berkeley National Laboratory Berkeley, CA, US). Scattering intensities I(q) for the protein and buffer samples were recorded as a function of scattering vector q (q = 4πsinθ⁄λ, where 2θ is the scattering angle and λ is the X-ray wavelength). The sample-to-detector distance was set to 1.5 m, which resulted in a q range of 0.01–0.32Å
−1, λ was 1.0 Å, and all experiments were performed at 20 °C. The data collection strategy described by Hura was used in this study [
34]. Briefly, SAXS data were collected for three protein concentrations (1.4, 2.0, and 2.7 mg/mL) and for two buffer samples. For each sample measurement, SAXS data were collected for three X-ray exposures—one long exposure (10 s) flanked by two short exposures (1 s) to assess radiation damage. The buffer scattering contributions were subtracted from the sample scattering data using the program ogreNew (SIBYLS beamline). Data analysis was performed using the program package PRIMUS from the ATSAS suite 2.3 [
35,
36]. Experimental SAXS data obtained for different protein concentrations were analyzed for aggregation and folding state using Guinier and Kratky plots, respectively. The forward scattering intensity I(0) and the radius of gyration RG were evaluated using the Guinier approximation: I(q) ≈ I(0) exp(−q2R
g)
2/3, with the limits qR
g < 1.3. These parameters were also determined from the pair-distance distribution function P(R), which was calculated from the entire scattering patterns via indirect Fourier inversion of the scattering intensity I(q) using the program GNOM [
37]. The maximum particle diameter D
max was also estimated from the P(R). The hydrated volume VP of the particle was computed using the Porod equation: V
P = 2π
2I(0)/Q, where I(0) is the extrapolated scattering intensity at zero angle and Q is the Porod invariant [
38,
39]. The molecular mass of a globular protein can then be estimated from the value of its hydrated volume [
39]. The overall shape of the protein was modeled ab initio by fitting the SAXS data to the calculated SAXS profile of a chain-like ensemble of dummy residues in reciprocal space using the program GASBOR, version 2.2i [
40]. Ten independent calculations were performed with a D5 symmetry restriction (see below for choice of symmetry).
2.6. Electron Microscopy
A continuous carbon-film copper grid (Electron Microscopy Sciences, Hatfield, PA, US) was plasma-cleaned for 30 s. A total of 3 µL of purified protein at 0.5 mg/mL was applied to the grid for 3 s and then blotted. A total of 3 µL of 1% uranyl acetate (aq) stain was applied three times and blotted after each application. Grids were imaged on a JEM 2100 microscope (JEOL Limited, Akishima, Tokyo, JP) equipped with a Gatan US4000 CCD camera (Gatan, Pleasanton, CA, USA).
2.7. Crystallization
Protein in the buffer containing 50 mM sodium phosphate, pH 8.1, 400 mM sodium chloride, and 1 mM dithiothreitol was concentrated via filtration to ~4.5 mg/mL. Several commercial sparse matrix screens, including the Wizard Classic screen (Rigaku Americas Corporation, The Woodlands, TX, USA) and Salt RX screen (Hampton Research Aliso Viejo, CA, USA), were used to determine the initial crystallization conditions using an automated PHENIX crystallization robot and Intelliplate 96-well sitting drop trays (both from Art Robbins Instruments, Sunnyvale, CA, USA). Promising crystals were screened on a Rigaku FR-E++ X-ray generator with an RAXIS-IV++ crystallography system. Subsequent optimization screens were set up manually at various temperatures in 24 well VDX trays using hanging-drop vapor-diffusion geometry.
2.8. X-ray Diffraction Data Collection and Processing
Seleno-methionine derivatized gp11 crystals grown from two distinct optimized conditions (A and B) were shipped to Argonne National Laboratory, and the data collected at the Advanced Photon Source beamline 21 ID-F on a MAR 225 CCD detector. Prior to freezing in liquid nitrogen, samples of crystal form A were soaked in a well solution supplemented with 20% glycerol to prevent formation of crystalline ice. One microliter of cryoprotectant solution was added directly to the crystallization drop and allowed to soak for 10–30 min prior to freezing. Data were collected with an oscillation angle of 1.0° and one image per second, and then indexed, integrated, merged, and scaled with HKL-3000 [
41]. Crystal form B was directly frozen in liquid nitrogen; citrate in the crystallization buffer was of sufficient concentration to act as a cryoprotectant. Data were collected using the helical scan technique, translating the crystal through the X-ray beam to reduce radiation damage. Images were collected every 0.5 s in 0.5° steps.
2.9. Activity Assays
Nucleotide triphoshatase (NTPase) activities for gp11 were determined using a continuous coupled assay with the enzymes pyruvate kinase and lactate dehydrogenase. The reaction scheme involves uptake by pyruvate kinase of the NDP generated by the NTPase, and transfer of the phosphate group from phosphoenolpyruvate (PEP) to regenerate NTP and pyruvate. The released pyruvate is then taken by the enzyme lactate dehydrogenase, along with reduced nicotinamide adenine dinucleotide (NADH), to generate lactate and oxidized nicotinamide adenine dinucleotide (NAD
+). The progression of the reaction is monitored by measuring the reduction in absorbance at 340nm as NADH is converted to NAD
+. The reaction mixture contained 0.1 µM gp11, 10 U/mL rabbit muscle pyruvate kinase, 15 U/mL rabbit muscle lactate dehydrogenase, 2 mM PEP, and 0.3 mM NADH. Reactions were carried out in a 50 mM Tris pH 7.5, 72 mM KCl, and 7.2 mM MgSO
4 (TKM) buffer, as first described in [
42]; all reagents were purchased from Sigma Aldrich (St. Louis, MO, US). Data were analyzed as the simplest case described by the Michaelis–Menten equation, where NTP is rate limiting, and the steady-state rate of ATP hydrolysis, v, is given by Equation (1).
4. Discussion
Genome packaging motors have been studied extensively in several dsDNA bacteriophage systems. The proposed motor mechanisms invoke complex multi-step processes that require strict coordination between the motor components to efficiently generate force. However, the molecular basis of subunit coordination and force generation remains unknown, largely due to challenges in structurally and biophysically characterizing the oligomeric ATPase ring motors that drive packaging. In currently available phage packaging systems, these ATPases typically have low solubility and are present only as monomers in solution. Thus, it is not possible to directly observe and interrogate the intermolecular interactions that regulate or coordinate ATP hydrolysis, force generation, or DNA translocation. Here, we report the biochemical, biophysical, and preliminary structural characterization of gp11, a DNA packaging ATPase from bacteriophage asccφ28.
Phage asccφ28 is genetically and morphologically similar to the well-studied bacteriophage φ29, yet there is no evidence for the unusual pRNA molecule that serves as a scaffold for assembly of a functional ATPase ring motor in φ29. Hence, we hypothesized that gp11 might assemble as functional ATPase rings in solution. Otherwise, asccφ28 is like φ29: gp11 and the asccφ28 portal protein are smaller and functionally simpler than their counterparts in genome-cutting packaging phages. Hence, asccφ28′s dsDNA encapsidation machinery may represent the minimum functional assembly of a viral dsDNA genome packaging motor. These properties, compositional simplicity and assembly of a functional ATPase ring motor, should facilitate detailed structural, biochemical, and biophysical analysis of a viral dsDNA packaging motor.
A recombinant, codon-optimized gene for gp11 was synthesized and expressed in E. coli with a C-terminal hexa-histidine tag to facilitate purification via immobilized metal affinity chromatography. After elution on a cobalt-containing Talon column, size-exclusion chromatography indicated that the protein assembled as a multimer in solution. Comparison to a standard elution curve of proteins with known molecular weight indicated a possible weight for the assembly ranging between 158 and 670 kDa. Subsequent sedimentation velocity and equilibrium analysis showed that gp11 is best represented as a single species in solution, with a molecular weight of ~445 kDa, which would correspond to a decameric assembly. Estimation of the Porod volume via small angle X-ray scattering confirmed this molecular weight, and further analysis of low angle scattering data provided additional shape information, suggesting the assembly formed a cylinder, likely prolate in shape but possibly oblate. Negative stain EM allowed us to distinguish between these possibilities, clearly showing a prolate cylinder. Interestingly, while all biophysical mass and volume estimates pointed toward a decameric assembly, the dimensions of the asymmetric units in the two different X-ray crystallographic space groups could only accommodate between 4 and 7 copies of the ~45 kDa gp11 monomer. However, both space groups have crystallographic 2-fold axes of symmetry, suggesting that a molecular 2-fold symmetry axis is incorporated into global crystallographic symmetry. Hence, the simplest explanation parsimonious with the aggregate data is that gp11 is a decamer in solution, with approximate D5 molecular symmetry; i.e., an assembly consisting of two pentameric rings related by 2-fold symmetry.
While we cannot rule out the possibility that gp11 actually functions as a decamer when assembled on procapsids, we suspect that the dimerization of the two pentameric rings is an artifact of over-expression in the absence of asccφ28 procapsids. A significant argument against a biologically functional decamer is that the 2-fold symmetry axis relating the two pentameric rings requires that the two rings face opposite directions, either head-to-head or tail-to-tail. Hence, if both rings were to interact with DNA, the directions of imposed force would seem to cancel each other as the motor plays tug-of-war with the dsDNA substrate. Further, it is well established that the gp11 homologue in the closely related bacteriophage φ29 is a pentamer, as are the packaging ATPases in the more distantly related T4, T7, and P74-26 bacteriophages, suggesting that the biological assembly is a single pentameric ring.
Biochemical analysis of gp11 suggests that it binds and hydrolyzes ATP with affinities and rates similar to the functional ring forms of these other phage packaging ATPases. With the exception of bacteriophage lambda [
27], recombinantly expressed packaging ATPases in other phages are monomeric, and are unable to bind and/or hydrolyze ATP when expressed recombinantly. Only upon oligomerization as pentameric rings on their respective procapsids are these enzymes able to bind and hydrolyze ATP. Hence, the comparable ATP affinities and turn-over rates observed for recombinantly expressed gp11 suggests that the assembled pentameric rings reflect the biological assembly of the packaging ATPase on the procapsid.