<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" article-type="research-article"><?properties open_access?><?properties manuscript?><front><journal-meta><journal-id journal-id-type="nlm-journal-id">101186374</journal-id><journal-id journal-id-type="pubmed-jr-id">31761</journal-id><journal-id journal-id-type="nlm-ta">Nat Struct Mol Biol</journal-id><journal-id journal-id-type="iso-abbrev">Nat. Struct. Mol. Biol.</journal-id><journal-title-group><journal-title>Nature structural &#x00026; molecular biology</journal-title></journal-title-group><issn pub-type="ppub">1545-9993</issn><issn pub-type="epub">1545-9985</issn></journal-meta><article-meta><article-id pub-id-type="pmid">30076408</article-id><article-id pub-id-type="pmc">6185745</article-id><article-id pub-id-type="doi">10.1038/s41594-018-0096-7</article-id><article-id pub-id-type="manuscript">NIHMS977255</article-id><article-categories><subj-group subj-group-type="heading"><subject>Article</subject></subj-group></article-categories><title-group><article-title>Atomic insights into the genesis of cellular filaments by globular proteins</article-title></title-group><contrib-group><contrib contrib-type="author" equal-contrib="yes"><name><surname>McPartland</surname><given-names>Laura</given-names></name><xref ref-type="aff" rid="A1">1</xref></contrib><contrib contrib-type="author" equal-contrib="yes"><name><surname>Heller</surname><given-names>Danielle M.</given-names></name><xref ref-type="aff" rid="A1">1</xref></contrib><contrib contrib-type="author"><name><surname>Eisenberg</surname><given-names>David S.</given-names></name><xref ref-type="aff" rid="A2">2</xref><xref ref-type="aff" rid="A3">3</xref><xref ref-type="aff" rid="A4">4</xref><xref ref-type="aff" rid="A5">5</xref><xref ref-type="aff" rid="A6">6</xref></contrib><contrib contrib-type="author"><name><surname>Hochschild</surname><given-names>Ann</given-names></name><xref ref-type="aff" rid="A1">1</xref><xref rid="CR1" ref-type="corresp">*</xref></contrib><contrib contrib-type="author"><name><surname>Sawaya</surname><given-names>Michael R.</given-names></name><xref ref-type="aff" rid="A2">2</xref><xref ref-type="aff" rid="A3">3</xref><xref ref-type="aff" rid="A4">4</xref><xref ref-type="aff" rid="A5">5</xref><xref ref-type="aff" rid="A6">6</xref><xref rid="CR1" ref-type="corresp">*</xref></contrib></contrib-group><aff id="A1"><label>1</label>Department of Microbiology and Immunobiology, Harvard Medical School, Boston, Massachusetts, United States of America.</aff><aff id="A2"><label>2</label>Department of Biological Chemistry, University of California, Los Angeles, Los Angeles, California, United States of America.</aff><aff id="A3"><label>3</label>Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, California, United States of America.</aff><aff id="A4"><label>4</label>Howard Hughes Medical Institute, University of California, Los Angeles, Los Angeles, California, United States of America.</aff><aff id="A5"><label>5</label>Molecular Biology Institute, University of California, Los Angeles, Los Angeles, California, United States of America.</aff><aff id="A6"><label>6</label>UCLA-DOE Institute for Genomics and Proteomics, University of California, Los Angeles, Los Angeles, California, United States of America.</aff><author-notes><fn fn-type="con" id="FN1"><p id="P1">Author contributions</p><p id="P2">L.M. designed the study, performed biochemical experiments, and contributed to writing the manuscript. D.M.H. designed the study, performed biochemical experiments, and contributed to writing the manuscript. A.H. designed the study and contributed to writing the manuscript. M.R.S. determined and analyzed the crystal structures, and contributed to writing the manuscript.</p></fn><corresp id="CR1"><label>*</label>Correspondence to: <email>ann_hochschild@hms.harvard.edu</email>; <email>sawaya@mbi.ucla.edu</email></corresp></author-notes><pub-date pub-type="nihms-submitted"><day>16</day><month>8</month><year>2018</year></pub-date><pub-date pub-type="epub"><day>03</day><month>8</month><year>2018</year></pub-date><pub-date pub-type="ppub"><month>8</month><year>2018</year></pub-date><pub-date pub-type="pmc-release"><day>03</day><month>2</month><year>2019</year></pub-date><volume>25</volume><issue>8</issue><fpage>705</fpage><lpage>714</lpage><!--elocation-id from pubmed: 10.1038/s41594-018-0096-7--><permissions><license><license-p>Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use:<uri xlink:type="simple" xlink:href="http://www.nature.com/authors/editorial_policies/license.html#terms">http://www.nature.com/authors/editorial_policies/license.html#terms</uri></license-p></license></permissions><abstract id="ABS1"><p id="P3">Self-assembly of proteins into filaments, such as actin and tubulin filaments, underlies essential cellular processes in all three domains of life. The early emergence of filaments in evolutionary history suggests that filament genesis might be a robust process. Here we describe the fortuitous construction of GFP fusion proteins that self-assemble as fluorescent polar filaments in <italic>Escherichia coli</italic>. Filament formation is achieved by appending as few as 12 residues. Crystal structures reveal that the protomers each donate an appendage to fill a groove between two following protomers along the filament. This exchange of appendages resembles runaway domain swapping but is distinguished by higher efficiency because monomers cannot competitively bind their own appendages. Ample evidence of this &#x0201c;runaway domain coupling&#x0201d; mechanism in nature suggests it could facilitate the evolutionary pathway from globular protein to polar filament, requiring a minimal extension of protein sequence and no significant refolding.</p></abstract></article-meta></front><body><sec id="S1"><title>Introduction</title><p id="P4">The self-association of proteins to form filaments is the basis for the formation of essential cytoskeletal elements such as actin and tubulin in all three domains of life. Cytoskeletal elements participate in a variety of critical cellular processes including DNA segregation, cell division, morphogenesis, cell motility, maintenance of cell shape, and the transport of cellular cargoes. The identification of diverse families of actin and tubulin homologs in prokaryotes over the last 20 years<sup><xref rid="R1" ref-type="bibr">1</xref></sup> has led to the suggestion that these are ancient proteins that predate the split between bacteria, archaea and eukarya<sup><xref rid="R2" ref-type="bibr">2</xref>,<xref rid="R3" ref-type="bibr">3</xref>,<xref rid="R4" ref-type="bibr">4</xref></sup>. Furthermore, the recent discovery that certain metabolic enzymes can polymerize as filaments has provoked the suggestion that an original rudimentary cytoskeleton may have evolved in an ancestral cell through the repurposing of enzyme polymerization<sup><xref rid="R5" ref-type="bibr">5</xref>,<xref rid="R6" ref-type="bibr">6</xref>,<xref rid="R7" ref-type="bibr">7</xref>,<xref rid="R8" ref-type="bibr">8</xref></sup>.</p><p id="P5">The early emergence of filaments in biological history raises the question of what structural mechanisms are capable of establishing filament-competent interfaces, and how facile is their evolution. One mechanism requiring few specific structural features other than a globular fold is runaway 3-dimensional domain swapping. By this mechanism, the intermolecular interface between successive protomers in the filament recapitulates an intramolecular interface that exists within the isolated monomer<sup><xref rid="R9" ref-type="bibr">9</xref>,<xref rid="R10" ref-type="bibr">10</xref></sup>. Thus, a domain of the protein, usually residing at either the N- or C-terminus and connected to the remainder of the protein by a flexible linker (the so-called hinge-loop), exchanges its intramolecular contacts for otherwise identical intermolecular contacts, an exchange that usually occurs only under non-native conditions. Runaway domain swapping occurs in a unidirectional fashion, with each subunit donating a domain to the next, leading to the formation of an open-ended protofilament. Examples include fibrils of a designed 3-&#x003b1;-helix bundle<sup><xref rid="R11" ref-type="bibr">11</xref></sup>, filaments of &#x003b1;<sub>1</sub>-antitrypsin<sup><xref rid="R12" ref-type="bibr">12</xref></sup>, and ribbons of &#x0201c;cab&#x0201d;-type carbonic anhydrase<sup><xref rid="R13" ref-type="bibr">13</xref></sup>. Runaway domain swapping has also been evidenced in some amyloid fibrils<sup><xref rid="R10" ref-type="bibr">10</xref>,<xref rid="R14" ref-type="bibr">14</xref>,<xref rid="R15" ref-type="bibr">15</xref>,<xref rid="R16" ref-type="bibr">16</xref>,<xref rid="R17" ref-type="bibr">17</xref></sup>. (Note, we consider fibrils and filaments to be equivalent in the assembly hierarchy &#x02013; a level more complex than protofilaments &#x02013; and we use the term most closely associated with the particular system discussed.)</p><p id="P6">A simpler, more obvious mechanism for filament formation is the stacking of domains in an end-to-end fashion. Other than runaway domain swapping and end-to-end stacking, no other naturally occurring mechanisms for filament formation have been formally categorized despite the recent rapid increase in the number of filament structures determined.</p><p id="P7">Here we describe a nested set of GFP fusion proteins with a shared ability to self-assemble as fluorescent filaments in bacterial cells. Remarkably, these filaments typically span the entire length of the cell. The smallest of these fusion proteins bears just 12 non-native amino acids at the C-terminus of GFP. Structure determination by x-ray crystallography revealed that these appended residues form an &#x003b1;-helix that extends away from the GFP domain (a &#x003b2;-barrel) through a hinge loop and docks into a groove formed by the following two &#x003b2;-barrels in the protofilament. We call this filament-forming mechanism &#x0201c;runaway domain coupling&#x0201d; because the protomer-protomer interfaces involve complementation between distinct donor and acceptor domains. It is distinct from end-to-end domain stacking where the protomer-protomer interfaces reside entirely within a single domain type. It is also distinct from runaway domain swapping, because the helix-in-groove complementation cannot be reproduced in the monomer. The fact that runaway domain coupling arose fortuitously suggests that the genesis of coupled interfaces is facile, an inference supported by our discovery of their occurrence in numerous natural filaments. We propose that the chances for such randomly generated surfaces to align in a productive interface are enhanced by a flexible hinge loop that connects the two domains of each protomer. Our findings illustrate a pathway whereby a non-polymerizing protein might evolve the ability to self-assemble as protofilaments and filaments.</p></sec><sec id="S2"><title>Results</title><sec id="S3"><title>GFP-RNase A fusion proteins form fluorescent rods in bacterial cells.</title><p id="P8">Previous work from the Eisenberg group<sup><xref rid="R17" ref-type="bibr">17</xref>,<xref rid="R18" ref-type="bibr">18</xref></sup> demonstrated that RNase A variants containing short amyloidogenic peptide insertions in the C-terminal hinge loop could form amyloid-like fibrils <italic>in vitro</italic>. We sought to investigate whether or not these RNase A variants could also access the fibrillar form when produced in <italic>E. coli</italic> cells. To do this, we constructed a plasmid vector that directed the inducible synthesis of RNase A (with or without an inserted amyloidogenic peptide sequence; see Methods) fused to a monomeric variant of GFP (<xref rid="F1" ref-type="fig">Fig. 1a</xref> ). Cells containing these plasmids were examined by fluorescence microscopy at various times after the induction of fusion protein synthesis. Whereas most cells contained diffuse fluorescence, we observed fluorescent rod structures in a small minority (1 to 2%) of cells. Such rod-containing cells typically contained a single rod (<xref rid="F1" ref-type="fig">Fig. 1b</xref>, left). These structures were observed regardless of whether or not the RNase A moiety contained the amyloidogenic peptide insertion (<xref rid="F1" ref-type="fig">Fig. 1b</xref>, left and data not shown). Furthermore, we found that refrigeration dramatically stimulated formation of the rod-like structures. When we grew cells overnight after the induction of fusion protein synthesis and then incubated the cell cultures for 2 hours at 4<sup>o</sup>C, we detected fluorescent rod structures in essentially all of the cells (<xref rid="F1" ref-type="fig">Fig. 1b</xref>, right).</p><p id="P9">To define the sequence requirements for intracellular rod formation, we constructed a series of truncations from the C-terminus of RNase A (<xref rid="F1" ref-type="fig">Fig. 1a</xref>). We found that as few as eight N-terminal residues of RNase A (separated from the C-terminus of GFP by a three-residue linker) sufficed to permit rod formation. When cell cultures containing this construct were induced overnight and incubated for 2 hours at 4<sup>o</sup>C, essentially all of the cells contained fluorescent rods (<xref rid="F1" ref-type="fig">Fig. 1d</xref>). In most cases, the rods extended the entire length of the cell.</p></sec><sec id="S4"><title>Rod formation does not depend on the fortuitous presence of an HP segment.</title><p id="P10">We considered the possibility that the propensities of the constructed GFP fusion proteins to form fluorescent rod structures was due to the fortuitous presence of an amino acid segment with high amyloid fiber-forming propensity (termed an HP segment) either spanning the fusion junction or contained wholly within the appended sequence. We discovered an HP segment comprising the 3-residue linker and the first 3 residues of RNase A (AMAKET) that was shared by our rod-forming constructs (<xref rid="F1" ref-type="fig">Fig. 1a</xref> and <xref rid="SD1" ref-type="supplementary-material">Fig S1</xref>)<sup><xref rid="R19" ref-type="bibr">19</xref></sup>.</p><p id="P11">Results from mutagenesis experiments were inconsistent with the hypothesis that rod formation was due to the amyloid-forming propensity of the HP segment located at the N-terminal end of the appended peptide sequence. Using a fusion protein containing RNase A residues 1&#x02013;14 as our reference construct, we compared the rod-forming behaviors of the reference fusion [GFP-A<bold>M</bold>A-RNase(1&#x02013;14)] and several variants bearing replacements of the linker methionine residue (residue 240 in the fusion sequence) with A, E, K, L or Q (<xref rid="F1" ref-type="fig">Fig. 1c</xref>, <xref rid="SD1" ref-type="supplementary-material">Fig. S1</xref>). Whereas the reference construct gave rise to rods in essentially all of the cells after overnight induction of fusion protein synthesis (<xref rid="F1" ref-type="fig">Fig. 1c</xref>), the variant encoding AQAKET, which has a higher predicted amyloid-forming propensity than AMAKET, gave rise to no rod-containing cells even after induced overnight cultures were refrigerated for an additional 24 hours (<xref rid="F1" ref-type="fig">Fig. 1 c</xref>). Among the remaining variants, only that encoding ALAKET gave rise to rods (which formed efficiently only after refrigeration) (<xref rid="F1" ref-type="fig">Fig. 1c</xref>), suggesting that a hydrophobic residue was required at the second linker position (see below). Overall, our analysis of this set of single amino acid substitution variants suggests that rod formation is not due to the amyloid-forming propensity of the appended peptide. Conclusive evidence that the rods lack amyloid architecture follows from diffraction analysis and crystal structure determination presented below.</p></sec><sec id="S5"><title>Crystal structures of GFP fused to eight RNase A residues.</title><p id="P12">To elucidate the intermolecular interactions that guide assembly of GFP into intracellular fluorescent rods, we crystallized a fusion protein containing RNase A residues 1&#x02013;8 [GFP-AMA-RNase(1&#x02013;8)] (<xref rid="F1" ref-type="fig">Fig. 1d</xref>), the shortest fusion protein that efficiently formed rods. To facilitate purification of the fusion we appended a C-terminal His<sub>6</sub> tag [GFP-AMA-RNase(1&#x02013;8)-His<sub>6</sub>] after determining that the tag did not alter the ability of the fusion protein to form fluorescent rods (<xref rid="F1" ref-type="fig">Fig. 1e</xref>).</p><p id="P13">We discovered crystals in dozens of conditions, all of which shared the same needle morphology, recapitulating the rods observed in cells (<xref rid="SD1" ref-type="supplementary-material">Fig. S2a</xref>). The needles were typically over 100 &#x003bc;m long, but seldom greater than 5 &#x003bc;m thick. Among the different conditions, we discovered four different crystal forms (P2<sub>1</sub>2<sub>1</sub>2 form1, P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> form 2, C2, and P2<sub>1</sub>) (<xref rid="SD1" ref-type="supplementary-material">Fig. S2b</xref>), collected diffraction data, and solved their structures (<xref rid="T1" ref-type="table">Table 1</xref>).</p><p id="P14">Remarkably, the structures obtained from the four crystal forms revealed essentially identical protofilament architectures, suggesting this architecture is likely to persist under many other conditions, including those within the cell. In each crystal form, the protofilament is assembled with 2<sub>1</sub> screw symmetry, meaning that each protomer is related to its closest neighbor protomer by a 180&#x000b0; rotation around the protofilament axis and translation along the axis by half the pitch length (i.e. rise per turn along the protofilament axis) (<xref rid="F2" ref-type="fig">Fig 2a and c</xref>). In each crystal form, this pitch length is approximately 51.4 &#x000c5;, (less than 1 &#x000c5; variation) and corresponds to one of the unit cell dimensions (a=51.2 &#x000c5; in P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> form 1, a=51.1 &#x000c5; in P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> form 2, b=51.3 &#x000c5; in C2, and b=51.8 &#x000c5; in P2<sub>1</sub>) (<xref rid="F3" ref-type="fig">Fig. 3b</xref>). There is no significant structural variation in protofilament architecture among crystal forms (<xref rid="F2" ref-type="fig">Fig. 2b</xref>). Indeed, the root mean square deviation is less than 0.8 &#x000c5; between protofilament segments from different crystals (composed of three consecutive GFP-AMA-RNase(1&#x02013;8)-His<sub>6</sub> protomers totaling 5667 atoms). Thus, the robust nature of this protofilament assembly is evidenced by the high degree of similarity in protofilament structures (<xref rid="F2" ref-type="fig">Fig. 2b</xref>) despite clear differences in packing between protofilaments (<xref rid="F3" ref-type="fig">Fig. 3b</xref>).</p><p id="P15">A closer view of the protofilament reveals that one structural feature in particular stabilizes protofilament assembly--a new, fusion-created, C-terminal helix which fastens together neighboring pairs of protomers by binding in a groove between them (<xref rid="F2" ref-type="fig">Fig. 2a, c, and d</xref>). This new C-terminal helix comprises seventeen residues which span a sequence derived from three different origins: six residues from the natural GFP C-terminus (233-MDELYK-238), a three-residue linker (239-AMA-241), and eight residues from RNase (242-KETAAAKF-249) (<xref rid="F2" ref-type="fig">Fig. 2d</xref>). The six residues contributed by GFP are normally disordered in crystal structures of natural-length GFP. The propensity of this segment to form a helix was evidently enhanced by its fusion with the RNase residues 1&#x02013;8, which also form a helix in native RNase.</p><p id="P16">The new, fusion-created helix extends up to 40 &#x000c5; away from its attachment to the barrel&#x02019;s last strand, enabling the helix to reach a binding groove that is formed by the following two protomers in the protofilament. This 40 &#x000c5; reach is thus directed by the relative positions of three successive protomers in the protofilament; the helix originates from protomer i, and the groove is formed between protomers i+1 and i+2. The ability of the helix to reach this groove is facilitated by the extended conformation adopted by GFP residues 228-GITHG-232, which connect the helix to strand 11, the last strand in the GFP barrel (<xref rid="F2" ref-type="fig">Fig. 2d</xref>).</p><p id="P17">Protomer i+1 forms one wall of the groove (<xref rid="F2" ref-type="fig">Fig. 2e</xref>, brown surface). This wall comprises the surfaces of strands 3, 10, and 11 near the waist of the barrel, and the helix docks tangentially across these. The helix-in-groove interface is snug and defined almost entirely by side-chain-to-side-chain contacts. A stripe of hydrophobic side-chains (M233, L236, Y237, M240, and T244) lines the length of the &#x003b1;-helix and these embed between side-chains on the surface of the barrel (T43, K45, A206, S208, V219, and L221). Loss of this hydrophobic interface upon substituting M240 with polar, charged, or small residues, such as Q, E, K, or A, would likely explain our observation that these substitutions interfere with efficient rod formation (<xref rid="F1" ref-type="fig">Fig. 1c</xref>) (<xref rid="SD1" ref-type="supplementary-material">Fig. S3</xref>). This interface between helix i and protomer i+1 buries a total of 727 &#x000c5;<sup>2</sup> on the two surfaces (<xref rid="SD1" ref-type="supplementary-material">Fig. S4</xref>).</p><p id="P18">Protomer i+2 forms the opposite wall of the groove. This wall comprises the end of strand 5 and residues at one cap of the barrel (<xref rid="F2" ref-type="fig">Fig. 2e</xref>, blue surface). Specifically, residues L7, P89, and F114 create a small hydrophobic pocket into which docks the side chain of A245 of the helix of protomer i. This contact is flanked on both sides by salt bridges (protomer i+2 residues E90 and D117 with helix i residues K242 and K248, respectively). This interface between protomer i+2 and helix i buries a total of 426 &#x000c5;<sup>2</sup> on the two surfaces (<xref rid="SD1" ref-type="supplementary-material">Fig. S4</xref>). Overall, the shape complementarity between the helix and groove (0.78) indicates that the fit is even tighter than typically observed among antibody-antigen complexes<sup><xref rid="R20" ref-type="bibr">20</xref></sup>.</p><p id="P19">Direct contact between the sides of neighboring GFP barrels strengthens and rigidifies the contiguous helix-in-groove interaction (<xref rid="F2" ref-type="fig">Fig. 2f</xref>). The interface is approximately flat. Side-chains from strands 1, 2, 5, and 6 of protomer i+1 nestle with side-chains from strands 3, 10 and 11 of protomer i+2. The interactions are primarily polar rather than non-polar. The area buried by the two surfaces of this interface is 901 &#x000c5;<sup>2</sup> (<xref rid="SD1" ref-type="supplementary-material">Fig. S4</xref>). This patch of direct contact between barrels appears to be adventitious; it does not correspond in any way to the natural dimer interface of wild-type GFP<sup><xref rid="R21" ref-type="bibr">21</xref></sup>.</p><p id="P20">The combined surface area buried by protomers i, i+1, and i+2 appears large enough to maintain protofilament architecture in solution, free of the crystal lattice. Coordination of neighboring protomers by the C-terminal helix buries a total of 1153 &#x000c5;<sup>2</sup>. Direct contact between protomers buries an additional 901 &#x000c5;<sup>2</sup> of surface area. Either interface by itself would likely be insufficient to support protofilament assembly. But, together, these total to 2054 &#x000c5;<sup>2</sup>, which exceeds the 1712 &#x000c5;<sup>2</sup> threshold value that discriminates between biological and artificial dimers<sup><xref rid="R22" ref-type="bibr">22</xref></sup>. The division of the protomer-protomer interface over three surfaces implies cooperativity in protofilament assembly.</p></sec><sec id="S6"><title>Single amino acid substitution disrupts rod formation.</title><p id="P21">As a further test of whether or not the intermolecular interface observed in the crystals is relevant to rod formation in the cell, we mutated a solvent exposed residue on the GFP barrel that was predicted to contribute significantly to the interface formed by the binding of the C-terminal alpha helix of the neighboring protomer. Specifically, we introduced a charged residue (E) in place of V219, which should disfavor the docking of M233, L236, and M240 in the C-terminal alpha helix (<xref rid="SD1" ref-type="supplementary-material">Fig. S3</xref>). We then induced the synthesis of GFP-AMA-RNase(1&#x02013;8)-His<sub>6</sub> with or without the V219E substitution in the GFP moiety. Whereas the unmutated fusion protein formed rod structures in essentially all of the cells after overnight induction followed by incubation for 2 hours at 4<sup>o</sup>C, the mutant fusion protein failed to form rods even after overnight cultures were refrigerated for several days, and instead gave rise to diffuse fluorescence (<xref rid="F1" ref-type="fig">Fig. 1f</xref> and <xref rid="SD1" ref-type="supplementary-material">Fig. S5</xref>). These findings provide strong support for the hypothesis that the intermolecular interface observed in the four crystal forms is also the basis for fluorescent rod formation in bacterial cells.</p></sec><sec id="S7"><title>The pattern of protofilament bundling in intracellular rods resembles that observed in the C2 crystal form.</title><p id="P22">We performed <italic>in cellulo</italic> diffraction experiments to evaluate the pattern of protofilament bundling in fluorescent rods produced in <italic>E. coli</italic>. X-ray diffraction patterns from <italic>E. coli</italic> producing GFP-AMA-RNase A(1&#x02013;8)-His<sub>6</sub> revealed six reflections with Bragg spacings between 55 &#x000c5; and 25 &#x000c5; (<xref rid="F3" ref-type="fig">Fig. 3c</xref>). Remarkably, these spacings observed from intracellular filaments coincide with those of the crystal structure of purified GFP-AMA-RNase(1&#x02013;8)-His<sub>6</sub> in space group C2 (<xref rid="F3" ref-type="fig">Fig. 3e</xref>). Moreover, the relative intensities of the six observed reflections correlate between the observed and calculated patterns, as well as the geometric disposition with respect to the meridional and equatorial directions (<xref rid="F3" ref-type="fig">Fig. 3f</xref>). Structurally, we find that the Bragg planes for these six reflections coincide well with the spacings between protomers in the crystal (<xref rid="SD1" ref-type="supplementary-material">Fig. S6a-g</xref>). The sum of these six Fourier terms produces a low-resolution image of the protofilament in close agreement with the crystal structure (<xref rid="SD1" ref-type="supplementary-material">Fig. S6h</xref>). These correlations, like matching fingerprints, indicate that the protofilaments bundle together in the fluorescent rods in the same pattern as observed in the C2 crystal (<xref rid="SD2" ref-type="supplementary-material">Supplementary note 1</xref>).</p><p id="P23">Poor correlation is observed between the <italic>in cellulo</italic> diffraction and the remaining three crystal forms (P2<sub>1</sub>, P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> form 1 and P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> form 2; <xref rid="F3" ref-type="fig">Fig. 3d</xref>). This poor correlation is not an indication of dissimilarity among the protofilament structures. Recall that all four crystal forms are composed of the same protofilament structure (<xref rid="F2" ref-type="fig">Fig. 2b</xref> and <xref rid="F3" ref-type="fig">3a</xref>). Rather, the differences among the simulated diffraction patterns arise from alternative packing interactions between structurally identical protofilaments (<xref rid="F3" ref-type="fig">Fig. 3b</xref>). Apparently, the cellular milieu favors the same packing of protofilaments as the C2 crystal form.</p><p id="P24">The symmetry of the protofilaments themselves likely makes them prone to crystal-like bundling, because it corresponds to one of the most prevalent symmetry elements in crystallography&#x02014;a two-fold screw axis. Fibers of hemoglobin E6V are also rigidified by crystal-like bundling<sup><xref rid="R23" ref-type="bibr">23</xref></sup>, and also feature a two-fold screw protofilament symmetry<sup><xref rid="R24" ref-type="bibr">24</xref></sup>. Conversely, many functional protofilaments, such as actin and tubulin, have resisted crystallization because their symmetry is imprecise or does not coincide with common crystallographic symmetry elements.</p></sec></sec><sec id="S8"><title>Discussion</title><p id="P25">We found that appending as few as twelve residues to the C-terminus of GFP enabled the fortuitous assembly of GFP filaments both <italic>in vivo</italic> and <italic>in vitro</italic>. These filaments were detectable as fluorescent rods that spanned the length of bacterial cells harboring the fusion construct. The purified fusion protein readily formed needle-like crystals and structure determination revealed the basis for filament formation. Specifically a fusion-created C-terminal &#x003b1;-helix extends away from the barrel to which it is attached and nestles into a groove that is formed by the barrels of two successive protomers along the protofilament axis.</p><p id="P26">This helix-in-groove linkage between protomers resembles runaway domain swapping<sup><xref rid="R10" ref-type="bibr">10</xref></sup>, a mechanism proposed for the assembly of disease-related fibrils. By this mechanism, each protomer swaps a part of its structure into an identical protomer along the protofilament (<xref rid="F4" ref-type="fig">Fig. 4e</xref>). The swapped part is termed the &#x0201c;swapping domain&#x0201d; and may be as small as a single &#x003b1;-helix or &#x003b2;-strand; the receiving part is termed the &#x0201c;complementary domain&#x0201d;. The swapping event begins when a folded or &#x0201c;closed&#x0201d; monomer (<xref rid="F4" ref-type="fig">Fig. 4c</xref>) opens up to expose its own swapping domain to another protomer (<xref rid="F4" ref-type="fig">Fig. 4d</xref>). The flexibility required to open the monomer and execute the swap is afforded by a hinge loop that links the two domains. In GFP-RNase(1&#x02013;8) protofilaments, the C-terminal helix acts as the swapping domain, the GFP barrel acts as the complementary domain, and intervening residues 229&#x02013;232 form the hinge loop.</p><p id="P27">Other properties of the GFP-RNase filaments, such as facile nucleation and mechanical rigidity, seem incongruous with runaway domain swapping. Indeed, GFP-RNase(1&#x02013;8) filaments assemble spontaneously <italic>in vivo</italic> within hours, unlike runaway domain swapped filaments which may take years to assemble <italic>in vivo</italic>, or require high concentrations, harsh conditions, proteolytic cleavage, or multiple days to nucleate <italic>in vitro</italic><sup><xref rid="R14" ref-type="bibr">14</xref>, <xref rid="R25" ref-type="bibr">25</xref></sup> Furthermore, GFP-RNase filaments are mechanically strong and rigid. In some instances, the rods connect two deeply constricted cells, implying that septum formation was incapable of severing a preexisting filament (<xref rid="F1" ref-type="fig">Fig. 1g</xref>). In other instances, the ends of growing GFP-RNase rods appear to impose bulges at the cell poles (<xref rid="F1" ref-type="fig">Fig. 1h</xref>). These distortions of the cell shape are reminiscent of those observed in red blood cells upon aggregation of hemoglobin bearing a single glutamate-to-valine substitution, E6V, associated with sickle cell anemia. This mutation causes hemoglobin tetramers to assemble into rigid fibrils that distort the normally disk-shaped cell to a sickle shape<sup><xref rid="R26" ref-type="bibr">26</xref></sup>. In contrast, runaway domain-swapped filaments are connected by flexible hinge loops which typically confer the appearance of &#x0201c;beads-on-a-string.&#x0201d;<sup><xref rid="R27" ref-type="bibr">27</xref></sup>. These incongruities prompted us to examine the GFP-RNase protofilament structure for attributes that differ from runaway domain swapping and facilitate nucleation and mechanical rigidity.</p><p id="P28">Consideration of the monomeric state illuminates the defining attribute that distinguishes GFP-RNase filament formation from runaway domain swapping and explains why GFP-RNase filaments grow more robustly than runaway domain swapped filaments; that is, the GFP-RNase monomer cannot competitively bind its own helical appendage. A defining feature of runaway domain swapping is the high affinity for binding the swapping domain, whether nested within its own protomer (monomeric state) or swapped into a neighboring protomer (filamentous state). The equally deep energetic stability of both the monomeric and filamentous states impedes filament nucleation and growth in two ways: (1) it imposes a large kinetic barrier to the conversion from monomer to filament because the intramolecular domain-domain interface in the monomer must be disrupted to form the equivalent intermolecular domain-domain interface in the filament<sup><xref rid="R28" ref-type="bibr">28</xref></sup>, and (2) it renders the monomeric state an energetically viable competitor of filament nucleation and growth since the two states contain equally stabilizing domain-domain interfaces.</p><p id="P29">In violation of a defining principle of runaway domain swapping, the helix-in-groove interface that stabilizes GFP-RNase protofilaments is unattainable within individual monomers. Indeed, the C-terminal helix of GFP-RNase (donor domain) cannot bind its complementary site within the same monomer (acceptor domain) because the acceptor site is out of reach; the hinge loop is too short (<xref rid="F4" ref-type="fig">Fig. 4g</xref>). Self-complementation is further precluded by that fact that the acceptor site itself is not a single surface, but assembled from two protomers (i+1 and i+2), which respectively contribute residues from opposite faces of the barrel (<xref rid="F2" ref-type="fig">Fig. 2d</xref> and <xref rid="F4" ref-type="fig">Fig. 4j</xref>).</p><p id="P30">The unattainability of a self-complementary donor-acceptor interface within GFP-RNase monomers explains why this protein forms filaments more readily than proteins that undergo runaway domain swapping. Without a means of self-complementation, protomers in the monomeric state cannot attain the same low energy state as do the protomers in the filament, thereby tilting the energy landscape in favor of filament growth (<xref rid="F4" ref-type="fig">Fig. 4f-h</xref>). The absence of stable self-complementation also has the advantage of eliminating the kinetic barrier to filament formation&#x02014;with no donor-acceptor interface in the monomer there is no barrier imposed by its disruption as there would be in runaway domain swapped filaments.</p><p id="P31">We discovered that many natural filaments display donor-acceptor linkages analogous to those observed in the GFP-RNase filaments (<xref rid="SD2" ref-type="supplementary-material">Table S1</xref>). For example, in filaments of nucleocapsid protein from phlebovirus, each protomer donates a helix to an acceptor domain in the neighboring protomer<sup><xref rid="R29" ref-type="bibr">29</xref></sup>. Similarly in pilus rods of fimA protein, each protomer donates an N-terminal strand to its neighboring protomer<sup><xref rid="R30" ref-type="bibr">30</xref></sup>. Moreover, GFP-RNase, nucleocapsid protein, and fimA are all incapable of self-complementation. The mechanism of pilus rod formation has been named &#x0201c;donor-strand exchange&#x0201d;, but a more general name is needed to encompass those filaments assembled by donation of elements other than strands. We refrain from using the previously coined term &#x0201c;obligate domain swapping&#x0201d; <sup><xref rid="R31" ref-type="bibr">31</xref></sup>,<sup><xref rid="R32" ref-type="bibr">32</xref></sup>, because we have shown the mechanism is not a true domain swap. Instead, we coin the term &#x0201c;runaway domain coupling&#x0201d; to describe all filaments like GFP-RNase in which protomers interface through distinct donor and acceptor domains--interfaces which are unattainable in individual monomers.</p><p id="P32">Runaway domain coupling describes many protofilament assemblies, but it does not explain why some, like GFP-RNase, are mechanically rigid. Indeed, a flexible linker is characteristic of this mechanism. Rigidity, on the other hand, is a defining characteristic of domain stacking, in which copies of a single rigid domain stack together in uninterrupted succession (<xref rid="F4" ref-type="fig">Fig. 4a</xref>). In GFP-RNase protofilaments, protomers stack through patches on the rigid barrel domains, thus rigidifying the otherwise flexible domain coupled linkages (<xref rid="F2" ref-type="fig">Fig 2f</xref>). This combination of domain coupling and stacking is also evident in naturally occurring RecA nucleoprotein filaments; even in the absence of nucleic acid, RecA forms filaments with a persistence length of about 100 nm (<xref rid="SD2" ref-type="supplementary-material">Table S1</xref>)(<xref rid="F4" ref-type="fig">Fig. 4i</xref>)<sup><xref rid="R33" ref-type="bibr">33</xref></sup>. In contrast, the runaway domain coupled filaments of phlebovirus nucleocapsid protein have no supplementary connection through domain stacking (<xref rid="F4" ref-type="fig">Fig. 4h</xref>). Accordingly, the filaments give the appearance of a random walk in electron micrographs<sup><xref rid="R29" ref-type="bibr">29</xref></sup>. Hence, domain stacking imparts rigidity to a protofilaments, acting alone or in combination with other mechanisms.</p><p id="P33">Bridging is yet another mechanism of filament formation that contributes to the mechanical rigidity of GFP-RNase protofilaments. Bridging interfaces join nonconsecutive protomers in the protofilament, creating a mutually supportive closed network that includes the intervening &#x0201c;bridged&#x0201d; protomer(s) (<xref rid="F4" ref-type="fig">Fig 4b</xref>). In GFP-RNase protofilaments, a bridging interface exists between protomers i and i+2, as part of the helix-in-groove interaction (<xref rid="F4" ref-type="fig">Fig 4k</xref>). This interface in turn, bolsters contacts between these protomers and the intervening protomer i+1. Bridging is also observed in tropomyosin, which forms a coiled coil bridge across protomers in actin filaments<sup><xref rid="R34" ref-type="bibr">34</xref></sup>, thereby reinforcing stacking between actin domains and increasing the persistence length by two-fold (<xref rid="SD2" ref-type="supplementary-material">Table S1</xref>)<sup><xref rid="R35" ref-type="bibr">35</xref></sup>. Bridging also helps assemble soluble E6V hemolgobin tetramers into cell-deforming rods<sup><xref rid="R24" ref-type="bibr">24</xref></sup> and tubulin protofilaments into microtubules. Cooperativity is implied by bridging because assembly of two protomers enhances binding of a third protomer.</p><p id="P34">Genesis of protofilaments by either runaway domain coupling or domain stacking appears relatively difficult to achieve through a single, <italic>de novo</italic> interface alone. The interface would have to be sufficiently large<sup><xref rid="R22" ref-type="bibr">22</xref></sup> and snug for the protofilament to have a significant lifetime, presumably requiring many iterations of genetic selection. Circular permutation may offer a shortcut by exploiting a preexisting intramolecular interface. For example, circular permutation may shift an N-terminal element to the C-terminus. From its new location, the element may not reach its own complementary site near the N-terminus, but it might easily reach the complementary site of another protomer&#x02014;a perfect setup for runaway domain coupling (<xref rid="SD1" ref-type="supplementary-material">Fig. S7</xref>). Circular permutation occurs with sufficient frequency in nature to be a plausible step in protofilament genesis; according to one study, as many as 47% of all domains examined have a circularly permuted homolog<sup><xref rid="R36" ref-type="bibr">36</xref></sup>. Simpler still, runaway domain coupled protofilaments might also arise from insertion of a stop codon in a multi-domain oligomeric protein, as engineered in the T7 primase-helicase gene<sup><xref rid="R37" ref-type="bibr">37</xref></sup>. Evidently, the removal of some structural elements disrupted cyclic oligomerization in favor of a runaway polymerization.</p><p id="P35">Surprisingly, protofilaments might be generated faster when protomer-protomer interfaces are divided over multiple surfaces, as observed in GFP-RNase protofilaments (<xref rid="F4" ref-type="fig">Fig 4k</xref>), rather than collected in a single surface. Contrary to the elevated level of complexity inferred by the presence of three distinct interfaces in GFP-RNase protofilaments (runaway domain coupling, stacking, and bridging), this combination allows each of the interfaces to be relatively smaller, and therefore relatively easier to achieve. In fact, we found that a majority of crystal structures of monomeric proteins demonstrate intermolecular interfaces as large as the individual interfaces observed in GFP-RNase protofilaments (<xref rid="SD2" ref-type="supplementary-material">Supplementary Note 2</xref>).</p><p id="P36">Flexible linkage between domains, such as occurs in runaway domain coupling, seems advantageous for consolidating these smaller interfaces into a network stable enough to support filament genesis. Consider an early stage of protofilament evolution in which domains can stack, but the stacking interface is too small to be stable. Suppose that this stacking domain is linked to an accessory domain with the potential to evolve a second, stabilizing interface in parallel with the first. If the linker between domains is rigid, only a small portion of the accessory domain may be in proximity of establishing a new interface without disturbing the first. However, if the linker is flexible, as is the case with domain coupling, additional residues may be brought in proximity, improving chances of encountering a snug interface. Indeed, one third of all the naturally occurring filament structures we surveyed involve combinations of runaway domain coupling with stacking (<xref rid="SD2" ref-type="supplementary-material">Table S1</xref>). Notably, runaway domain coupling, stacking, and bridging were revealed recently in CTP synthase<sup><xref rid="R38" ref-type="bibr">38</xref></sup>, a representative of a class of metabolic enzymes that evolved filamentation as a survival response to certain cellular conditions<sup><xref rid="R5" ref-type="bibr">5</xref>,<xref rid="R6" ref-type="bibr">6</xref>,<xref rid="R8" ref-type="bibr">8</xref>,<xref rid="R39" ref-type="bibr">39</xref>,<xref rid="R40" ref-type="bibr">40</xref></sup>.</p><p id="P37">The advantage of flexibility for consolidating stacking interfaces comes with an entropic penalty--the immobilization of the interdomain loop. This penalty partially diminishes the stability of the protofilament, perhaps explaining the temperature sensitivity of filament formation by some of the GFP-RNase constructs (<xref rid="F1" ref-type="fig">Fig. 1b, c</xref>) and the sensitivity of CTP synthase filamentation to cellular conditions. These two filaments also score lower values of interface area and solvation free energy than the majority of naturally occurring filaments (<xref rid="SD2" ref-type="supplementary-material">Table S1</xref>, <xref rid="SD1" ref-type="supplementary-material">Fig. S8</xref>), suggesting another reason for their sensitivity.</p><p id="P38">A strategy of combining runaway domain coupling with stacking may offer new advantages for filament design. Notably, flexible linkers were <italic>not</italic> included in the few examples where globular or helical proteins were successfully engineered to form filaments<sup><xref rid="R11" ref-type="bibr">11</xref>, <xref rid="R41" ref-type="bibr">41</xref>,<xref rid="R42" ref-type="bibr">42</xref>,<xref rid="R43" ref-type="bibr">43</xref>,<xref rid="R44" ref-type="bibr">44</xref>,<xref rid="R45" ref-type="bibr">45</xref>,<xref rid="R46" ref-type="bibr">46</xref></sup>. In all these cases, alignment between protomers was designed computationally, thereby eliminating the need for flexibility to attain alignment. However, the success of most of these designs depended on the presence of a naturally evolved dimerization interface. We propose that a design strategy of combining runaway domain coupling with stacking could open up filamentation to the majority of monomeric proteins, employing interfaces as small as the average crystal packing interface.</p></sec><sec id="S9"><title>Online Methods</title><sec id="S10"><title>Plasmids, Strains, and Cell Growth.</title><p id="P39">GFP fusions were produced from pBR322-derived plasmids under the control of the arabinose-inducible promoter pBAD. Specifically, the constructs depicted in <xref rid="F1" ref-type="fig">Fig. 1</xref> (top to bottom) were produced from plasmids pLM163, pLM168, pLM180, pLM179, pLM177, pLM178, pLM176, pLM187, pLM192, and pLM198. Plasmid pLM136 encodes the full-length RNaseA fusion with the amyloidogenic peptide GGVVIA (derived from A&#x003b2;) inserted between RNaseA residues 113 and 114<sup>18</sup>. Constructs with only eight RNaseA residues included an additional aspartate (D) residue with or without a C-terminal hexa-histidine tag. Overnight cultures of <italic>E. coli</italic> BW27785<sup><xref rid="R49" ref-type="bibr">49</xref>,<xref rid="R50" ref-type="bibr">50</xref></sup> transformed with the appropriate GFP-fusion construct were diluted to an OD600 of 0.02 in 50 mL LB supplemented with carbenicillin (100 &#x003bc;g/mL), grown for 30 min at 30 &#x000b0;C, and induced with L-arabinose (0.2% wt/vol). For experiments in which early stationary phase cultures were examined, cells were harvested after 4 h induction at 30 &#x000ba;C. Otherwise, cultures were induced overnight (~20 h) at 30 &#x000ba;C. Where indicated, both early stationary phase and overnight cultures were refrigerated at 4 &#x000ba;C for 2 h or &#x02265;24 h in order to enhance formation of GFP rods.</p></sec><sec id="S11"><title>Fluorescence Microscopy.</title><p id="P40">Cells were imaged after overnight induction of fusion protein synthesis either without any subsequent incubation or following incubation for the time specified at 4<sup>o</sup>C. Cells were harvested at 3,000 &#x000d7; g, resuspended in PBS, and spotted onto agarose pads (1% wt/vol in PBS; Seakem LE Agarose, Lonza) for visualization with a UplanFLN 100&#x000d7; phase contrast objective on an Olympus BX61 microscope as described elsewhere<sup><xref rid="R51" ref-type="bibr">51</xref></sup>. Images were captured with a CoolSnapHQ camera (Photometrics) and the Metamorph software package, version 6.1 (Universal Imaging). Exposures were typically 50&#x02013;100 ms. Images were cropped and adjusted using Metamorph or ImageJ<sup><xref rid="R52" ref-type="bibr">52</xref></sup>.</p></sec><sec id="S12"><title>Bacterial Extract Preparation and Western Blotting.</title><p id="P41">Cultures (50 mL) were grown as described above, and the cell densities were recorded (OD600). The cultures were centrifuged 10 min at 3,000 &#x000d7; g at 4 &#x000ba;C, and the pellets were resuspended in BugBuster Protein Extraction Reagent (EMD Millipore) to normalize for cell density (800&#x003bc;L of lysis buffer for an OD600 of 0.8.). rLysozme (EMD Millipore) was added to 3,000 units/mL and Omnicleave endonuclease (Epicentre) was added to 200 units/mL; the mixture was incubated, rocking, at room temperature (RT) for 30 min. For SDS-PAGE and Western blot analysis, samples were diluted 1:25 in standard SDS-PAGE loading buffer and equal volumes were loaded. Blots prepared in duplicate were probed with either an anti-His mouse monoclonal antibody (Genscript A00612), to detect His<sub>6</sub>-tagged GFP-fusion protein, or with an antibody specific for the RpoA (&#x003b1;) subunit of <italic>E. coli</italic> RNA polymerase (Neoclone, clone 4RA2), which served as a loading control. Blots were detected using an ECLplus Western Blotting Detection System (GE Heathcare).</p></sec><sec id="S13"><title>Purification of GFP-RNase(1&#x02013;8) for crystallization.</title><p id="P42"><italic>E. coli</italic> strain BW27785 was transformed with pLM192 (<italic>pBAD-gfp-AMA-KETAAAKFD-his</italic><sub><italic>6</italic></sub>) to produce the GFP fusion, GFP-AMA-KETAAAKFD-His<sub>6</sub>, for crystallization. We term this fusion protein GFP-RNase(1&#x02013;8). A culture of LB supplemented with100 &#x003bc;g/mL carbenicillin was inoculated with a single transformed colony and grown at 30 &#x000ba;C overnight. This overnight culture was back diluted to an OD<sub>600</sub> of 0.03 in 1 L of fresh LB medium supplemented with 100 &#x003bc;g/mL carbenicillin and grown at 30 &#x000ba;C 220 rpm until the culture reached mid-log phase (OD<sub>600</sub> 0.5). L-arabinose was added to a final concentration of 0.2%, and the culture was grown at 30 &#x000ba;C at 220 rpm for an additional 5 h. Cells were harvested from a 500 mL aliquot of this culture by centrifugation and stored at &#x02212;80 &#x000ba;C until used. On the day of purification, the cell pellet was thawed on ice, washed once in cold, sterile PBS, and lysed in 8.5 mL BugBuster supplemented with Santa Cruz Complete<sup>&#x02122;</sup> Protease Inhibitor Cocktail Tablet (1 tablet per 50 mL lysis buffer), 5 &#x003bc;L rLysozyme, and 5 &#x003bc;L Omnicleave. The lysate was centrifuged for 20 min at 12,600 rpm to remove cell debris, and the supernatant fraction was incubated with 3 mL pre-washed Ni-NTA resin (Qiagen) at 4 &#x000ba;C for 1 h with rocking. After this incubation step, the resin/supernatant mixture was added to a gravity column and drained. The column was washed three times with a buffer containing 50 mM Hepes pH 7.9, 300 mM NaCl, 5 mM BME, and increasing concentrations of imidazole (0, 20 mM, and 60 mM, respectively). Bound protein was eluted in 8 mL elution buffer (50 mM Hepes pH 7.9, 300 mM NaCl, 5 mM BME, 200 mM imidazole), and 1 mL aliquots were frozen on dry ice. Purification protocol was adapted from Orm&#x000f6; et al.<sup><xref rid="R48" ref-type="bibr">48</xref></sup></p></sec><sec id="S14"><title>Crystallization of GFP-RNase(1&#x02013;8).</title><p id="P43">The protein was concentrated to approximately 40 mg/mL in a centrifugal concentrator. The buffer was exchanged three times with 20 mM HEPES pH 7.5 and 10 mM NaCl. Crystals were grown by the hanging-drop vapor diffusion method. The drops were set up using a Mosquito robot, dispensing drops composed of 200 nL of protein and 200 nL reservoir. The reservoir volume was 100 &#x003bc;L. Crystals typically grew within a few hours. Crystals in space group P2<sub>1</sub> and P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> were cryoprotected by the addition of glycerol to a concentration of 35%.</p><sec id="S15"><title>Crystallization of GFP-RNase(1&#x02013;8) in space group P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> form 1.</title><p id="P44">The reservoir contained 20% ethanol and 30% (v/v) 2-methyl-2,4-pentanediol (Qiagen MPD Suite condition 90 [H6]). The crystal grew at 20&#x000b0;C.</p></sec><sec id="S16"><title>Crystallization of GFP-RNase(1&#x02013;8) in space group P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> form 2.</title><p id="P45">The reservoir contained 0.1 M sodium acetate, pH 5.0 and 65% (v/v) 2-methyl-2,4-pentanediol (Qiagen MPD Suite condition 68 [F8]). The crystal grew at 4&#x000b0;C.</p></sec><sec id="S17"><title>Crystallization of GFP-RNase(1&#x02013;8) in space group C2.</title><p id="P46">The reservoir contained 0.1 M sodium citrate, 0.1 M HEPES pH 7.5, and 10% (v/v) 2-methyl-2,4-pentanediol (Qiagen MPD Suite condition 73 [G1]). The crystal grew at 4&#x000b0;C.</p></sec><sec id="S18"><title>Crystallization of GFP-RNase(1&#x02013;8) in space group P2<sub>1</sub>.</title><p id="P47">The reservoir contained 0.2 M KCl, 0.005M MgCl<sub>2</sub>, 0.05 M sodium cacodylate pH 6.5, and 10% (w/v) 1,6-hexanediol (Qiagen Nucleix Suite condition 21 [B9]). The crystal grew at 4&#x000b0;C.</p></sec></sec><sec id="S19"><title>Data collection and processing.</title><p id="P48">Diffraction data from GFP-RNase(1&#x02013;8) crystals in space group P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> (form 1), C2, and P2<sub>1</sub> were collected at the Advanced Photon Source beamline 24-ID-C using a Dectris Pilatus 6M-F pixel detector. The data set from crystal form 2 of space group P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> was collected at the beamline 24-ID-E using an ADSC Quantum 315 CCD detector. Each data set was collected from a single crystal at a temperature of 100 K. A cryoprotectant mixture of 35% glycerol and 65% reservoir was used for crystal form P2<sub>1</sub>. The remaining crystal forms required no additional cryoprotectant. The following X-ray wavelengths were used for crystal forms P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> (form 1), P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> (form 2), C2, and P2<sub>1</sub>, respectively: 0.9795 &#x000c5;, 0.9791 &#x000c5;, 0.9795 &#x000c5;, and 1.4760 &#x000c5;. Data was processed using the XDS package<sup><xref rid="R53" ref-type="bibr">53</xref></sup> for all crystals. In addition, autoPROC<sup><xref rid="R54" ref-type="bibr">54</xref></sup> was used for processing data sets in P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> (form 1), and C2; aimless<sup><xref rid="R55" ref-type="bibr">55</xref></sup> was used for scaling in P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> (form 1). Data collection statistics are reported in <xref rid="SD2" ref-type="supplementary-material">supplemental Table S1</xref>.</p></sec><sec id="S20"><title>Structure determination and refinement.</title><p id="P49">The first crystal form we obtained of GFP-RNase(1&#x02013;8), P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> form 1, was isomorphous with a previously determined GFP structure, PDB ID code 4P1Q<sup><xref rid="R56" ref-type="bibr">56</xref></sup>. We used these coordinates as a starting model in the refinement of the GFP-RNase(1&#x02013;8) Buster<sup><xref rid="R57" ref-type="bibr">57</xref></sup>. Difference density maps revealed the position of the N-terminal fusion, and these atoms were built using the graphics program Coot<sup><xref rid="R58" ref-type="bibr">58</xref></sup>. The remaining structures were determined by molecular replacement using the program Phaser<sup><xref rid="R59" ref-type="bibr">59</xref></sup>. The search model for molecular replacement in space group C2 was the refined structure from space group P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> form 1. The search model for molecular replacement in space groups P2<sub>1</sub> and P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> form 2 was the refined structure from space group C2. All models were built and refined using the same software as described above for P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> form 1. Structures were illustrated using the program Pymol<sup><xref rid="R60" ref-type="bibr">60</xref></sup>.</p></sec><sec id="S21"><title>Fiber diffraction simulation.</title><p id="P50">Crystal diffraction intensities were cylindrically averaged around the filament axis to simulate the radial disorder of a filament using a custom-written Fortran code.</p></sec><sec id="S22"><title>Identification and analysis of potential protofilament-forming interfaces in the Protein Data Bank.</title><p id="P51">To identify how many globular, monomeric proteins are poised to form protofilaments, we began by screening 130,446 PDB entry headers to identify monomers. We defined &#x0201c;monomers&#x0201d; as entries that contained only a single protein chain, were determined by crystallography, had resolution better than 3.0 &#x000c5;, and had no biological assembly operators other than the identity matrix. We also excluded redundant entries defined as having the same MOLECULE record and unit cell lengths identical to &#x000b1; 3&#x000c5; and unit cell angles identical to &#x000b1;5&#x000b0;. We found 12,933 entries that passed these criteria for monomers.</p><p id="P52">To determine how many of these monomers were poised to form screw-related protofilaments, we performed the following steps. First, we applied crystallographic operations to bring the molecule as close as possible to the origin, generating a reference molecule. Then, we applied crystallographic screw operations to the reference molecules, allowing for up to 3 unit cell translations in any combination along positive and negative a, b, and c. Screw operators were accepted as potential protofilament generators only if the reference molecule and the operated molecule were close enough to touch. We defined touching distance as any distance less than twice the distance of the furthest atom of the reference molecule from its center of mass. These criteria produced 73,968 screw-related pairs. Surface areas buried by these interfaces were calculated using the CCP4 program, areaimol<sup><xref rid="R55" ref-type="bibr">55</xref></sup>. Of these, 26,375 screw-related pairs meet the criterion of a quasi-stable interface as defined by equaling or surpassing the 901 &#x000c5;<sup>2</sup> of buried surface area (450 &#x000c5;<sup>2</sup> per surface) observed between barrel domains of our GFP fusion protein protofilament. That is, 9119 PDB entries, or 74% of the monomer entries. Including a constraint that one of the termini be located within 12 &#x000c5; of the intermolecular interface (a threshold defined by our GFP fusion protein) leaves 53% of monomeric PDB entries meeting these combined criteria. That is, over half the monomer entries in the PDB (6816 entries) are poised for filament formation.</p></sec><sec id="S23"><title>Simulation of helix-forming propensity for randomly generated 13-mers.</title><p id="P53">We used a random number generator and probability weighted amino acid frequencies in vertebrates to calculate one million peptide sequences. We scored the sequences for helix propensity using Chou &#x00026; Fasman rules<sup><xref rid="R61" ref-type="bibr">61</xref></sup>. Using two different seeds for the random number generator, we found 127,952 and 128,171 sequences passed the criteria for helix. That is approximately 13%.</p></sec></sec><sec sec-type="supplementary-material" id="SM1"><title>Supplementary Material</title><supplementary-material content-type="local-data" id="SD1"><label>1</label><media xlink:href="NIHMS977255-supplement-1.doc" orientation="portrait" xlink:type="simple" id="d36e1195" position="anchor"/></supplementary-material><supplementary-material content-type="local-data" id="SD2"><label>2</label><media xlink:href="NIHMS977255-supplement-2.pdf" orientation="portrait" xlink:type="simple" id="d36e1199" position="anchor"/></supplementary-material></sec></body><back><ack id="S24"><title>Acknowledgments</title><p id="P54">We thank the Howard Hughes Medical Institute and National Science Foundation (grant 1616265) for support to DSE, and National Institutes of Health (grants OD003806 and GM115941) for support to AH. Diffraction data were collected at the Northeastern Collaborative Access Team beamlines 24-ID-E and C, which are funded by the National Institute of General Medical Sciences from the National Institutes of Health (P41 GM103403). The Pilatus 6M detector on 24-ID-C beam line is funded by a NIH-ORIP HEI grant (S10 RR029205). This research used resources of the Advanced Photon Source, a U.S. Department of Energy (DOE) Office of Science User Facility operated for the DOE Office of Science by Argonne National Laboratory under Contract No. DE-AC02&#x02013;06CH11357. We thank M. Collazo and D. Cascio of the UCLA-DOE X-ray Crystallization and Crystallography Core Facilities for their assistance with crystallization and data collection, J. Abraham for helpful discussion, and S. Dove for critical reading of the manuscript.</p></ack><fn-group><fn id="FN2"><p id="P55"><bold>Data availability.</bold> Atomic coordinates of the X-ray structures of GFP-RNase(1&#x02013;8) in space groups P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> forms 1 and 2, C2, and P2<sub>1</sub> were deposited in the Protein Data Bank (PDB) under accession codes PDB 5HGE, 6AS9, 5HBD, and 5HW9, respectively.</p><p id="P65">Life Sciences Reporting Summary for this article is available.</p></fn><fn id="FN3"><p id="P56"><bold>Code availability.</bold> The custom written Fortran program used to calculate the fibril diffraction pattern displayed in <xref rid="F3" ref-type="fig">Fig 3e</xref> is available upon request.</p></fn><fn fn-type="COI-statement" id="FN4"><p id="P57">Competing interests</p><p id="P58">The authors declare no competing financial interests.</p></fn></fn-group><ref-list><title>References</title><ref id="R1"><label>1.</label><mixed-citation publication-type="journal"><name><surname>Shih</surname><given-names>Y-L</given-names></name> &#x00026; <name><surname>Rothfield</surname><given-names>L</given-names></name>
<article-title>The Bacterial Cytoskeleton.</article-title>
<source>Microbiol. Mol. Biol. Rev</source>
<volume>70</volume>, <fpage>729</fpage>&#x02013;<lpage>754</lpage> (<year>2006</year>).<pub-id pub-id-type="pmid">16959967</pub-id></mixed-citation></ref><ref id="R2"><label>2.</label><mixed-citation publication-type="book"><name><surname>Jockusch</surname><given-names>BM</given-names></name> &#x00026; <name><surname>Graumann</surname><given-names>PL</given-names></name>
<chapter-title>The Long Journey: Actin on the Road to Pro- and Eukaryotic Cells.</chapter-title> in <source>Reviews of Physiology, Biochemistry and Pharmacology</source>
<volume>161</volume> (eds. <name><surname>Amara</surname><given-names>SG</given-names></name>
<etal/>) <fpage>67</fpage>&#x02013;<lpage>85</lpage> (<publisher-name>Springer Berlin</publisher-name>
<publisher-loc>Heidelberg</publisher-loc>, <year>2011</year>). doi:<pub-id pub-id-type="doi">10.1007/112_2011_1</pub-id></mixed-citation></ref><ref id="R3"><label>3.</label><mixed-citation publication-type="journal"><name><surname>Bernander</surname><given-names>R</given-names></name>, <name><surname>Lind</surname><given-names>AE</given-names></name> &#x00026; <name><surname>Ettema</surname><given-names>TJG</given-names></name>
<article-title>An archaeal origin for the actin cytoskeleton: Implications for eukaryogenesis.</article-title>
<source>Commun. Integr. Biol</source>
<volume>4</volume>, <fpage>664</fpage>&#x02013;<lpage>667</lpage> (<year>2011</year>).<pub-id pub-id-type="pmid">22446522</pub-id></mixed-citation></ref><ref id="R4"><label>4.</label><mixed-citation publication-type="journal"><name><surname>Wickstead</surname><given-names>B</given-names></name> &#x00026; <name><surname>Gull</surname><given-names>K</given-names></name>
<article-title>The evolution of the cytoskeleton.</article-title>
<source>J. Cell Biol</source>
<volume>194</volume>, <fpage>513</fpage>&#x02013;<lpage>525</lpage> (<year>2011</year>).<pub-id pub-id-type="pmid">21859859</pub-id></mixed-citation></ref><ref id="R5"><label>5.</label><mixed-citation publication-type="journal"><name><surname>O&#x02019;Connell</surname><given-names>JD</given-names></name>, <name><surname>Zhao</surname><given-names>A</given-names></name>, <name><surname>Ellington</surname><given-names>AD</given-names></name> &#x00026; <name><surname>Marcotte</surname><given-names>EM</given-names></name>
<article-title>Dynamic Reorganization of Metabolic Enzymes into Intracellular Bodies.</article-title>
<source>Annu. Rev. Cell Dev. Biol</source>
<volume>28</volume>, <fpage>89</fpage>&#x02013;<lpage>111</lpage> (<year>2012</year>).<pub-id pub-id-type="pmid">23057741</pub-id></mixed-citation></ref><ref id="R6"><label>6.</label><mixed-citation publication-type="journal"><name><surname>Petrovska</surname><given-names>I</given-names></name>
<etal/>
<article-title>Filament formation by metabolic enzymes is a specific adaptation to an advanced state of cellular starvation.</article-title>
<source>eLife</source> (<year>2014</year>). doi:<pub-id pub-id-type="doi">10.7554/eLife.02409</pub-id></mixed-citation></ref><ref id="R7"><label>7.</label><mixed-citation publication-type="journal"><name><surname>Aughey</surname><given-names>GN</given-names></name> &#x00026; <name><surname>Liu</surname><given-names>J-L</given-names></name>
<article-title>Metabolic regulation via enzyme filamentation.</article-title>
<source>Crit. Rev. Biochem. Mol. Biol</source>
<volume>51</volume>, <fpage>282</fpage>&#x02013;<lpage>293</lpage> (<year>2016</year>).</mixed-citation></ref><ref id="R8"><label>8.</label><mixed-citation publication-type="journal"><name><surname>Barry</surname><given-names>RM</given-names></name>
<etal/>
<article-title>Large-scale filament formation inhibits the activity of CTP synthetase.</article-title>
<source>eLife</source>
<volume>3</volume>, <fpage>e03638</fpage> (<year>2014</year>).<pub-id pub-id-type="pmid">25030911</pub-id></mixed-citation></ref><ref id="R9"><label>9.</label><mixed-citation publication-type="journal"><name><surname>Liu</surname><given-names>Y</given-names></name> &#x00026; <name><surname>Eisenberg</surname><given-names>D</given-names></name>
<article-title>3D domain swapping: as domains continue to swap.</article-title>
<source>Protein Sci. Publ. Protein Soc</source>
<volume>11</volume>, <fpage>1285</fpage>&#x02013;<lpage>1299</lpage> (<year>2002</year>).</mixed-citation></ref><ref id="R10"><label>10.</label><mixed-citation publication-type="journal"><name><surname>Bennett</surname><given-names>MJ</given-names></name>, <name><surname>Sawaya</surname><given-names>MR</given-names></name> &#x00026; <name><surname>Eisenberg</surname><given-names>D</given-names></name>
<article-title>Deposition Diseases and 3D Domain Swapping.</article-title>
<source>Structure</source>
<volume>14</volume>, <fpage>811</fpage>&#x02013;<lpage>824</lpage> (<year>2006</year>).<pub-id pub-id-type="pmid">16698543</pub-id></mixed-citation></ref><ref id="R11"><label>11.</label><mixed-citation publication-type="journal"><name><surname>Ogihara</surname><given-names>NL</given-names></name>
<etal/>
<article-title>Design of three-dimensional domain-swapped dimers and fibrous oligomers.</article-title>
<source>Proc. Natl. Acad. Sci. U. S. A</source>
<volume>98</volume>, <fpage>1404</fpage>&#x02013;<lpage>1409</lpage> (<year>2001</year>).<pub-id pub-id-type="pmid">11171963</pub-id></mixed-citation></ref><ref id="R12"><label>12.</label><mixed-citation publication-type="journal"><name><surname>Huntington</surname><given-names>JA</given-names></name>
<etal/>
<article-title>A 2.6 A structure of a serpin polymer and implications for conformational disease.</article-title>
<source>J. Mol. Biol</source>
<volume>293</volume>, <fpage>449</fpage>&#x02013;<lpage>455</lpage> (<year>1999</year>).<pub-id pub-id-type="pmid">10543942</pub-id></mixed-citation></ref><ref id="R13"><label>13.</label><mixed-citation publication-type="journal"><name><surname>Strop</surname><given-names>P</given-names></name>, <name><surname>Smith</surname><given-names>KS</given-names></name>, <name><surname>Iverson</surname><given-names>TM</given-names></name>, <name><surname>Ferry</surname><given-names>JG</given-names></name> &#x00026; <name><surname>Rees</surname><given-names>DC</given-names></name>
<article-title>Crystal structure of the &#x02018;cab&#x02019;-type beta class carbonic anhydrase from the archaeon Methanobacterium thermoautotrophicum.</article-title>
<source>J. Biol. Chem</source>
<volume>276</volume>, <fpage>10299</fpage>&#x02013;<lpage>10305</lpage> (<year>2001</year>).<pub-id pub-id-type="pmid">11096105</pub-id></mixed-citation></ref><ref id="R14"><label>14.</label><mixed-citation publication-type="journal"><name><surname>Guo</surname><given-names>Z</given-names></name> &#x00026; <name><surname>Eisenberg</surname><given-names>D</given-names></name>
<article-title>Runaway domain swapping in amyloid-like fibrils of T7 endonuclease I.</article-title>
<source>Proc. Natl. Acad. Sci</source>
<volume>103</volume>, <fpage>8042</fpage>&#x02013;<lpage>8047</lpage> (<year>2006</year>).<pub-id pub-id-type="pmid">16698921</pub-id></mixed-citation></ref><ref id="R15"><label>15.</label><mixed-citation publication-type="journal"><name><surname>Wahlbom</surname><given-names>M</given-names></name>
<etal/>
<article-title>Fibrillogenic oligomers of human cystatin C are formed by propagated domain swapping.</article-title>
<source>J. Biol. Chem</source>
<volume>282</volume>, <fpage>18318</fpage>&#x02013;<lpage>18326</lpage> (<year>2007</year>).<pub-id pub-id-type="pmid">17470433</pub-id></mixed-citation></ref><ref id="R16"><label>16.</label><mixed-citation publication-type="journal"><name><surname>Jask&#x000f3;lski</surname><given-names>M</given-names></name>
<article-title>3D domain swapping, protein oligomerization, and amyloid formation.</article-title>
<source>Acta Biochim. Pol</source>
<volume>48</volume>, <fpage>807</fpage>&#x02013;<lpage>827</lpage> (<year>2001</year>).<pub-id pub-id-type="pmid">11995994</pub-id></mixed-citation></ref><ref id="R17"><label>17.</label><mixed-citation publication-type="journal"><name><surname>Sambashivan</surname><given-names>S</given-names></name>, <name><surname>Liu</surname><given-names>Y</given-names></name>, <name><surname>Sawaya</surname><given-names>MR</given-names></name>, <name><surname>Gingery</surname><given-names>M</given-names></name> &#x00026; <name><surname>Eisenberg</surname><given-names>D</given-names></name>
<article-title>Amyloid-like fibrils of ribonuclease A with three-dimensional domain-swapped and native-like structure.</article-title>
<source>Nature</source>
<volume>437</volume>, <fpage>266</fpage>&#x02013;<lpage>269</lpage> (<year>2005</year>).<pub-id pub-id-type="pmid">16148936</pub-id></mixed-citation></ref><ref id="R18"><label>18.</label><mixed-citation publication-type="journal"><name><surname>Teng</surname><given-names>PK</given-names></name> &#x00026; <name><surname>Eisenberg</surname><given-names>D</given-names></name>
<article-title>Short protein segments can drive a non-fibrillizing protein into the amyloid state.</article-title>
<source>Protein Eng. Des. Sel. PEDS</source>
<volume>22</volume>, <fpage>531</fpage>&#x02013;<lpage>536</lpage> (<year>2009</year>).<pub-id pub-id-type="pmid">19602569</pub-id></mixed-citation></ref><ref id="R19"><label>19.</label><mixed-citation publication-type="journal"><name><surname>Goldschmidt</surname><given-names>L</given-names></name>, <name><surname>Teng</surname><given-names>PK</given-names></name>, <name><surname>Riek</surname><given-names>R</given-names></name> &#x00026; <name><surname>Eisenberg</surname><given-names>D</given-names></name>
<article-title>Identifying the amylome, proteins capable of forming amyloid-like fibrils.</article-title>
<source>Proc. Natl. Acad. Sci</source>
<volume>107</volume>, <fpage>3487</fpage>&#x02013;<lpage>3492</lpage> (<year>2010</year>).<pub-id pub-id-type="pmid">20133726</pub-id></mixed-citation></ref><ref id="R20"><label>20.</label><mixed-citation publication-type="journal"><name><surname>Lawrence</surname><given-names>MC</given-names></name> &#x00026; <name><surname>Colman</surname><given-names>PM</given-names></name>
<article-title>Shape complementarity at protein/protein interfaces.</article-title>
<source>J. Mol. Biol</source>
<volume>234</volume>, <fpage>946</fpage>&#x02013;<lpage>950</lpage> (<year>1993</year>).<pub-id pub-id-type="pmid">8263940</pub-id></mixed-citation></ref><ref id="R21"><label>21.</label><mixed-citation publication-type="journal"><name><surname>Yang</surname><given-names>F</given-names></name>, <name><surname>Moss</surname><given-names>LG</given-names></name> &#x00026; <name><surname>Phillips</surname><given-names>GN</given-names></name>
<article-title>The molecular structure of green fluorescent protein.</article-title>
<source>Nat. Biotechnol</source>
<volume>14</volume>, <fpage>1246</fpage>&#x02013;<lpage>1251</lpage> (<year>1996</year>).<pub-id pub-id-type="pmid">9631087</pub-id></mixed-citation></ref><ref id="R22"><label>22.</label><mixed-citation publication-type="journal"><name><surname>Ponstingl</surname><given-names>H</given-names></name>, <name><surname>Henrick</surname><given-names>K</given-names></name> &#x00026; <name><surname>Thornton</surname><given-names>JM</given-names></name>
<article-title>Discriminating between homodimeric and monomeric proteins in the crystalline state.</article-title>
<source>Proteins</source>
<volume>41</volume>, <fpage>47</fpage>&#x02013;<lpage>57</lpage> (<year>2000</year>).<pub-id pub-id-type="pmid">10944393</pub-id></mixed-citation></ref><ref id="R23"><label>23.</label><mixed-citation publication-type="journal"><name><surname>Wellems</surname><given-names>TE</given-names></name> &#x00026; <name><surname>Josephs</surname><given-names>R</given-names></name>
<article-title>Crystallization of deoxyhemoglobin S by fiber alignment and fusion.</article-title>
<source>J. Mol. Biol</source>
<volume>135</volume>, <fpage>651</fpage>&#x02013;<lpage>674</lpage> (<year>1979</year>).<pub-id pub-id-type="pmid">43904</pub-id></mixed-citation></ref><ref id="R24"><label>24.</label><mixed-citation publication-type="journal"><name><surname>Harrington</surname><given-names>DJ</given-names></name>, <name><surname>Adachi</surname><given-names>K</given-names></name> &#x00026; <name><surname>Royer</surname><given-names>WE</given-names><suffix>Jr</suffix></name>
<article-title>The high resolution crystal structure of deoxyhemoglobin S1.</article-title>
<source>J. Mol. Biol</source>
<volume>272</volume>, <fpage>398</fpage>&#x02013;<lpage>407</lpage> (<year>1997</year>).<pub-id pub-id-type="pmid">9325099</pub-id></mixed-citation></ref><ref id="R25"><label>25.</label><mixed-citation publication-type="journal"><name><surname>Liu</surname><given-names>Y</given-names></name>, <name><surname>Gotte</surname><given-names>G</given-names></name>, <name><surname>Libonati</surname><given-names>M</given-names></name> &#x00026; <name><surname>Eisenberg</surname><given-names>D</given-names></name>
<article-title>Structures of the two 3D domain-swapped RNase A trimers.</article-title>
<source>Protein Sci</source>
<volume>11</volume>, <fpage>371</fpage>&#x02013;<lpage>380</lpage> (<year>2002</year>).<pub-id pub-id-type="pmid">11790847</pub-id></mixed-citation></ref><ref id="R26"><label>26.</label><mixed-citation publication-type="journal"><name><surname>Wang</surname><given-names>Z</given-names></name>, <name><surname>Kishchenko</surname><given-names>G</given-names></name>, <name><surname>Chen</surname><given-names>Y</given-names></name> &#x00026; <name><surname>Josephs</surname><given-names>R</given-names></name>
<article-title>Polymerization of deoxy-sickle cell hemoglobin in high-phosphate buffer.</article-title>
<source>J. Struct. Biol</source>
<volume>131</volume>, <fpage>197</fpage>&#x02013;<lpage>209</lpage> (<year>2000</year>).<pub-id pub-id-type="pmid">11052892</pub-id></mixed-citation></ref><ref id="R27"><label>27.</label><mixed-citation publication-type="journal"><name><surname>Lomas</surname><given-names>DA</given-names></name>, <name><surname>Evans</surname><given-names>DL</given-names></name>, <name><surname>Finch</surname><given-names>JT</given-names></name> &#x00026; <name><surname>Carrell</surname><given-names>RW</given-names></name>
<article-title>The mechanism of Z alpha 1-antitrypsin accumulation in the liver.</article-title>
<source>Nature</source>
<volume>357</volume>, <fpage>605</fpage>&#x02013;<lpage>607</lpage> (<year>1992</year>).<pub-id pub-id-type="pmid">1608473</pub-id></mixed-citation></ref><ref id="R28"><label>28.</label><mixed-citation publication-type="journal"><name><surname>Bennett</surname><given-names>MJ</given-names></name>, <name><surname>Choe</surname><given-names>S</given-names></name> &#x00026; <name><surname>Eisenberg</surname><given-names>D</given-names></name>
<article-title>Domain swapping: entangling alliances between proteins.</article-title>
<source>Proc. Natl. Acad. Sci</source>
<volume>91</volume>, <fpage>3127</fpage>&#x02013;<lpage>3131</lpage> (<year>1994</year>).<pub-id pub-id-type="pmid">8159715</pub-id></mixed-citation></ref><ref id="R29"><label>29.</label><mixed-citation publication-type="journal"><name><surname>Raymond</surname><given-names>DD</given-names></name>, <name><surname>Piper</surname><given-names>ME</given-names></name>, <name><surname>Gerrard</surname><given-names>SR</given-names></name>, <name><surname>Skiniotis</surname><given-names>G</given-names></name> &#x00026; <name><surname>Smith</surname><given-names>JL</given-names></name>
<article-title>Phleboviruses encapsidate their genomes by sequestering RNA bases.</article-title>
<source>Proc. Natl. Acad. Sci. U. S. A</source>
<volume>109</volume>, <fpage>19208</fpage>&#x02013;<lpage>19213</lpage> (<year>2012</year>).<pub-id pub-id-type="pmid">23129612</pub-id></mixed-citation></ref><ref id="R30"><label>30.</label><mixed-citation publication-type="journal"><name><surname>Hospenthal</surname><given-names>MK</given-names></name>
<etal/>
<article-title>The Cryoelectron Microscopy Structure of the Type 1 Chaperone-Usher Pilus Rod.</article-title>
<source>Structure</source>
<volume>25</volume>, <fpage>1829</fpage>&#x02013;<lpage>1838</lpage>.<comment>e4</comment> (<year>2017</year>).<pub-id pub-id-type="pmid">29129382</pub-id></mixed-citation></ref><ref id="R31"><label>31.</label><mixed-citation publication-type="journal"><name><surname>Kuhlman</surname><given-names>B</given-names></name>, <name><surname>O&#x02019;Neill</surname><given-names>JW</given-names></name>, <name><surname>Kim</surname><given-names>DE</given-names></name>, <name><surname>Zhang</surname><given-names>KY</given-names></name> &#x00026; <name><surname>Baker</surname><given-names>D</given-names></name>
<article-title>Conversion of monomeric protein L to an obligate dimer by computational protein design.</article-title>
<source>Proc. Natl. Acad. Sci. U. S. A</source>
<volume>98</volume>, <fpage>10687</fpage>&#x02013;<lpage>10691</lpage> (<year>2001</year>).<pub-id pub-id-type="pmid">11526208</pub-id></mixed-citation></ref><ref id="R32"><label>32.</label><mixed-citation publication-type="journal"><name><surname>Kelley</surname><given-names>BS</given-names></name>, <name><surname>Chang</surname><given-names>LC</given-names></name> &#x00026; <name><surname>Bewley</surname><given-names>CA</given-names></name>
<article-title>Engineering an obligate domain-swapped dimer of cyanovirin-N with enhanced anti-HIV activity.</article-title>
<source>J. Am. Chem. Soc</source>
<volume>124</volume>, <fpage>3210</fpage>&#x02013;<lpage>3211</lpage> (<year>2002</year>).<pub-id pub-id-type="pmid">11916396</pub-id></mixed-citation></ref><ref id="R33"><label>33.</label><mixed-citation publication-type="journal"><name><surname>Ruigrok</surname><given-names>RWH</given-names></name> &#x00026; <name><surname>DiCapua</surname><given-names>E</given-names></name>
<article-title>On the polymerization state of recA in the absence of DNA.</article-title>
<source>Biochimie</source>
<volume>73</volume>, <fpage>191</fpage>&#x02013;<lpage>198</lpage> (<year>1991</year>).<pub-id pub-id-type="pmid">1883881</pub-id></mixed-citation></ref><ref id="R34"><label>34.</label><mixed-citation publication-type="journal"><name><surname>von der Ecken</surname><given-names>J</given-names></name>
<etal/>
<article-title>Structure of the F-actin-tropomyosin complex.</article-title>
<source>Nature</source>
<volume>519</volume>, <fpage>114</fpage>&#x02013;<lpage>117</lpage> (<year>2015</year>).<pub-id pub-id-type="pmid">25470062</pub-id></mixed-citation></ref><ref id="R35"><label>35.</label><mixed-citation publication-type="journal"><name><surname>Greenberg</surname><given-names>MJ</given-names></name>, <name><surname>Wang</surname><given-names>C-LA</given-names></name>, <name><surname>Lehman</surname><given-names>W</given-names></name> &#x00026; <name><surname>Moore</surname><given-names>JR</given-names></name>
<article-title>Modulation of Actin Mechanics by Caldesmon and Tropomyosin.</article-title>
<source>Cell Motil. Cytoskeleton</source>
<volume>65</volume>, <fpage>156</fpage>&#x02013;<lpage>164</lpage> (<year>2008</year>).<pub-id pub-id-type="pmid">18000881</pub-id></mixed-citation></ref><ref id="R36"><label>36.</label><mixed-citation publication-type="journal"><name><surname>Jung</surname><given-names>J</given-names></name> &#x00026; <name><surname>Lee</surname><given-names>B</given-names></name>
<article-title>Circularly permuted proteins in the protein structure database.</article-title>
<source>Protein Sci. Publ. Protein Soc</source>
<volume>10</volume>, <fpage>1881</fpage>&#x02013;<lpage>1886</lpage> (<year>2001</year>).</mixed-citation></ref><ref id="R37"><label>37.</label><mixed-citation publication-type="journal"><name><surname>Sawaya</surname><given-names>MR</given-names></name>, <name><surname>Guo</surname><given-names>S</given-names></name>, <name><surname>Tabor</surname><given-names>S</given-names></name>, <name><surname>Richardson</surname><given-names>CC</given-names></name> &#x00026; <name><surname>Ellenberger</surname><given-names>T</given-names></name>
<article-title>Crystal structure of the helicase domain from the replicative helicase-primase of bacteriophage T7.</article-title>
<source>Cell</source>
<volume>99</volume>, <fpage>167</fpage>&#x02013;<lpage>177</lpage> (<year>1999</year>).<pub-id pub-id-type="pmid">10535735</pub-id></mixed-citation></ref><ref id="R38"><label>38.</label><mixed-citation publication-type="journal"><name><surname>Lynch</surname><given-names>EM</given-names></name>
<etal/>
<article-title>Human CTP synthase filament structure reveals the active enzyme conformation.</article-title>
<source>Nat. Struct. Mol. Biol</source>
<volume>24</volume>, <fpage>507</fpage>&#x02013;<lpage>514</lpage> (<year>2017</year>).<pub-id pub-id-type="pmid">28459447</pub-id></mixed-citation></ref><ref id="R39"><label>39.</label><mixed-citation publication-type="journal"><name><surname>Barry</surname><given-names>RM</given-names></name> &#x00026; <name><surname>Gitai</surname><given-names>Z</given-names></name>
<article-title>Self-assembling enzymes and the origins of the cytoskeleton.</article-title>
<source>Curr. Opin. Microbiol</source>
<volume>14</volume>, <fpage>704</fpage>&#x02013;<lpage>711</lpage> (<year>2011</year>).<pub-id pub-id-type="pmid">22014508</pub-id></mixed-citation></ref><ref id="R40"><label>40.</label><mixed-citation publication-type="journal"><name><surname>Shen</surname><given-names>Q-J</given-names></name>
<etal/>
<article-title>Filamentation of Metabolic Enzymes in Saccharomyces cerevisiae.</article-title>
<source>J. Genet. Genomics Yi Chuan Xue Bao</source>
<volume>43</volume>, <fpage>393</fpage>&#x02013;<lpage>404</lpage> (<year>2016</year>).<pub-id pub-id-type="pmid">27312010</pub-id></mixed-citation></ref><ref id="R41"><label>41.</label><mixed-citation publication-type="journal"><name><surname>Pandya</surname><given-names>MJ</given-names></name>
<etal/>
<article-title>Sticky-end assembly of a designed peptide fiber provides insight into protein fibrillogenesis.</article-title>
<source>Biochemistry (Mosc.)</source>
<volume>39</volume>, <fpage>8728</fpage>&#x02013;<lpage>8734</lpage> (<year>2000</year>).</mixed-citation></ref><ref id="R42"><label>42.</label><mixed-citation publication-type="journal"><name><surname>Potekhin</surname><given-names>SA</given-names></name>
<etal/>
<article-title>De novo design of fibrils made of short alpha-helical coiled coil peptides.</article-title>
<source>Chem. Biol</source>
<volume>8</volume>, <fpage>1025</fpage>&#x02013;<lpage>1032</lpage> (<year>2001</year>).<pub-id pub-id-type="pmid">11731294</pub-id></mixed-citation></ref><ref id="R43"><label>43.</label><mixed-citation publication-type="journal"><name><surname>Padilla</surname><given-names>JE</given-names></name>, <name><surname>Colovos</surname><given-names>C</given-names></name> &#x00026; <name><surname>Yeates</surname><given-names>TO</given-names></name>
<article-title>Nanohedra: using symmetry to design self assembling protein cages, layers, crystals, and filaments.</article-title>
<source>Proc. Natl. Acad. Sci. U. S. A</source>
<volume>98</volume>, <fpage>2217</fpage>&#x02013;<lpage>2221</lpage> (<year>2001</year>).<pub-id pub-id-type="pmid">11226219</pub-id></mixed-citation></ref><ref id="R44"><label>44.</label><mixed-citation publication-type="journal"><name><surname>Yeates</surname><given-names>TO</given-names></name> &#x00026; <name><surname>Padilla</surname><given-names>JE</given-names></name>
<article-title>Designing supramolecular protein assemblies.</article-title>
<source>Curr. Opin. Struct. Biol</source>
<volume>12</volume>, <fpage>464</fpage>&#x02013;<lpage>470</lpage> (<year>2002</year>).<pub-id pub-id-type="pmid">12163069</pub-id></mixed-citation></ref><ref id="R45"><label>45.</label><mixed-citation publication-type="journal"><name><surname>Mou</surname><given-names>Y</given-names></name>, <name><surname>Yu</surname><given-names>J-Y</given-names></name>, <name><surname>Wannier</surname><given-names>TM</given-names></name>, <name><surname>Guo</surname><given-names>C-L</given-names></name> &#x00026; <name><surname>Mayo</surname><given-names>SL</given-names></name>
<article-title>Computational design of co-assembling protein-DNA nanowires.</article-title>
<source>Nature</source>
<volume>525</volume>, <fpage>230</fpage>&#x02013;<lpage>233</lpage> (<year>2015</year>).<pub-id pub-id-type="pmid">26331548</pub-id></mixed-citation></ref><ref id="R46"><label>46.</label><mixed-citation publication-type="journal"><name><surname>Sayre</surname><given-names>TC</given-names></name>, <name><surname>Lee</surname><given-names>TM</given-names></name>, <name><surname>King</surname><given-names>NP</given-names></name> &#x00026; <name><surname>Yeates</surname><given-names>TO</given-names></name>
<article-title>Protein stabilization in a highly knotted protein polymer.</article-title>
<source>Protein Eng. Des. Sel</source>
<volume>24</volume>, <fpage>627</fpage>&#x02013;<lpage>630</lpage> (<year>2011</year>).<pub-id pub-id-type="pmid">21669955</pub-id></mixed-citation></ref><ref id="R47"><label>47.</label><mixed-citation publication-type="journal"><name><surname>Bharat</surname><given-names>TAM</given-names></name>, <name><surname>Murshudov</surname><given-names>GN</given-names></name>, <name><surname>Sachse</surname><given-names>C</given-names></name> &#x00026; <name><surname>L&#x000f6;we</surname><given-names>J</given-names></name>
<article-title>Structures of actin-like ParM filaments show architecture of plasmid-segregating spindles.</article-title>
<source>Nature</source>
<volume>523</volume>, <fpage>106</fpage>&#x02013;<lpage>110</lpage> (<year>2015</year>).<pub-id pub-id-type="pmid">25915019</pub-id></mixed-citation></ref><ref id="R48"><label>48.</label><mixed-citation publication-type="journal"><name><surname>Orm&#x000f6;</surname><given-names>M</given-names></name>
<etal/>
<article-title>Crystal structure of the Aequorea victoria green fluorescent protein.</article-title>
<source>Science</source>
<volume>273</volume>, <fpage>1392</fpage>&#x02013;<lpage>1395</lpage> (<year>1996</year>).<pub-id pub-id-type="pmid">8703075</pub-id></mixed-citation></ref><ref id="R49"><label>49.</label><mixed-citation publication-type="journal"><name><surname>Garrity</surname><given-names>SJ</given-names></name>, <name><surname>Sivanathan</surname><given-names>V</given-names></name>, <name><surname>Dong</surname><given-names>J</given-names></name>, <name><surname>Lindquist</surname><given-names>S</given-names></name> &#x00026; <name><surname>Hochschild</surname><given-names>A</given-names></name>
<article-title>Conversion of a yeast prion protein to an infectious form in bacteria.</article-title>
<source>Proc. Natl. Acad. Sci. U. S. A</source>
<volume>107</volume>, <fpage>10596</fpage>&#x02013;<lpage>10601</lpage> (<year>2010</year>).<pub-id pub-id-type="pmid">20484678</pub-id></mixed-citation></ref><ref id="R50"><label>50.</label><mixed-citation publication-type="journal"><name><surname>Khlebnikov</surname><given-names>A</given-names></name>, <name><surname>Datsenko</surname><given-names>KA</given-names></name>, <name><surname>Skaug</surname><given-names>T</given-names></name>, <name><surname>Wanner</surname><given-names>BL</given-names></name> &#x00026; <name><surname>Keasling</surname><given-names>JD</given-names></name>
<article-title>Homogeneous expression of the P(BAD) promoter in Escherichia coli by constitutive expression of the low-affinity high-capacity AraE transporter.</article-title>
<source>Microbiol. Read. Engl</source>
<volume>147</volume>, <fpage>3241</fpage>&#x02013;<lpage>3247</lpage> (<year>2001</year>).</mixed-citation></ref><ref id="R51"><label>51.</label><mixed-citation publication-type="journal"><name><surname>Doan</surname><given-names>T</given-names></name>, <name><surname>Marquis</surname><given-names>KA</given-names></name> &#x00026; <name><surname>Rudner</surname><given-names>DZ</given-names></name>
<article-title>Subcellular localization of a sporulation membrane protein is achieved through a network of interactions along and across the septum.</article-title>
<source>Mol. Microbiol</source>
<volume>55</volume>, <fpage>1767</fpage>&#x02013;<lpage>1781</lpage> (<year>2005</year>).<pub-id pub-id-type="pmid">15752199</pub-id></mixed-citation></ref><ref id="R52"><label>52.</label><mixed-citation publication-type="journal"><name><surname>Schneider</surname><given-names>CA</given-names></name>, <name><surname>Rasband</surname><given-names>WS</given-names></name> &#x00026; <name><surname>Eliceiri</surname><given-names>KW</given-names></name>
<article-title>NIH Image to ImageJ: 25 years of image analysis.</article-title>
<source>Nat. Methods</source>
<volume>9</volume>, <fpage>671</fpage>&#x02013;<lpage>675</lpage> (<year>2012</year>).<pub-id pub-id-type="pmid">22930834</pub-id></mixed-citation></ref><ref id="R53"><label>53.</label><mixed-citation publication-type="journal"><name><surname>Kabsch</surname><given-names>W</given-names></name>
<article-title>XDS.</article-title>
<source>Acta Crystallogr. D Biol. Crystallogr</source>
<volume>66</volume>, <fpage>125</fpage>&#x02013;<lpage>132</lpage> (<year>2010</year>).<pub-id pub-id-type="pmid">20124692</pub-id></mixed-citation></ref><ref id="R54"><label>54.</label><mixed-citation publication-type="journal"><name><surname>Vonrhein</surname><given-names>C</given-names></name>
<etal/>
<article-title>Data processing and analysis with the autoPROC toolbox.</article-title>
<source>Acta Crystallogr. D Biol. Crystallogr</source>
<volume>67</volume>, <fpage>293</fpage>&#x02013;<lpage>302</lpage> (<year>2011</year>).<pub-id pub-id-type="pmid">21460447</pub-id></mixed-citation></ref><ref id="R55"><label>55.</label><mixed-citation publication-type="journal"><name><surname>Winn</surname><given-names>MD</given-names></name>
<etal/>
<article-title>Overview of the CCP4 suite and current developments.</article-title>
<source>Acta Crystallogr. D Biol. Crystallogr</source>
<volume>67</volume>, <fpage>235</fpage>&#x02013;<lpage>242</lpage> (<year>2011</year>).<pub-id pub-id-type="pmid">21460441</pub-id></mixed-citation></ref><ref id="R56"><label>56.</label><mixed-citation publication-type="journal"><name><surname>Auerbach</surname><given-names>D</given-names></name>
<etal/>
<article-title>Replacement of highly conserved E222 by the photostable non-photoconvertible histidine in GFP.</article-title>
<source>Chembiochem Eur. J. Chem. Biol</source>
<volume>15</volume>, <fpage>1404</fpage>&#x02013;<lpage>1408</lpage> (<year>2014</year>).</mixed-citation></ref><ref id="R57"><label>57.</label><mixed-citation publication-type="book"><name><surname>Bricogne</surname><given-names>G</given-names></name>
<etal/>
<source>Buster version 1.10.0</source> (<publisher-name>Global Phasing Ltd</publisher-name>, <year>2016</year>).</mixed-citation></ref><ref id="R58"><label>58.</label><mixed-citation publication-type="journal"><name><surname>Emsley</surname><given-names>P</given-names></name>, <name><surname>Lohkamp</surname><given-names>B</given-names></name>, <name><surname>Scott</surname><given-names>WG</given-names></name> &#x00026; <name><surname>Cowtan</surname><given-names>K</given-names></name>
<article-title>Features and development of Coot.</article-title>
<source>Acta Crystallogr. D Biol. Crystallogr</source>
<volume>66</volume>, <fpage>486</fpage>&#x02013;<lpage>501</lpage> (<year>2010</year>).<pub-id pub-id-type="pmid">20383002</pub-id></mixed-citation></ref><ref id="R59"><label>59.</label><mixed-citation publication-type="journal"><name><surname>McCoy</surname><given-names>AJ</given-names></name>
<etal/>
<article-title>Phaser crystallographic software.</article-title>
<source>J. Appl. Crystallogr</source>
<volume>40</volume>, <fpage>658</fpage>&#x02013;<lpage>674</lpage> (<year>2007</year>).<pub-id pub-id-type="pmid">19461840</pub-id></mixed-citation></ref><ref id="R60"><label>60.</label><mixed-citation publication-type="book"><name><surname>Delano</surname><given-names>W</given-names></name>
<source>The PyMOL Molecular Graphics System</source> (<publisher-name>Schr&#x000f6;dinger LLC</publisher-name>).</mixed-citation></ref><ref id="R61"><label>61.</label><mixed-citation publication-type="journal"><name><surname>Chou</surname><given-names>PY</given-names></name> &#x00026; <name><surname>Fasman</surname></name>, and <name><surname>G. D.</surname></name>
<article-title>Empirical Predictions of Protein Conformation.</article-title>
<source>Annu. Rev. Biochem</source>
<volume>47</volume>, <fpage>251</fpage>&#x02013;<lpage>276</lpage> (<year>1978</year>).<pub-id pub-id-type="pmid">354496</pub-id></mixed-citation></ref></ref-list></back><floats-group><fig id="F1" orientation="portrait" position="float"><label>Figure 1.</label><caption><title>Fluorescence images of <italic>E. coli</italic> cells containing GFP-RNase A fusion proteins.</title><p id="P59">(<bold>a</bold>) Fusion constructs. RNaseA and truncated segments were fused to the C-terminus of GFP via a 3-residue linker (white box). (<bold>b</bold>) After overnight induction of GFP-AMA-RNaseA fusion protein, cells were imaged before (left) or after incubation for 2 h at 4<sup>o</sup>C (right). (<bold>c</bold>) Variations in the linker of GFP-AMA-RNaseA(1&#x02013;14) cause variation in rod-forming propensities, as observed after overnight induction before (AMA, ALA [left]) or after incubation for 24 h at 4<sup>o</sup>C (AQA, ALA [right], AEA, AKA, AAA). The graph indicates the energy of the highest propensity amyloid-forming hexapeptide predicted by ZipperDB<sup>19</sup>for each of the six variants. Most notably, the AQA variant predicted to have the strongest amyloid propensity produced no rods, suggesting that the rods are not amyloid-like. (<bold>d</bold>) Eight appended RNase A residues suffice for rod formation. (<bold>e</bold>) Appending hexa-histidine tag does not interfere with rod formation. Constructs in panels d and e were imaged after overnight induction and incubation for 2 h at 4<sup>o</sup>C. (<bold>f</bold>) V219E, a mutation within the GFP moiety, disrupts filament formation. Cells were imaged after overnight induction and incubation for 120 h at 4<sup>o</sup>C. For all panels, fluorescence images show representative fields, with illustrative portions enlarged (<bold>insets</bold>). (<bold>g</bold>) Fluorescence and phase contrast images of dividing cells connected by continuous rod structures. Cell-spanning rods are indicated by white arrowheads. A similar phenomenon was observed in <italic>E. coli</italic> cells overexpressing an actin homolog.<sup><xref rid="R47" ref-type="bibr">47</xref></sup> (<bold>h</bold>) Cells exhibiting rod-induced bulges at their poles. Bulges are indicated by yellow arrowheads. Cells in panels g and h contain GFP-RNaseA(1&#x02013;14) fusion protein, and were imaged after induction for 4 h and incubation for 24 h at 4<sup>o</sup>C. Scale bars represent 5 &#x003bc;m.</p></caption><graphic xlink:href="nihms-977255-f0001"/></fig><fig id="F2" orientation="portrait" position="float"><label>Fig. 2.</label><caption><title>Structure of the GFP-RNase(1&#x02013;8) protofilament determined by x-ray crystallography.</title><p id="P60">(<bold>a and c</bold>) Two orientations of the protofilament related by a 45<sup>o</sup> rotation around the vertical axis. The protofilament is represented as a schematic on the left (large cylinders represent the GFP &#x003b2;-barrel) and cartoon ribbons on the right. The protofilament axis (with 2-fold screw symmetry) is marked by the vertical black line. (<bold>b</bold>) Superposition of GFP protofilaments from four crystal forms, each shaded according to the color key. The three center protomers (blue, brown and green) were superposed as unified trimers, rather than monomers. The four protofilaments superimpose nearly identically, indicating the protofilament architecture is highly conserved and robust. (<bold>d, e, f</bold>) Molecular interfaces responsible for robust assembly. (<bold>d</bold>) A close-up view of the fortuitously formed C-terminal helix of protomer i (cartoon ribbon) fitting in a groove between i+1 and i+2, represented as solvent accessible surfaces. The sequence of the C-terminal fusion and its origins are diagrammed. Arrows connect the individual sequence components to the structure. (<bold>e</bold>) Details of the helix-groove interface contacts. (<bold>f</bold>) Barrel-to-barrel stacking contacts between protomers i+1 and i+2 that facilitate fibril formation. The barrel-barrel interface has been observed as a crystal packing interaction previously (for example, see PDB ID 1EMA<sup><xref rid="R48" ref-type="bibr">48</xref></sup>).</p></caption><graphic xlink:href="nihms-977255-f0002"/></fig><fig id="F3" orientation="portrait" position="float"><label>Fig. 3.</label><caption><title>Protofilament architecture and bundling <italic>in vivo</italic>.</title><p id="P61">(<bold>a</bold>) Four independent crystal structures of the GFP-RNase fusion reveal essentially identical protofilament assemblies (<bold>b</bold>) Remarkably, this consensus protofilament structure (one is highlighted in green, blue, brown) is unperturbed despite differences in its alignment with neighboring protofilaments (parallel in C2, and P2<sub>1</sub>, and antiparallel in P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> forms 1 and 2) and different spacings between neighboring protofilaments. Only one of the unit cell dimensions is constant among the four crystal forms, and this corresponds to the filament pitch (brown scale bar). Crystallographic 2<sub>1</sub> symmetry axes are denoted by black symbols and half-arrows. (<bold>c</bold>) An X-ray diffraction pattern from dried <italic>E. coli</italic> cells carrying GFP-RNase(1&#x02013;8) filaments reveals six low resolution reflections (labeled 55 to 25 &#x000c5;). (<bold>d</bold>) Diffraction radial profiles calculated for each crystal form (orange traces) are overlaid with the observed radial profile of <italic>E. coli</italic> cells (blue dashed traces). Remarkably, one of the calculated patterns (space group C2) matches the observed pattern, implying that protofilament bundling <italic>in vivo</italic> resembles the packing observed in the C2 crystal. Correlation coefficient between observed and simulated diffraction profiles are noted in the lower right corner of each subpanel. (<bold>e</bold>) Calculated fiber diffraction pattern of the C2 crystal structure. (<bold>f</bold>) Overlay of observed and calculated (space group C2) diffraction patterns. Bands of dark green color highlight the coincidence of the most intense reflections. Similarity is evident in the Bragg spacings, relative intensities of the reflections, and orientation of the reflections with respect to the fibril axis (fibril axis is vertical). Labels indicate Miller indices and Bragg spacings.</p></caption><graphic xlink:href="nihms-977255-f0003"/></fig><fig id="F4" orientation="portrait" position="float"><label>Figure 4.</label><caption><title>Four basic protofilament-forming mechanisms and their observed combinations.</title><p id="P62">(<bold>a</bold>) In domain stacked filaments, protomers stack together through rigid domains, with no intervening hinges to confer flexibility. (<bold>b</bold>) Bridging interfaces join nonconsecutive protomers, creating a mutually supportive closed network that includes the intervening &#x0201c;bridged&#x0201d; protomer(s) (<bold>c-e</bold>) Runaway domains swapped filaments are characterized by the exchange of domains between neighboring protomers. Hinge loops, which link the domains, confer flexibility to the filament. Nucleation usually requires lengthy or harsh conditions because the closed, self-complemented state (<bold>c</bold>) is far more stable than the open state (<bold>d</bold>) in which the swapping element (in this case indicated by a bumpy rectangle) is exposed and the complementing surface (indicated by a cavity) is available for swapping. (<bold>f-h</bold>) Protofilament assembly through runaway domain coupling features a similar exchange of domains between protomers, but is more easily achieved because the donor domain is permanently open (<bold>g</bold>), geometrically restricted from self-complementing (for example, by a hinge loop that is too short). Thus no closed monomer exists (<bold>f</bold>) to compete for protofilament assembly. (<bold>i-k</bold>)Combinations of stacking, bridging, and coupling elements are commonly observed. For example, stacking, bridging and coupling elements are consolidated in the GFP fusion (<bold>j-k</bold>). In this case, the complementation site is divided over two surfaces (<bold>j</bold>). That is, two (or more) protomers cooperate to complement the swapping domain (<bold>k</bold>), rather than just one (<bold>h</bold>). Cooperativity implies a larger entropic barrier to nucleation because the nucleus is composed of more protomers, but this limit may be offset by the absence of closed monomer to compete for protofilament assembly. Conceivably, stacking, bridging, and swapping could also coexist if the complementation site resided entirely within protomer i+2.</p></caption><graphic xlink:href="nihms-977255-f0004"/></fig><table-wrap id="T1" position="float" orientation="portrait"><label>Table 1.</label><caption><p id="P63">Crystallographic data collection and refinement statistics</p></caption><table frame="hsides" rules="groups"><colgroup span="1"><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/></colgroup><thead><tr><th align="left" valign="top" rowspan="1" colspan="1"/><th align="left" valign="top" rowspan="1" colspan="1">GFP-RNase(1&#x02013;8)-<break/>His<sub>6</sub><break/>(PDB 5HGE)</th><th align="left" valign="top" rowspan="1" colspan="1">GFP-RNase(1&#x02013;8)-<break/>His<sub>6</sub><break/>(PDB 6AS9)</th><th align="left" valign="top" rowspan="1" colspan="1">GFP-RNase(1&#x02013;8)-<break/>His<sub>6</sub><break/>(PDB 5HBD)</th><th align="left" valign="top" rowspan="1" colspan="1">GFP-RNase(1-<break/>8)-His<sub>6</sub><break/>(PDB 5HW9)</th></tr></thead><tbody><tr><td align="left" valign="top" rowspan="1" colspan="1"><bold>Data collection</bold></td><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Space group</td><td align="left" valign="top" rowspan="1" colspan="1">P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> form 1</td><td align="left" valign="top" rowspan="1" colspan="1">P2<sub>1</sub>2<sub>1</sub>2<sub>1</sub> form 2</td><td align="left" valign="top" rowspan="1" colspan="1">C2</td><td align="left" valign="top" rowspan="1" colspan="1">P2<sub>1</sub></td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Cell dimensions</td><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">&#x02003;<italic>a</italic>, <italic>b</italic>, <italic>c</italic> (&#x000c5;)</td><td align="left" valign="top" rowspan="1" colspan="1">51.2, 62.9, 69.3</td><td align="left" valign="top" rowspan="1" colspan="1">51.1, 54.5, 85.5</td><td align="left" valign="top" rowspan="1" colspan="1">112.7, 51.3, 55.1</td><td align="left" valign="top" rowspan="1" colspan="1">45.7, 51.8, 54.8</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">&#x02003;<italic>&#x003b1;</italic>, <italic>&#x003b2;</italic>, <italic>&#x003b3;</italic> (&#x000b0;)</td><td align="left" valign="top" rowspan="1" colspan="1">90.0,90.0,90.0</td><td align="left" valign="top" rowspan="1" colspan="1">90.0,90.0,90.0</td><td align="left" valign="top" rowspan="1" colspan="1">90.0,99.6,90.0</td><td align="left" valign="top" rowspan="1" colspan="1">90.0, 96.0, 90.0</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Resolution (&#x000c5;)</td><td align="left" valign="top" rowspan="1" colspan="1">46.0&#x02013;1.86(1.96&#x02013;<break/>1.86)<sup><xref rid="TFN1" ref-type="table-fn">a</xref></sup></td><td align="left" valign="top" rowspan="1" colspan="1">46.0&#x02013;1.75(1.80&#x02013;<break/>1.75)</td><td align="left" valign="top" rowspan="1" colspan="1">46.0&#x02013;1.65(1.75&#x02013;<break/>1.65)</td><td align="left" valign="top" rowspan="1" colspan="1">46.0&#x02013;3.0(3.08&#x02013;<break/>3.00)</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"><italic>R</italic><sub>merge</sub></td><td align="left" valign="top" rowspan="1" colspan="1">0.106 (0.556)</td><td align="left" valign="top" rowspan="1" colspan="1">0.117 (0.585)</td><td align="left" valign="top" rowspan="1" colspan="1">0.094 (0.534)</td><td align="left" valign="top" rowspan="1" colspan="1">0.189 (0.750)</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"><italic>I/&#x003c3;</italic>(<italic>I</italic>)</td><td align="left" valign="top" rowspan="1" colspan="1">9.4 (2.3)</td><td align="left" valign="top" rowspan="1" colspan="1">14.1 (3.8)</td><td align="left" valign="top" rowspan="1" colspan="1">13.7 (2.3)</td><td align="left" valign="top" rowspan="1" colspan="1">5.8 (1.2)</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"><italic>CC</italic><sub>1/2</sub></td><td align="left" valign="top" rowspan="1" colspan="1">0.997(0.836)</td><td align="left" valign="top" rowspan="1" colspan="1">0.996 (0.630)</td><td align="left" valign="top" rowspan="1" colspan="1">0.998(0.765)</td><td align="left" valign="top" rowspan="1" colspan="1">0.975 (0.554)</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Completeness (%)</td><td align="left" valign="top" rowspan="1" colspan="1">96.7 (98.2)</td><td align="left" valign="top" rowspan="1" colspan="1">99.4 (98.5)</td><td align="left" valign="top" rowspan="1" colspan="1">91.2 (63.8)</td><td align="left" valign="top" rowspan="1" colspan="1">95.1 (84.4)</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Redundancy</td><td align="left" valign="top" rowspan="1" colspan="1">3.7 (3.7)</td><td align="left" valign="top" rowspan="1" colspan="1">7.1(7.0)</td><td align="left" valign="top" rowspan="1" colspan="1">6.0 (2.9)</td><td align="left" valign="top" rowspan="1" colspan="1">3.2 (2.1)</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"><bold>Refinement</bold></td><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Resolution (&#x000c5;)</td><td align="left" valign="top" rowspan="1" colspan="1">46.0&#x02013;1.86</td><td align="left" valign="top" rowspan="1" colspan="1">46.0&#x02013;1.75</td><td align="left" valign="top" rowspan="1" colspan="1">46.0&#x02013;1.65</td><td align="left" valign="top" rowspan="1" colspan="1">46.0&#x02013;3.00</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">No. reflections</td><td align="left" valign="top" rowspan="1" colspan="1">18,492</td><td align="left" valign="top" rowspan="1" colspan="1">24,416</td><td align="left" valign="top" rowspan="1" colspan="1">30,817</td><td align="left" valign="top" rowspan="1" colspan="1">4,963</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"><italic>R</italic><sub>work</sub> / <italic>R</italic><sub>free</sub></td><td align="left" valign="top" rowspan="1" colspan="1">0.169/0.211</td><td align="left" valign="top" rowspan="1" colspan="1">0.156/0.156</td><td align="left" valign="top" rowspan="1" colspan="1">0.174/0.206</td><td align="left" valign="top" rowspan="1" colspan="1">0.186/0.232</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">No. atoms</td><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">&#x02003;Protein</td><td align="left" valign="top" rowspan="1" colspan="1">1937</td><td align="left" valign="top" rowspan="1" colspan="1">1952</td><td align="left" valign="top" rowspan="1" colspan="1">1925</td><td align="left" valign="top" rowspan="1" colspan="1">1932</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">&#x02003;Ligands</td><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">&#x02003;&#x02003;2-Methyl-2,4-pentanediol</td><td align="left" valign="top" rowspan="1" colspan="1">8</td><td align="left" valign="top" rowspan="1" colspan="1">16</td><td align="left" valign="top" rowspan="1" colspan="1">0</td><td align="left" valign="top" rowspan="1" colspan="1">0</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">&#x02003;&#x02003;Acetate</td><td align="left" valign="top" rowspan="1" colspan="1">0</td><td align="left" valign="top" rowspan="1" colspan="1">8</td><td align="left" valign="top" rowspan="1" colspan="1">0</td><td align="left" valign="top" rowspan="1" colspan="1">0</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">&#x02003;Water</td><td align="left" valign="top" rowspan="1" colspan="1">172</td><td align="left" valign="top" rowspan="1" colspan="1">237</td><td align="left" valign="top" rowspan="1" colspan="1">312</td><td align="left" valign="top" rowspan="1" colspan="1">6</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"><italic>B</italic> factors</td><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">&#x02003;Protein</td><td align="left" valign="top" rowspan="1" colspan="1">24.4</td><td align="left" valign="top" rowspan="1" colspan="1">14.5</td><td align="left" valign="top" rowspan="1" colspan="1">19.1</td><td align="left" valign="top" rowspan="1" colspan="1">50.0</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">&#x02003;Ligands</td><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">&#x02003;&#x02003;2-Methyl-2,4-pentanediol</td><td align="left" valign="top" rowspan="1" colspan="1">46.7</td><td align="left" valign="top" rowspan="1" colspan="1">36.7</td><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">&#x02003;&#x02003;Acetate</td><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">24.3</td><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">&#x02003;Water</td><td align="left" valign="top" rowspan="1" colspan="1">30.6</td><td align="left" valign="top" rowspan="1" colspan="1">25.9</td><td align="left" valign="top" rowspan="1" colspan="1">30.9</td><td align="left" valign="top" rowspan="1" colspan="1">39.2</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">R.m.s. deviations</td><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">&#x02003;Bond lengths (&#x000c5;)</td><td align="left" valign="top" rowspan="1" colspan="1">0.010</td><td align="left" valign="top" rowspan="1" colspan="1">0.010</td><td align="left" valign="top" rowspan="1" colspan="1">0.013</td><td align="left" valign="top" rowspan="1" colspan="1">0.010</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">&#x02003;Bond angles (&#x000b0;)</td><td align="left" valign="top" rowspan="1" colspan="1">1.1</td><td align="left" valign="top" rowspan="1" colspan="1">1.1</td><td align="left" valign="top" rowspan="1" colspan="1">1.7</td><td align="left" valign="top" rowspan="1" colspan="1">1.2</td></tr></tbody></table><table-wrap-foot><fn id="TFN1"><label>a</label><p id="P64">Values in parentheses are for highest-resolution shell.</p></fn></table-wrap-foot></table-wrap></floats-group></article>