Conceived and designed the experiments: HY LMC PJC ROD JS. Performed the experiments: HY LMC PJC JS. Analyzed the data: HY LMC PJC ROD JS. Wrote the paper: HY LMC PJC ROD JS.
Human infections with subtype H7 avian influenza viruses have been reported as early as 1979. In 1996, a genetically stable 24-nucleotide deletion emerged in North American H7 influenza virus hemagglutinins, resulting in an eight amino acid deletion in the receptor-binding site. The continuous circulation of these viruses in live bird markets, as well as its documented ability to infect humans, raises the question of how these viruses achieve structural stability and functionality. Here we report a detailed molecular analysis of the receptor binding site of the North American lineage subtype H7N2 virus A/New York/107/2003 (NY107), including complexes with an avian receptor analog (3′-sialyl-N-acetyllactosamine, 3′SLN) and two human receptor analogs (6′-sialyl-N-acetyllactosamine, 6′SLN; sialyllacto-N-tetraose b, LSTb). Structural results suggest a novel mechanism by which residues Arg220 and Arg229 (H3 numbering) are used to compensate for the deletion of the 220-loop and form interactions with the receptor analogs. Glycan microarray results reveal that NY107 maintains an avian-type (α2-3) receptor binding profile, with only moderate binding to human-type (α2-6) receptor. Thus despite its dramatically altered receptor binding site, this HA maintains functionality and confirms a need for continued influenza virus surveillance of avian and other animal reservoirs to define their zoonotic potential.
Influenza virus adaptation to different hosts usually results in a switch in receptor specificity of the viral surface coat protein, hemagglutinin. Indeed, the hemagglutinin subtypes from the last two human influenza pandemics of the 20th Century (H2 in 1957 and H3 1968) both adapted successfully to human-type receptor specificity through only two amino acid mutations in the receptor binding pocket (Glutamine226→Leucine and Glycine228→Serine). The recent human infections reported with other avian subtypes such as H5, H7 and H9 have raised public health concerns and focused efforts on identifying potential subtypes from which a future pandemic strain may emerge. Since 1996, H7 viruses of the North American lineage have been circulating in regional live bird markets, containing an eight amino acid deletion in the receptor-binding site of HA. Here we report a detailed structural analysis of the receptor binding site of a hemagglutinin from the North American lineage of H7N2 viruses, in complex with avian and human receptor analogs, to understand how these viruses have adapted to such a dramatic structural change in the binding site while remaining one of the predominant circulating viral strains.
Influenza is an acute respiratory virus that infects up to 20% of the population in the United States, resulting in ∼36,000 deaths annually
Early cases of human infection by H7 influenza viruses are reported as far back as 1979
Since 1996, H7 viruses of the North American lineage have been circulating in regional live bird markets
Here we report a detailed molecular analysis of the RBS of the HA from North American lineage H7N2 virus, NY107, including glycan microarray analyses and structural analyses of the HA in complex with an avian receptor analog (3′-Sialyl-N-acetyllactosamine, 3′SLN) and two human receptor analogs (6′-Sialyl-N-acetyllactosamine, 6′SLN; Sialyllacto-N-tetraose b, LSTb). These results provide important insight into the interaction of H7 HAs with both avian and human hosts.
By using x-ray crystallography, the structure of H7 HA from the NY107 virus was determined to 2.6 Å resolution (
(A) One monomer is shown with the HA1 chain colored in green and the HA2 chain in cyan. The location of the receptor binding site and the HA1/HA2 cleavage site are circled. (B) The superposition of receptor binding domains of NY107 (green), Av-H7 (marine), 1918-Hu-H1 (magenta), Hu-H5 (yellow), Hu-H3 (orange), and Sw-H9 (grey). The proximity of Arg220 and Gln226 are highlighted. Three structural elements comprising this binding site are labeled. The two major differences are the extended 150-loop and the deletion of 220-loop of NY107. (C) Overlap of NY107 (green) and Av-H7 (marine) (PDB: 1TI8) illustrates the compensatory effect of R220 bringing it close to the position occupied by G228 in the avian HA. (D) Overlap of the NY107 (green), NY107- 3′SLN (orange), NY107-6′SLN (red), and NY107-LSTb (magenta) structures. All the figures were generated and rendered with the use of MacPyMOL
| NY107 | NY107+3′SLN | NY107+6′SLN | NY107+LSTb | |
| Space group | P212121 | P212121 | P212121 | P212121 |
| Cell dimensions (Å) | 66.96, 115.92, 251.61 | 67.80, 116.70, 249.84 | 66.60, 116.58, 250.68 | 67.08, 116.52, 251.95 |
| Resolution (Å) | 50-2.6 (2.69-2.60) | 30-2.7 (2.80-2.70) | 50-3.0 (3.11-3.0) | 50-2.6 (2.69-2.60) |
| Rsym or Rmerge | 10.6 (41.3) | 14.6 (48.6) | 14.3 (35.4) | 12.2 (31.5) |
| I/σ | 39.6 (2.0) | 24.3 (1.7) | 34.2 (8.2) | 40.5 (9.9) |
| Completeness (%) | 99.2 (99.0) | 99.3 (94.6) | 92.3 (75.6) | 91.3 (86.2) |
| Redundancy | 7.2 (6.2) | 5.8 (5.5) | 4.9 (4.4) | 10.9 (11.2) |
| Resolution (Å) | 50-2.6 (2.67-2.60) | 30-2.7 (2.77-2.70) | 50-3.0 (3.08-3.00) | 50-2.6 (2.67-2.60) |
| No. of reflections (total) | 57285 | 51770 | 33421 | 53603 |
| No. of reflections (test) | 3053 | 2769 | 1779 | 2842 |
| Rwork/Rfree | 21.7/25.6 | 21.4/26.4 | 20.5/26.0 | 20.4/24.7 |
| No. of atoms | 11795 | 11878 | 11648 | 12108 |
| r.m.s.d.- bond length (Å) | 0.006 | 0.006 | 0.008 | 0.006 |
| r.m.s.d.- bond angle (°) | 0.905 | 0.974 | 1.085 | 0.859 |
| Favored (%) | 96.9 | 96.5 | 94.3 | 97.1 |
| Outliers (%) (No. of residues) | 0.1 (1/1434) | 0.0 (0/1429) | 0.1 (2/1433) | 0.1 (2/1435) |
Numbers in parentheses refer to the highest resolution shell.
Reference
During viral replication, HA is synthesized as a single chain precursor (HA0) and cleaved by specific host proteases into the infectious HA1/HA2 form. In baculovirus expression systems, highly pathogenic HAs, with a polybasic cleavage site, are expressed as an HA1/HA2 form
Based on their molecular phylogenies, HAs are divided into two groups and five clades: group 1 includes H8, H9, and H12; H1, H2, H5, and H6; H11, H13 and H16; group 2 includes H3, H4, and H14; H7, H10 and H15
The RBS is at the membrane distal end of each HA monomer and its specificity for sialic acid and the nature of its linkage to a vicinal galactose residue is a major determinant of host range-restriction. The consensus RBS for all current HAs is composed of three major structural elements: a 190-helix (residues 188–194), a 220-loop (residues 221–228), and a 130-loop (residues 134–138). In addition, highly conserved residues (Tyr98, Trp153, His183, and Tyr195) form the base of the pocket.
Although the NY107 RBS is similar to other subtypes (H1, H2, H3, H5, and H9), a previously observed specific feature of H7 HAs, is also observed in the NY107 150-loop region: two residues inserted at position 158 result in this loop protruding more than 6Å towards the binding site compared to other subtype HAs (
Previously, mutations in the HA receptor binding domains of H1N1 (Glu190Asp/Gly225Asp) and H2N2/H3N2 (Gln226Leu and Gly228Ser) subtypes were responsible for adaptation of these viruses to pandemic strains
Glycan microarray analysis of recombinant NY107 (
Glycan microarray analysis of recombinant NY107 HA (A) and NY107 virus (B) compared to the recHA (C) and virus (D) from a Eurasian lineage A/Netherlands/219/2003 H7 influenza virus that was circulating in the same year and also infected a human. Colored bars highlight glycans that contain α2-3 SA (blue) and α2-6 SA (red), α2-6/α2-3 mixed SA (purple), N-glycolyl SA (green), α2-8 SA (brown), β2-6 and 9-O-acetyl SA, and non-SA (grey). Error bars reflect the standard error in the signal for six independent replicates on the array. Structures of each of the numbered glycans are found in
| Glycan Group | Graph Number | NY107 RecHA | NY107 Virus | NY107-ins Virus | NY107-ins E186G Virus | NY107-ins R205G Virus | NY107-ins E186G/R205G Virus | NL219 RecHA | NL219 Virus |
| α | |||||||||
| Sulfated | 4–8 | +++ | +++ | +++ | +++ | +++ | +++ | +++ | +++ |
| Branched | 9–11 | +++ | +++ | + | +++ | +++ | +++ | +++ | +++ |
| Linear | 12–27 | + | +++ | + | +++ | +++ | +++ | +++ | +++ |
| Fucosylated | 28–34 | − | +++ | +++ | +++ | +++ | +++ | +++ | +++ |
| α | |||||||||
| Sulfated | 41 | − | − | − | − | − | − | − | − |
| Branched mono-sialyl | 42–45, 49 | − | − | − | − | − | − | − | − |
| Branched di-sialyl | 46–48 | − | +++ | − | − | − | − | − | − |
| Linear | 50–56 | − | − | − | − | − | − | − | − |
| Internal | 58–59 | +++ | +++ | − | − | − | − | − | − |
| Sialic acid | 1–2 | − | +++ | + | − | − | − | − | − |
| α2-3/α2-6 Branched | 60–64 | − | − | − | +++ | +++ | +++ | +++ | +++ |
| Neu5Gc | 65–72 | − | − | − | +++ | +++ | +++ | − | +++ |
Members of each group are identified according to the graph number used in the microarray data in
Binding of samples to glycan subclasses are qualitatively estimated based on relative strength of the signal for the data shown in
N-glycolylneuraminic acid.
To further assess the effect of the missing 220 loop on HA structural stability and receptor specificity it was essential to evaluate these functions on the ancestral HA containing the full length 220-loop. To this end, we engineered an HA with an avian H7 consensus (PQVNGQSG) 220-loop re-introduced (NY107-220ins) into the NY107 HA and recovered this virus by reverse genetics. Compared to the NY107 virus (
NY107 was engineered to restore the 220-loop to a consensus full-length HA from 1996 (A) and additional co-variant amino acid substitutions, Glu186Gly (B), Arg205Gly (C) and the double mutant Glu186Gly/Arg205Gly (D) to restore, on the NY107 framework, an HA RBS found in viruses prior to the introduction of the deletion in North American viruses. Colored bars group glycans as described in
When viruses containing this 220-loop deletion emerged in North America in the mid 90's, four additional amino acid substitutions, Gly114Arg, Asp119Gly, Gly186Glu and Gly205Arg, in the HA1 as well as an Asp19Asn in the HA2 chain were also introduced to most of the circulating isolates. Of these, Gly186Glu and Gly205Arg in the HA1 are close to the RBS, at the monomer interface, and could potentially modulate its structure and/or function. NY107 viruses with a restored consensus 220-loop and a single Glu186Gly (NY107-ins-186) or Arg205Gly (NY107-ins-205) substitution as well as the Glu186Gly/Arg205Gly double substitution (NY107-ins-186/205) were derived by reverse genetics and evaluated. Glycan microarray analysis for the three resulting viruses revealed similar glycan binding profiles with increased binding to α2-3 sialosides, including mixed α2-3/α2-6 branched sialosides (#60–64), α2-3 Neu5Gc (#66–70), but limited binding to the α2,6 sialosides (
To understand from a structural perspective how NY107 interacts with host receptors, we solved the structure of NY107 in complex with an avian and two human receptor analogs. For the avian receptor analog, 3′SLN, the electron density maps revealed well-ordered features for the Sia-1, Gal-2, and GlcNAc-3 in the NY107 HA complex structure (
The top panel shows the interactions of NY107 with (A) 3′SLN, (B) 6′SLN and (C) LSTb. NY107 is shown in orange/red/magenta cartoon respectively. The interacting HA residues are shown as green sticks. The bottom panel shows the electron density map of the ligands. The NY107 is shown in the same colors as above, and the ligands are shown as green sticks, the 2fo-fc electron density maps (contoured at 1σ) are shown in grey. Simulated annealing omit maps are shown in supplementary
Published avian HA structures with an intact 220-loop form very close interactions with Gal-2 of 3′SLN via residue Gln226 which is important in receptor specificity and host adaptation. For example, in the avian H7/3′SLN HA structure it interacts with Gal-2 O4
In the NY107/6′SLN complex, only Sia-1 and Gal-2 are ordered (
Glycan array results with NY107 revealed a strong binding signal for the internal α2-6 sialoside, LSTb. To further investigate this interaction, we solved the structure of the NY107/LSTb complex. The final model contained Sia-1, NAG-2, Gal-3, and Gal-5 in the RBS. Although glycan microarray data indicated NY107 to have a specific affinity for LSTb, few interactions were apparent from the crystal structure. Sia-1 still forms multiple hydrogen bonds with residues in the RBS (
Human infections by avian influenza viruses, including H7 subtypes, continue to pose a major public health threat. Although the species barrier prevents avian influenza viruses from widespread infection of the human population, the molecular determinants of efficient interspecies transmission and pathogenicity are still poorly understood. The viral coat protein HA however, is perhaps a critical molecule since previous pandemic viruses modified their receptor specificity and overcame the interspecies barrier to spread in the human population. Although HA structures alone and in complex with receptor analogs provide considerable insight into receptor binding, it is clear that HAs from different species and subtypes have significant structural variation. Indeed, low-pathogenic H7N2 avian influenza viruses with an 8 amino acid deletion within its RBS started to circulate in live-bird markets in the northeast United States in 1996. Despite what one would consider a debilitating mutation, these viruses have been reported as the predominant isolate
The crystal structures of NY107 and its complexes with both avian and human receptor analogs describe a mechanism as to how an influenza virus might adapt by dramatically altering its RBS, and still be functional. Arg220 of the HA1 chain of NY107 compensates for the loss of the 220-loop, by forming hydrogen bonds with Gal-2 from the avian analog (binding was not observed in either of the structures complexes with the human analogs). However, in the LSTb complex, branched Gal-5 forms extra interactions with the 130-loop, thus improving the binding preference for this particular glycan. Consistent with the structural evidence, glycan microarray analyses of NY107 revealed a strong binding preference for the branched α2-6 sialoside, LSTb. Except for the absence of the 220-loop, other key residues within the RBS are conserved in NY107 and thus, direct interactions with sialic acid are maintained.
The 220-loop is recognized as one of the three crucial structural elements in the RBS. Aside from the North American lineage H7N2 viruses, which have been circulating with a deletion (221–228) in this loop, there has been one other report describing a seven amino acid deletion (224–230) in a laboratory generated H3N2 escape mutant which was reported to have a slightly increased affinity for α2-3-linked glycans by hemagglutination assay
When compared to NL219, another co-circulating H7 avian virus HA (
Interestingly, NY107 and NL219 virus receptor binding and specificity has been addressed previously using glycan microarray analysis that reported a significantly increased preference for α2-6 and decreased preference for α2-3-linked sialosides
The 220-loop is an integral feature of the receptor binding site, and thus one would predict that such a deletion might have compromised this strain to be deleted from the population of circulating viruses. However, this was not the case
The NY107 RBS with its more restricted α2-3 glycan binding preference and weak/moderate increase in α2-6 binding may have given the virus a selective advantage to be maintained in poultry at live bird markets and supplying farms. Certain terrestrial birds, such as quails and chickens, have recently been shown to present both human and avian types of receptors in the trachea and intestine
Based on H3 numbering
Soluble NY107 was recovered from the cell supernatant by metal affinity chromatography using Ni-NTA resin (Qiagen Inc.). Fractions containing NY107 were pooled and dialyzed against 10 mM Tris-HCl, 50 mM NaCl, pH 8.0, then subjected to ion-exchange chromatography (IEX) using a Mono-Q HR 10/10 column (GE Healthcare). IEX purified NY107 was subjected to thrombin digest (3 units/mg protein; overnight at 4°C) and purified by gel filtration chromatography using a Superdex-200 16/60 column (GE Healthcare) and 50 mM Tris-HCl, 100 mM NaCl, pH 8.0 as running buffer. Protein eluting as a trimer was buffer exchanged into 10 mM Tris-HCl, 50 mM NaCl, pH 8.0 and concentrated to 14.5 mg/ml for crystallization trials. At this stage, the protein sample still contained the additional plasmid-encoded residues at both the N (ADPG) and C terminus (SGRLVPR).
Initial crystallization trials were set up using a Topaz Free Interface Diffusion (FID) Crystallizer system (Fluidigm Corporation, San Francisco, CA). Crystals were observed in several conditions containing PEG 3350 or PEG 4000. Following optimization, diffraction quality crystals for NY107 were obtained at room temperature using a modified method for microbath under oil
The structure of NY107 was determined by molecular replacement with Phaser
Wild type and mutant viruses of NY107 (H7N2) and A/Netherland/219/2003 (H7N7) were generated from plasmids by a reverse genetics approach
Glycan microarray printing and recHA analyses have been described previously
The atomic coordinates and structure factors of NY107 are available from the RCSB PDB under accession codes 3M5G for the unliganded NY107, 3M5H for the NY107 with 3′-SLN and 3M5I and 3M5J for NY107 with 6′SLN and LSTb, respectively.
A/New York/107/03 (H7N2), Genbank: ACC55270; A/Netherlands/219/03 (H7N7), Genbank: AAR02640; A/Hong Kong/1-9/68 (H3N2), 2HMG; A/Duck/Ukraine/1/63 (H3N8), PDB: 1MQL; A/South Carolina/1/18 (H1N1), PDB: 1RD8; A/Puerto Rico/8/34 (H1N1), PDB: 1RU7; A/Swine/Iowa/15/30 (H1N1), PDB: 1RUY; A/Singapore/1/1957 (H2N2), PDB: 2WRC; A/Viet Nam/1203/04 (H5N1), PDB: 2FK0; A/Duck/Singapore/3/97 (H5N3), PDB: 1JSM; A/Swine/Hong Kong/9/98 (H9N2), PDB: 1JSD; A/Turkey/Italy/8000/02 (H7N3), PDB: 1TI8; C/Johannesburg/1/66, 1FLC.
Sequence alignment of selected structurally available HAs. Human H3 (PDB: 2HMG), Avian H3 (PDB: 1MQL), 1918-Human H1 (PDB: 1RD8), 1934-Human H1 (PDB: 1RU7), Swine H1 (PDB: 1RUY), 1957-Huamn H2 (PDB: 2WRC), Human H5 (PDB: 2FK0), Avian H5 (PDB: 1JSM), Swine H9 (PDB: 1JSD), and Avian H7 (PDB: 1TI8) were used in the alignments. The fusion domain of HA1 is highlighted in magenta, the vestigial esterase domain is highlighted in green, the receptor binding domain is highlighted in blue, and the fusion domain of HA2 is highlighted in red. Residue numbering is based on the H3 HA sequence.
(2.84 MB TIF)
Click here for additional data file.
Expression and purification of NY107. SDS-PAGE reveals that NY107 was expressed as the HA0 form with a mass approximately 60kDa (middle lane). Thrombin cleavage resulted in an unexpected reduction in band size to a HA1/HA2 profile (right lane) with possible multiple glycoforms for the HA2 clearly present.
(0.23 MB TIF)
Click here for additional data file.
Comparison of glycan binding to NY107 with other HAs. A. Overlap of α2-3 ligands binding in the receptor binding site from NY-H7 (green), Av-H3 (orange), 1930-Hu-H1 (magenta), 1957-Hu-H2 (cyan), Av-H5 (yellow), and Sw-H9 (grey). B. Overlap of α2-6 linkage ligands binding in the receptor binding site from NY-H7 (green), Av-H3 (orange), 1930-Hu-H1 (magenta), 1957-Hu-H2 (cyan), Av-H5 (yellow), and Sw-H9 (grey).
(2.55 MB TIF)
Click here for additional data file.
Simulated annealing omit maps of the receptor binding site (contoured at 1σ). A. NY107 (blue), B. NY107-3′SLN (orange), C. NY107-6′SLN (red), and D. NY107-LSTb (magenta). The protein model is shown in cartoon, and the residues involved in the binding to receptor analogs were shown in sticks. Maps were generated using version 1.2 of the Crystallography and NMR System (CNS) software.
(1.93 MB TIF)
Click here for additional data file.
Comparison of NY107 RBS to HEF. Overlap of RBS from NY107 (green), Av-H7 (marine) and HEF (magenta).
(1.12 MB TIF)
Click here for additional data file.
Comparison of r.m.s.d. (Å) for different HA domains. For analyzing differences in the overall structure, r.m.s.d. values were calculated between monomers or domains of different HA's, after the Cα atoms of the HA2 domains were superposed by sequence and structural alignment onto the equivalent domains of NY107.
(0.04 MB DOC)
Click here for additional data file.
Comparison of r.m.s.d. (Å) for individual domains. Each domain was superimposed separately to determine how the individual NY107 domains compared to equivalent domains in the other structures.
(0.04 MB DOC)
Click here for additional data file.
Molecular interactions between NY107 and receptor analogs. The hydrogen bond cutoff is 3.8 Å for the listing interactions.
(0.07 MB DOC)
Click here for additional data file.
Glycan array differences between NY107, the fully restored NY107-ins, and NL219 (virus and rHA). The color coding in the left hand column reflects the same coloring scheme used in
(0.19 MB DOC)
Click here for additional data file.
The authors would like to thank the staff of SER-CAT sector 22 for their help in data collection. We also thank WHO Global Influenza Surveillance Network for providing NY107 and NL219 viruses from which the reverse genetics viruses were generated. Glycan microarray data presented here will be made available on-line through the CFG web site upon publication (
The authors have declared that no competing interests exist.
This work was funded by the Centers for Disease Control and Prevention. Use of the Advanced Photon Source at Argonne National Laboratory was supported by the U. S. Department of Energy, Office of Science, Office of Basic Energy Sciences, under Contract No. DE-AC02-06CH11357. Glycan microarrays as well as glycans for direct binding experiments were produced for the Centers for Disease Control by the CFG funded by National Institute of General Medical Sciences Grant GM62116. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.