Academic Editor: Hiromi Nishida
The Lyssavirus glycoprotein (G) is a membrane protein responsible for virus entry and protective immune responses. To explore possible roles of the glycoprotein in host shift or adaptation of
Positive selection and recombination are important mechanisms in microbial pathogen adaption to new hosts, resistance to antibiotics, and evasion of immune responses [
Here, we used newly developed computational algorithms as well as traditional methods to investigate potential recombination events and selection pressures in the G gene of Lyssaviruses. The dataset for the study was comprised of 53 full-length glycoprotein gene sequences isolated from different hosts in 21 countries over a period of 70 years. We hypothesized that if different hosts with rabies infections over decades did not lead to positive selection or recombination events in the G gene, the gene does not play a significant role in lyssavirus adaptation.
We choose a dataset that covers lyssavirus isolates spatially and geographically over a long period of time in various animal hosts. Fifty-three full-length G sequences from 21 countries isolated over a period of 70 years were retrieved from NCBI GenBank. The sequences were aligned using fast statistical alignment (FSA, [
A phylogenetic tree was reconstructed by using the neighbor joining algorithm in the MEGA 4 package [
We first applied PHI [
To test positive selection on sites of the G gene in Lyssaviruses, the Codeml program in PAML software package version 4.4 was employed [
We also applied single-likelihood ancestor counting (SLAC), fixed-effects likelihood (FEL), and random-effects likelihood (REL) [
Our dataset covered lyssaviruses isolated over a period of 70 years from 21 countries (
The PHI and Max
By using 3SEQ, 6 long recombinant sequences (>100 bp) were detected: AF233275, AY237121, AY987478, DQ074978, DQ849071, and L04523 (
The analysis by using GARD also suggested evidence of recombination with significant topological incongruence at the 2 breakpoints (
We analyzed the recombination events by using BootScanning as implemented in SimPlot. Sequence AY987478 was used as a query sequence in all four cases (Figures
Since recombination with 2 breakpoints was predicted by 3SEQ, GARD, and Bootscanning, we constructed phylogenetic trees by using sequences from the beginning to the first breakpoint and the sequences from the second breakpoint to the end (
The selection pressure analysis with the glycoprotein gene by using PAML is presented in
To test the effect of recombination on positive selection analysis, we excluded the putative recombinant AY987478 from the dataset. Similar results were observed, and the BEB posterior probability supports for amino acid sites under positive selection were nonsignificant (
To study the effect of viral passages and possible genetic bottlenecks on the results, we repeated the analysis with a dataset excluding six vaccine sequences and the sequence AF233275 (PV11) from cell culture of lyssaviruses under intensive cell culture. We found no significant evidence for positive selection pressure on any site of the G gene.
Analyses using SLAC, REL, and FEL found no evidence of any amino acid in the G gene under positive selection, instead most of the amino acids were found to be under negative selection (
Lyssaviruses can infect all warm-blooded mammals, and spillover events and host shift have been well documented [
Recombination is another important evolutionary driving force in adaptation, and it is a mechanism that prevents the accumulation of deleterious substitutions [
In summary, we did not find significant support for positive selection pressure on G gene in lyssavirus isolates from different rabies hosts and vaccine strains that cover 70 years of evolution in 21 countries. The recombination analysis suggested an orphan event that needs further investigation. It appears that evolution of the G gene may not play a major role in lyssavirus adaptation. It is surprising considering the functions of glycoprotein in lyssavirus infection. It has been reported that host switching from chiropters to carnivores has occurred in lyssavirus evolution history [
The authors thank Jan Pohl, Elizabeth Neuhaus, and Charles E. Rupprecht for support in this investigation. They also thank Kathryn Kellar and Scott Sammons for helpful suggestions to the paper. Use of trade names and commercial sources are for identification only and do not imply endorsement by the U S Department of Health and Human Services. The findings and conclusions in this paper are those of the authors and do not necessarily represent the views of the funding agency.
Bootscanning analysis of recombination in glycoprotein gene of lyssavirus by using the SimPlot program with a window size of 200 nucleotides and a step size of 10 nucleotides.
(a) NJ phylogenetic tree of 53 glycoprotein gene sequences with regions concatenated from position of 1 to 441 and position of 1090 to 1572. Bootstrap values of 1000 replicates are shown above the branches. The red marker represents the putative recombinant. (b) NJ phylogenetic tree of 53 glycoprotein gene sequences with region from position of 441 to 1089. Bootstrap values of 1000 replicates are shown above the branches. The red marker represents the putative recombinant.
Sequences of glycoprotein gene used in this study.
| Accession no. | Country | Host | Year of isolation | Strain/isolate | Genotype | References |
|---|---|---|---|---|---|---|
| AB115921 | Indonesia | Dog | 2001 | SN01-23 | GT1 | Unpublished |
| AF233275 | India | Sheep | PV11 | GT1 | Unpublished | |
| AF298141 | USA | Bat | 1979 | USA7-BT | GT1 | Badrane et al. [ |
| AF298142 | Poland | Bat | 1985 | EBL1POL | GT5 | Badrane et al. [ |
| AF298143 | France | Bat | 1989 | EBL1FRA | GT5 | Badrane et al. [ |
| AF298144 | Finland | Bat | 1986 | EBL2FIN | GT6 | Badrane et al. [ |
| AF298145 | Holland | Bat | 1986 | EBL2HOL | GT6 | Badrane et al. [ |
| AF298146 | S. Africa | Bat | 1970 | DuvSAF1 | GT4 | Badrane et al. [ |
| AF298147 | S. Africa | Bat | 1981 | DuvSAF2 | GT4 | Badrane et al. [ |
| AF325487 | Malaysia | Human | 1985 | MAL1-HM | GT1 | Badrane and Tordo [ |
| AF325489 | Nepal | Dog | 1989 | NEP1-DG | GT1 | Badrane and Tordo [ |
| AF325490 | French | Bovine | 1985 | GUY1-BV | GT1 | Badrane and Tordo [ |
| AF325491 | Brazil | Bovine | 1986 | BRA1-BV | GT1 | Badrane and Tordo [ |
| AF325492 | Mexico | Bat | 1987 | MEX2-VP | GT1 | Badrane and Tordo [ |
| AF325494 | USA | Bat | 1981 | USA8-BT | GT1 | Badrane and Tordo [ |
| AF325495 | USA | Bat | 1982 | USA9-BT | GT1 | Badrane and Tordo [ |
| AF401285 | Thailand | 8743THA | GT1 | Unpublished | ||
| AF426297 | Australia | Bat | 1997 | ABLSF12NB | GT7 | Guyatt et al. [ |
| AF426298 | Australia | Bat | 1997 | ABLSF11KW | GT7 | Guyatt et al. [ |
| AJ871962 | China | Vaccine | PM | GT1 | Unpublished | |
| AY009098 | China | Human | 1986 | CNX8601 | GT1 | Tang et al. [ |
| AY009099 | China | Human | 1986 | CNX8511 | GT1 | Tang et al. [ |
| AY009100 | China | Dog (Vaccine) | 1955 | CTN | GT1 | Tang et al. [ |
| AY237121 | India | Dog | RVD | GT1 | Unpublished | |
| AY257980 | Thailand | Human | HM65 | GT1 | Hemachudha et al. [ | |
| AY257982 | Thailand | Human | HM88 | GT1 | Hemachudha et al. [ | |
| AY257983 | Thailand | Human | HM208 | GT1 | Hemachudha et al. [ | |
| AY987478 | India | Dog | 1999 | CHAND03 | GT1 | Unpublished |
| D14873 | Japan | Vaccine | RC-HL | GT1 | Unpublished | |
| D16330 | Japan | Vaccine | RC-HL | GT1 | Ito et al. [ | |
| DQ074978 | India | Dog | GT1 | Agrawal et al. [ | ||
| DQ076097 | S. Korea | Bovine | SKRBV0404HC | GT1 | Hyun et al. [ | |
| DQ076099 | S. Korea | Dog | SKRRD9903YG | GT1 | Hyun et al. [ | |
| DQ767897 | China | Vaccine | CTN-35 | GT1 | Unpublished | |
| DQ849071 | China | Dog | 1994 | GX4 | GT1 | Meng et al. [ |
| DQ849072 | China | Dog | 1992 | CQ92 | GT1 | Meng et al. [ |
| L04522 | China | Vaccine (Dog) | 1931 | 3aG | GT1 | Bai et al. [ |
| L04523 | China | Vaccine (dog) | 1993 | CGX89-1 | GT1 | Bai et al. [ |
| L40426 | CVS | GT1 | Yelverton et al. [ | |||
| M81058 | Algeria | Dog | ALG1-DG | GT1 | Benmansour et al. [ | |
| M81059 | Algeria | Human | GT1 | Benmansour et al. [ | ||
| M81060 | Algeria | Human | GT1 | Benmansour et al. [ | ||
| U03765 | Canada | Vulpes | 8480FX | GT1 | Nadin-Davis et al. [ | |
| U03766 | Arctic Circle | Dog | 1992 | Arctic A1-1090DG | GT1 | Nadin-Davis et al. [ |
| U03767 | Canada | Dog | 1993 | Hudson Bay-4055DG | GT1 | Nadin-Davis et al. [ |
| U11736 | Canada | Arctic Fox | 91RABN1035 | GT1 | Nadin-Davis et al. [ | |
| U11755 | Canada | Skunk | 91RABN1578 | GT1 | Nadin-Davis et al. [ | |
| U27214 | USA | Raccoon | NY 516 | GT1 | Nadin-Davis et al. [ | |
| U27215 | USA | Raccoon | NY 771 | GT1 | Nadin-Davis et al. [ | |
| U27216 | USA | Raccoon | FLA 125 | GT1 | Nadin-Davis et al. [ | |
| U27217 | USA | Raccoon | PA R89 | GT1 | Nadin-Davis et al. [ | |
| U52946 | USA | Bat | 1994 | SHBRV | GT1 | Morimoto et al. [ |
| X69122 | India | Vaccine | Flury | GT1 | Unpublished |
Recombination detection in glycoprotein gene of lyssavirus by using 3SEQ.
| P | Q | C | Dunn Sidak | Breakpoints | ||
|---|---|---|---|---|---|---|
| M81058 | AY987478 | AF233275 | 0 | 2.08 | 432–440, 1080–1089 | 456–496, 1080–1089 |
| M81060 | AY987478 | AF233275 | 1 | 1.31 | 432–440, 1080–1089 | 456–496, 1080–1089 |
| AY987478 | M81059 | AY237121 | 0 | 2.81 | 441–455, 1077–1079 | |
| AY987478 | M81058 | AY237121 | 0 | 2.13 | 441–455, 1077–1079 | |
| AY987478 | M81060 | AY237121 | 0 | 1.13 | 441–455, 1077–1079 | |
| AY987478 | AF233275 | AY237121 | 1.3 | 1.88 | 432–455, 1068–1089 | 465–518, 1068–1089 |
| AY987478 | L04522 | AY237121 | 1.1 | 1.48 | 627–638, 1077–1089 | 663–666, 1077–1089 |
| AY987478 | AF325489 | AY237121 | 0 | 2.71 | 700-701, 1077–1097 | |
| AY987478 | U11755 | AY237121 | 3.2 | 4.42 | 717–719, 1077–1082 | 729–734, 1077–1082 |
| AY987478 | U11736.2 | AY237121 | 3.3 | 4.61 | 717–719, 1077–1082 | 729–734, 1077–1082 |
| AY987478 | DQ849071 | AY237121 | 6.1 | 8.61 | 736-737, 1077–1079 | |
| AY987478 | DQ076097 | AY237121 | 1.2 | 1.69 | 630–638, 1077–1089 | 699–701, 1077–1089 |
| AY987478 | DQ076099 | AY237121 | 9 | 1.31 | 700-701, 1077–1089 | 714–719, 1077–1089 |
| AY987478 | L04523 | AY237121 | 2.2 | 3.04 | 736-737, 1077–1079 | |
| AY987478 | X69122 | AY237121 | 4 | 6.00 | 666–669, 1032–1049 | 666–669, 1077–1089 |
| AY987478 | AY009098 | AY237121 | 4 | 4.99 | 693–701, 1077–1079 | 705–711, 1077–1079 |
| AY987478 | AY009099 | AY237121 | 4 | 4.99 | 693–701, 1077–1079 | 705–711, 1077–1079 |
| AY987478 | DQ849072 | AY237121 | 2.1 | 3.02 | 693–701, 1077–1079 | 705–711, 1077–1079 |
| AY987478 | AJ871962 | AY237121 | 1 | 7.29 | 750–794, 1077–1089 | |
| AY987478 | AF325487 | AY237121 | 0 | 1.36 | 780–794, 1077–1079 | |
| AY987478 | L40426 | AY237121 | 4.8 | 6.71 | 750–794, 1077–1089 | |
| AY987478 | AF401285 | AY237121 | 0 | 9.99 | 780–795, 1077–1079 | |
| AY987478 | AY257983 | AY237121 | 2.3 | 3.27 | 780–795, 1077–1079 | |
| AY987478 | AY257980 | AY237121 | 0 | 5.70 | 750–761, 1077–1079 | 780–795, 1077–1079 |
| AY987478 | AY257982 | AY237121 | 5.9 | 8.33 | 780–795, 1032–1043 | 780–795, 1077–1079 |
| AY987478 | DQ767897 | AY237121 | 1 | 1.46 | 759–767, 972–974 | |
| AY987478 | U52946 | AY237121 | 5.3 | 7.43 | 741–748, 900-901 | 741–748, 918–938 |
| AY987478 | U03766 | AY237121 | 2.4 | 3.27 | 717–719, 876–889 | 717–719, 894–914 |
| AY987478 | U03765 | AY237121 | 2.6 | 3.55 | 717–719, 876–889 | 717–719, 894–914 |
| AY237121 | AF233275 | AY987478 | 0 | 1.38 | 432–452, 1077–1089 | |
| AY237121 | DQ074978 | AY987478 | 0 | 1.16 | 432–452, 1077–1089 | |
| AY237121 | L04522 | AY987478 | 0 | 7.15 | 627–647, 1065–1089 | |
| AY237121 | DQ076097 | AY987478 | 0 | 1.47 | 630–647, 1056–1058 | 630–647, 1065–1089 |
| AY237121 | U03767 | AY987478 | 1 | 1.42 | 630–638, 1041–1058 | 630–638, 1065–1079 |
| AY237121 | AJ871962 | AY987478 | 0 | 6.36 | 642–647, 1041–1058 | 642–647, 1065–1089 |
| AY237121 | X69122 | AY987478 | 0 | 3.16 | 642–647, 1041–1049 | 654–659, 1041–1049 |
| AY237121 | L40426 | AY987478 | 0 | 1.38 | 642–647, 1041–1058 | 642–647, 1065–1089 |
| AY237121 | M81058 | AY987478 | 0 | 9.60 | 441–452, 1065–1079 | 618–710, 1065–1079 |
| AY237121 | M81060 | AY987478 | 0 | 6.49 | 441–452, 1065–1079 | 618–710, 1065–1079 |
| AY237121 | D14873 | AY987478 | 0 | 6.82 | 685–701, 1065–1085 | 705–710, 1065–1085 |
| AY237121 | D16330 | AY987478 | 0 | 5.23 | 685–701, 1065–1085 | 705–710, 1065–1085 |
| AY237121 | AY257980 | AY987478 | 5.3 | 7.49 | 705–710, 1041–1046 | |
| AY237121 | DQ849071 | AY987478 | 1.9 | 2.68 | 708–710, 1041–1046 | |
| AY237121 | L04523 | AY987478 | 1.3 | 1.85 | 708–710, 1041–1046 | |
| AY237121 | DQ076099 | AY987478 | 0 | 1.07 | 634–647, 1056–1058 | 634–647, 1065–1089 |
| AY237121 | U11755 | AY987478 | 1 | 1.60 | 630–647, 1056–1058 | 630–647, 1065–1082 |
| AY237121 | U11736.2 | AY987478 | 0 | 2.29 | 630–647, 1056–1058 | 630–647, 1065–1082 |
| AY237121 | DQ767897 | AY987478 | 1.7 | 2.37 | 708–710, 1017–1022 | 708–710, 1041–1046 |
| AY237121 | AY009098 | AY987478 | 1.8 | 2.58 | 705–710, 1041–1046 | 736-737, 1041–1046 |
| AY237121 | AY009099 | AY987478 | 1.8 | 2.58 | 705–710, 1041–1046 | 736-737, 1041–1046 |
| AY237121 | AF325487 | AY987478 | 7.5 | 1.06 | 705–710, 1041–1046 | 732–734, 1041–1046 |
| AY237121 | U03766 | AY987478 | 1.2 | 1.75 | 630–638, 1041–1058 | 630–638, 1065–1079 |
| AY237121 | U03765 | AY987478 | 2.3 | 3.26 | 630–638, 1041–1058 | 630–638, 1065–1079 |
| AY237121 | M81059 | AY987478 | 0 | 1.27 | 441–452, 993–998 | 441–452, 1017–1034 |
| AY237121 | AF325490 | AY987478 | 9.2 | 1.30 | 705–710, 993–995 | 705–710, 1017–1019 |
| AY237121 | AF325491 | AY987478 | 7 | 9.93 | 705–710, 993–995 | |
| AY237121 | AF325492 | AY987478 | 9.6 | 1.34 | 700-701, 993–995 | 705–710, 993–995 |
| AY237121 | DQ849072 | AY987478 | 1.3 | 1.83 | 705–710, 924–935 | 705–710, 945–950 |
| AY237121 | AY009100 | AY987478 | 3.4 | 4.62 | 708–710, 885–887 | 708–710, 924–938 |
| AY237121 | AF401285 | AY987478 | 6.4 | 8.95 | 736-737, 883–887 | |
| AY237121 | AY257983 | AY987478 | 9.9 | 1.38 | 732–734, 883–887 | 732–734, 1041–1046 |
| M81059 | AY987478 | DQ074978 | 0 | 9.05 | 519–522, 1080–1089 | |
| M81058 | AY987478 | DQ074978 | 0 | 4.12 | 519–522, 1080–1089 | |
| M81060 | AY987478 | DQ074978 | 0 | 2.80 | 519–522, 1080–1089 | |
| AY009100 | M81059 | DQ849071 | 2 | 2.80 | 0–3, 108–119 | |
| AY009100 | M81058 | DQ849071 | 3 | 4.20 | 0–3, 108–119 | |
| AY009100 | M81060 | DQ849071 | 1.2 | 1.70 | 0–3, 108–119 | |
| AY009100 | AJ871962 | DQ849071 | 2.2 | 3.02 | 0–3, 108–110 | |
| AY009100 | M81059 | L04523 | 9.6 | 1.34 | 0–3, 108–119 | |
| AY009100 | M81058 | L04523 | 1.5 | 2.04 | 0–3, 108–119 | |
| AY009100 | M81060 | L04523 | 6.7 | 9.37 | 0–3, 108–119 | 0–3, 139–161 |
| AY009100 | AJ871962 | L04523 | 1.3 | 1.88 | 0–3, 108–110 | |
Note: P and Q are putative parent sequences, and C is the putative child sequence in the recombination.
KH tests verify the significance of breakpoints estimated by GARD analysis.
| Breakpoint | LHS | RHS | Significance |
|---|---|---|---|
| 441 | .00040 | .00040 | 0.01 |
| 1089 | .00040 | .00040 | 0.01 |
Parameter estimates,
| Parameter estimates | Likelihood scores ( | Model comparison (2Δ | Positive selection | ||
|---|---|---|---|---|---|
| M0: one ratio | 0.08 | None | |||
| M1: Nearly neutral | 0.17 | Not allowed | |||
| M2: Positive selection | 0.17 | −24010.40 | M2 versus M1:0, d.f. = 2, | None | |
| M7: | 0.10 | Not allowed | |||
| M8: | 0.10 | M7 versus M8: 18.18, d.f. = 2, | See | ||
Positive selection sites in the glycoprotein gene predicted by using Bayes empirical analysis under different PAML models.
| Codon | Amino acid | Posterior probability | Post mean ± S.E. | ||||
|---|---|---|---|---|---|---|---|
| Dataset I | Dataset II | Dataset I | Dataset II | Dataset I | Dataset II | Dataset I | Dataset II |
| 466 | 466 | A | A | 0.68 | 0.72 | 1.27 ± 0.35 | 1.29 ± 0.34 |
| 483 | 483 | V | V | 0.95 | 0.84 | 1.46 ± 0.16 | 1.39 ± 0.26 |
| 486 | 486 | T | T | 0.56 | 0.53 | 1.19 ± 0.36 | 1.16 ± 0.36 |
| 490 | 490 | Q | Q | 0.82 | 0.80 | 1.38 ± 0.27 | 1.36 ± 0.29 |
Dataset I: The whole 53 nucleotide sequences. Dataset II: AY987478 was excluded.
Detection of selection pressure on glycoprotein gene using methods implemented in the Datamonkey website.
| Dataset | Mean | Positive selection sites | Negative selection sites | Codon ( | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| SLAC | FEL | REL | SLAC | FEL | REL | SLAC | FEL | REL | ||
| Dataset I | 0.1226 | 0.1278 | 0 | 0 | 0 | 397 | 418 | 0 | ||
| Dataset II | 0.1231 | 0.1274 | 0 | 0 | 0 | 391 | 417 | 0 | ||
| Dataset III | 0.1214 | 0.1233 | 0 | 1 | 0 | 386 | 416 | 0 | 416 ( | |
Dataset I: The whole 53 nucleotide sequences. Dataset II: AY987478 was excluded. Dataset III: the six putative recombinants were excluded.