A divergent parvovirus genome was the only eukaryotic viral sequence detected in feces of a Tunisian child with unexplained diarrhea. Tusavirus 1 shared 44% and 39% identity with the nonstructural protein 1 and viral protein 1, respectively, of the closest genome, Kilham rat parvovirus, indicating presence of a new human viral species in the
Parvoviruses are small icosahedral viruses with linear single-stranded DNA genomes of ≈5 kb that are associated with a wide spectrum of illnesses in humans and animals. The subfamily
Using metagenomic deep sequencing, we analyzed fecal samples from 180 infants and children ages 7 days–96 months (mean 18.7 months) in Tunisia who had unexplained diarrhea that tested negative for rotavirus, norovirus, astrovirus, sapovirus, adenovirus types 40 and 41, and Aichi virus by reverse transcription PCR (
The fecal supernatants were filtered through a 0.45-μm filter (Millipore, Darmstadt, Germany) to remove bacterium-sized particles, and the filtrates were digested with a mixture of DNases (Turbo DNase from Ambion, Carlsbad, CA, USA; Baseline-ZERO from Epicenter, Madison, WI, USA; and Benzonase from Novagen, San Diego, CA, USA) and RNase (Fermentas, Pittsburgh, PA, USA) to digest unprotected nucleic acids. Enriched viral nucleic acids (RNA and DNA) were then extracted and amplified by using ScriptSeq v2 RNA-Seq Library Preparation Kit (Epicenter) and analyzed in pools of 10 specimens in 2 Illumina MiSeq run of 250-bp end reads, yielding 20,693,619 unique sequences. We compared the Illumina sequences with the GenBank nonredundant protein databases using BLASTx (
Using a BLASTx E-score cutoff of 10−5, we identified, in decreasing frequency, sequences related to the mammalian viruses: sapovirus (120,177 reads), anelloviridae (14,841 reads), parechovirus (10,557 reads), norovirus (4,551 reads), enterovirus (3,857 reads), circoviridae (2,127 reads), group A rotavirus (839 reads), adeno-associated virus (812 reads), picobirnavirus (274 reads), bufavirus (168 reads), WU polyomavirus (136 reads), bocavirus (62 reads), adenovirus (58 reads), papillomavirus (22 reads), cosavirus (20 reads), group C rotavirus (17 reads), human astrovirus 1 (14 reads), salivirus (4 reads), and Aichi virus (2 reads). One pool showed a single read encoding a parvovirus-like protein segment with high levels of genetic similarity (BLASTx E-score of 5 × 10−8) to the nonstructural protein (NS) 1 of rat parvovirus (GenBank accession no. AFV67813).
The individual fecal sample within the pool containing this sequence was then identified by using PCR and underwent further deep sequencing as above, generating 11 more parovirus sequences. No other eukaryotic viral sequences were identified from 260,000 unique sequence reads from this patient. The near complete parvovirus genome was then acquired by filling genome gaps by PCR and amplifying 5′ and 3′ extremities using RACE (Rapid Amplification of cDNA Ends, Life Technologies). Amplicons were directly sequenced by primer walking. We named this virus Tusavirus 1 for
A nearly complete 4,424-bp genome (Tusavirus 1, GenBank accession no. KJ495710) was successfully acquired with partial 5′ untranslated region (243 bp), complete NS1 open reading frame (625 aa), complete viral protein (VP) 1 open reading frame (715 aa), and a partial 3′ untranslated region (68 bp). Tusavirus has a potential upstream start codon MSS in a weaker Kozak consensus sequence than MAQ (
New parvovirus genome and phylogeny. A) Organization of the Tusavirus genome. Theoretical splicing for expression of viral protein (VP) 1 is shown. The alignment of the PLA2 regions of representatives of 5 protoparvovirus species show the calcium-binding region and catalytic residues in Tusavirus. Pairwise sliding window of percentage nucleotide similarity of Tusavirus aligns with the genetically closest Kilham rat parvovirus. B) Phylogenetic trees generated with nonstructural protein (NS) 1 and VP1 of Tusavirus and of the 5 International Committee on Taxonomy of Viruses–designated species in the
Protein sequence alignments were made by using ClustalX version 2.0.3 (
Phylogenetic analysis showed that Tusavirus 1 was distinct from known members of the
We used a nested PCR targeting NS1 to determine the prevalence of this virus in the 180 Tunisia diarrhea samples. Primers Tusa-F1 (5′-GAAGAAGCTGGAAACTGTGGTCA-3′) and Tusa-R1 (5′-CTCGTCTTTCTCCCAGGCATCT-3′) were used for the first round of PCR, and primers Tusa-F2 (5′-ATTGCTCCAACACCAGTCATCA-3′) and Tusa-R2 (5′-TCTGGTCTGGTCCAATCTTCTTC-3′) for the second round of PCR. The PCR conditions were: 95°C for 5 min, 35 cycles 95°C for 30 s, 52°C or 51°C (for the first or second round, respectively) for 30 s, and 72°C for 1 min, a final extension at 72°C for 10 min. No samples except the one initially detected by deep sequencing were PCR positive, yielding a low prevalence of 0.56% (1/180) in this Tunisian population.
We detected fecal shedding of a previously uncharacterized parvovirus in a child with unexplained diarrhea. The 18-month-old girl showed twice daily liquid and greenish feces over 3 days but no fever (37.4°C), vomiting, or dehydration. To identify other viral infections in this patient, the Tusavirus-positive sample was individually analyzed by the same metagenomics method. No other mammalian virus was detected, suggesting a possible role for Tusavirus 1 in this patient’s gastrointestinal illness, although the lack of testing for pathogenic bacteria and parasites does not enable us to exclude these alternative explanations. Wider geographic sampling of human samples, including case–control PCR studies of unexplained diarrhea and serologic tests, are needed to define the prevalence and disease association of this new parvovirus species in different age groups and populations.
The National Heart, Lung, and Blood Institute, National Institutes of Health, provided support to E.D. (grant R01 HL105770). The Blood Systems Research Institute also provided support for this study.
Dr Phan is a staff scientist at the Blood Systems Research Institute and Department of Laboratory Medicine, University of California, San Francisco. His research interests include novel virus discovery.