Genome AnnouncGenome AnnouncgagaGAGenome Announcements2169-8287American Society for Microbiology1752 N St., N.W., Washington, DC249260614056304genomeA00582-1410.1128/genomeA.00582-14ProkaryotesDraft Genome Sequences of Nine Enteropathogenic Escherichia coli Strains from KenyaHazenTracy H.aHumphrysMichael S.aOchiengJohn BenjaminbParsonsMichelecBoppCheryl A.cO’ReillyCiara E.cMintzEriccRaskoDavid A.a,dInstitute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, USAKenya Medical Research Institute/Centers for Disease Control and Prevention, Kisumu, KenyaDivision of Foodborne, Waterborne, and Environmental Diseases, National Center for Emerging and Zoonotic Infectious Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia, USADepartment of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, Maryland, USAAddress correspondence to David A. Rasko, drasko@som.umaryland.edu.1262014May-Jun201423e00582-1422520142752014Copyright © 2014 Hazen et al.2014Hazen et al.This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 Unported license.

We report here the draft genome sequences of nine enteropathogenic Escherichia coli (EPEC) strains isolated from children in Kenya who died during hospitalization with diarrhea. Each of the isolates possess the EPEC adherence factor (EAF) plasmid encoding the bundle-forming pilus, which is characteristic of EPEC. These isolates represent diverse serogroups and EPEC phylogenomic lineages.

cover-dateMay/June 2014access-typefree
GENOME ANNOUNCEMENT

Attaching and effacing Escherichia coli (AEEC) is characterized by the presence of the locus of enterocyte effacement (LEE) pathogenicity region that encodes a type III secretion system involved in pathogenesis (13). AEEC is further identified as either LEE-positive Shiga toxin-producing E. coli (STEC) or enteropathogenic E. coli (EPEC). EPEC strains contain the LEE region but not the Shiga toxin-encoding phage, and they are considered typical EPEC when they possess the bundle-forming pilus (BFP) encoded by the EPEC adherence factor (EAF) plasmid (1, 4, 5). In contrast, LEE-positive stx-negative bfp-negative E. coli strains are considered atypical EPEC (4). The nine E. coli isolates described here are LEE positive, stx negative, and bfp positive and are therefore classified as typical EPEC based on their virulence factor content.

The genomic DNA for sequencing was isolated from an overnight culture of each isolate using the Sigma GenElute kit (Sigma-Aldrich). Genome sequencing was performed at the University of Maryland School of Medicine, Institute for Genome Sciences, at the Genome Resource Center. The genome sequences were generated using paired-end libraries with 300-bp inserts on the Illumina HiSeq 2000. The Illumina reads generated for each isolate were assembled de novo using the Velvet assembly program (6), with k-mer values determined using VelvetOptimiser version 2.1.4 (http://bioinformatics.net.au/software.velvetoptimiser.shtml). The genome assemblies contained an average of 181 contigs per isolate (range, 117 to 283), with an average G+C content of 50.4% (range, 50.3 to 50.6%) and an average genome size of 5.2 Mb (range, 5.0 to 5.4 Mb).

These nine isolates were isolated from children in Kenya who died during hospitalization with diarrhea (7). These EPEC isolates have diverse serogroups (O126, O119, O111, O53, O127, O55, and O114). Phylogenomic analysis of the whole genomes demonstrated that these isolates occur in two E. coli phylogroups (B1 and B2) (8, 9) and occupy four EPEC phylogenomic lineages. Among the four EPEC isolates in phylogroup B2 was isolate S6995 (O127), which was identified in the EPEC1 phylogenomic lineage, with the most widely studied prototype EPEC isolate E2348/69 (10). Also from phylogroup B2 was EPEC isolate S6400 (O119), which was identified in the recently described EPEC4 phylogenomic lineage (11). The remaining two EPEC isolates identified in phylogroup B2, S6966 (O53) and S7438 (O55), were present in unassigned phylogenomic lineages. The five EPEC isolates identified in phylogroup B1, S7005 (O126), S6685 (O114), S5274 (O111), S7380 (serogroup unknown), and S6662 (O111), were all present in the EPEC2 phylogenomic lineage, which contains the EPEC prototype isolate B171 (12).

The genome sequencing of these nine EPEC isolates demonstrates that the EPEC isolates associated with diarrheal illness of children in Kenya exhibit considerable genomic diversity that was not previously appreciated.

Nucleotide sequence accession numbers.

The genome assemblies have been deposited at DDBJ/EMBL/GenBank with accession no. JICI00000000, JICH00000000, JICG00000000, JICF00000000, JICE00000000, JICD00000000, JICC00000000, JICB00000000, and JICA00000000, for isolates S6966, S6995, S7005, S7438, S6400, S7380, S6662, S5274, and S6685, respectively.

Citation Hazen TH, Humphrys MS, Ochieng JB, Parsons M, Bopp CA, O’Reilly CE, Mintz E, Rasko DA. 2014. Draft genome sequences of nine enteropathogenic Escherichia coli strains from Kenya. Genome Announc. 2(3):e00582-14. doi:10.1128/genomeA.00582-14.

ACKNOWLEDGMENTS

This project was funded in part by federal funds from the National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Services, under contract 1U19AI090873, and by funds from the State of Maryland.

REFERENCES KaperJBNataroJPMobleyHL 2004 Pathogenic Escherichia coli. Nat. Rev. Microbiol. 2:123140. 10.1038/nrmicro81815040260 McDanielTKJarvisKGDonnenbergMSKaperJB 1995 A genetic locus of enterocyte effacement conserved among diverse enterobacterial pathogens. Proc. Natl. Acad. Sci. U. S. A. 92:16641668. 10.1073/pnas.92.5.16647878036 McDanielTKKaperJB 1997 A cloned pathogenicity island from enteropathogenic Escherichia coli confers the attaching and effacing phenotype on E. coli K-12. Mol. Microbiol. 23:399407. 10.1046/j.1365-2958.1997.2311591.x9044273 NataroJPKaperJB 1998 Diarrheagenic Escherichia coli. Clin. Microbiol. Rev. 11:1422019457432 DonnenbergMSKaperJB 1992 Enteropathogenic Escherichia coli. Infect. Immun. 60:395339611398907 ZerbinoDRBirneyE 2008 Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 18:821829. 10.1101/gr.074492.10718349386 O’ReillyCEJaronPOchiengBNyaguaraATateJEParsonsMBBoppCAWilliamsKAVinjeJBlantonEWannemuehlerKAVululeJLasersonKFBreimanRFFeikinDRWiddowsonMAMintzE 2012 Risk factors for death among children less than 5 years old hospitalized with diarrhea in rural western Kenya, 2005–2007: a cohort study. PLoS Med. 9:e1001256. 10.1371/journal.pmed.100125622802736 TenaillonOSkurnikDPicardBDenamurE 2010 The population genetics of commensal Escherichia coli. Nat. Rev. Microbiol. 8:207217. 10.1038/nrmicro229820157339 JaureguyFLandraudLPassetVDiancourtLFrapyEGuigonGCarbonnelleELortholaryOClermontODenamurEPicardBNassifXBrisseS 2008 Phylogenetic and genomic diversity of human bacteremic Escherichia coli strains. BMC Genomics 9:560. 10.1186/1471-2164-9-56019036134 IguchiAThomsonNROguraYSaundersDOokaTHendersonIRHarrisDAsadulghaniMKurokawaKDeanPKennyBQuailMAThurstonSDouganGHayashiTParkhillJFrankelG 2009 Complete genome sequence and comparative genome analysis of enteropathogenic Escherichia coli O127:H6 strain E2348/69. J. Bacteriol. 191:347354. 10.1128/JB.01238-0818952797 HazenTHSahlJWFraserCMDonnenbergMSScheutzFRaskoDA 2013 Refining the pathovar paradigm via phylogenomics of the attaching and effacing Escherichia coli. Proc. Natl. Acad. Sci. U. S. A. 110:1281012815. 10.1073/pnas.130683611023858472 RaskoDARosovitzMJMyersGSMongodinEFFrickeWFGajerPCrabtreeJSebaihiaMThomsonNRChaudhuriRHendersonIRSperandioVRavelJ 2008 The pangenome structure of Escherichia coli: comparative genomic analysis of E. coli commensal and pathogenic isolates. J. Bacteriol. 190:68816893. 10.1128/JB.00619-0818676672