<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" article-type="research-article"><?properties open_access?><front><journal-meta><journal-id journal-id-type="nlm-ta">J Clin Microbiol</journal-id><journal-id journal-id-type="iso-abbrev">J. Clin. Microbiol</journal-id><journal-id journal-id-type="hwp">jcm</journal-id><journal-id journal-id-type="pmc">jcm</journal-id><journal-id journal-id-type="publisher-id">JCM</journal-id><journal-title-group><journal-title>Journal of Clinical Microbiology</journal-title></journal-title-group><issn pub-type="ppub">0095-1137</issn><issn pub-type="epub">1098-660X</issn><publisher><publisher-name>American Society for Microbiology</publisher-name><publisher-loc>1752 N St., N.W., Washington, DC</publisher-loc></publisher></journal-meta><article-meta><article-id pub-id-type="pmid">28592550</article-id><article-id pub-id-type="pmc">5527429</article-id><article-id pub-id-type="publisher-id">00361-17</article-id><article-id pub-id-type="doi">10.1128/JCM.00361-17</article-id><article-categories><subj-group subj-group-type="heading"><subject>Bacteriology</subject></subj-group></article-categories><title-group><article-title>Validation and Implementation of Clinical Laboratory Improvements Act-Compliant Whole-Genome Sequencing in the Public Health Microbiology Laboratory</article-title><alt-title alt-title-type="running-head">WGS for Public Health Microbiology</alt-title><alt-title alt-title-type="short-authors">Kozyreva et al.</alt-title></title-group><contrib-group><contrib contrib-type="author"><name><surname>Kozyreva</surname><given-names>Varvara K.</given-names></name></contrib><contrib contrib-type="author"><name><surname>Truong</surname><given-names>Chau-Linda</given-names></name></contrib><contrib contrib-type="author"><name><surname>Greninger</surname><given-names>Alexander L.</given-names></name></contrib><contrib contrib-type="author"><name><surname>Crandall</surname><given-names>John</given-names></name></contrib><contrib contrib-type="author"><name><surname>Mukhopadhyay</surname><given-names>Rituparna</given-names></name></contrib><contrib contrib-type="author" corresp="yes"><name><surname>Chaturvedi</surname><given-names>Vishnu</given-names></name></contrib><aff>Microbial Diseases Laboratory, California Department of Public Health, Richmond, California, USA</aff></contrib-group><contrib-group><contrib contrib-type="editor"><name><surname>Diekema</surname><given-names>Daniel J.</given-names></name><role>Editor</role><aff>University of Iowa College of Medicine</aff></contrib></contrib-group><author-notes><corresp id="cor1">Address correspondence to Vishnu Chaturvedi, <email>Vishnu.Chaturvedi@cdph.ca.gov</email>.</corresp><fn fn-type="other"><p><bold>Citation</bold> Kozyreva VK, Truong C-L, Greninger AL, Crandall J, Mukhopadhyay R, Chaturvedi V. 2017. Validation and implementation of Clinical Laboratory Improvements Act-compliant whole-genome sequencing in the public health microbiology laboratory. J Clin Microbiol 55:2502&#x02013;2520. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1128/JCM.00361-17">https://doi.org/10.1128/JCM.00361-17</ext-link>.</p></fn></author-notes><pub-date pub-type="epreprint"><day>7</day><month>6</month><year>2017</year></pub-date><pub-date pub-type="epub"><day>25</day><month>7</month><year>2017</year></pub-date><pub-date pub-type="ppub"><month>8</month><year>2017</year></pub-date><pub-date pub-type="pmc-release"><day>25</day><month>7</month><year>2017</year></pub-date><!-- PMC Release delay is 0 months and 0 days and was based on the <pub-date pub-type="epub"/>. --><volume>55</volume><issue>8</issue><fpage>2502</fpage><lpage>2520</lpage><history><date date-type="received"><day>2</day><month>3</month><year>2017</year></date><date date-type="rev-request"><day>25</day><month>3</month><year>2017</year></date><date date-type="accepted"><day>26</day><month>5</month><year>2017</year></date></history><permissions><copyright-statement>Copyright &#x000a9; 2017 Kozyreva et al.</copyright-statement><copyright-year>2017</copyright-year><copyright-holder>Kozyreva et al.</copyright-holder><license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/"><license-p>This is an open-access article distributed under the terms of the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International license</ext-link>.</license-p></license></permissions><self-uri content-type="pdf" xlink:href="zjm00817002502.pdf"/><abstract><title>ABSTRACT</title><p>Public health microbiology laboratories (PHLs) are on the cusp of unprecedented improvements in pathogen identification, antibiotic resistance detection, and outbreak investigation by using whole-genome sequencing (WGS). However, considerable challenges remain due to the lack of common standards. Here, we describe the validation of WGS on the Illumina platform for routine use in PHLs according to Clinical Laboratory Improvements Act (CLIA) guidelines for laboratory-developed tests (LDTs). We developed a validation panel comprising 10 <named-content content-type="genus-species">Enterobacteriaceae</named-content> isolates, 5 Gram-positive cocci, 5 Gram-negative nonfermenting species, 9 <named-content content-type="genus-species">Mycobacterium tuberculosis</named-content> isolates, and 5 miscellaneous bacteria. The genome coverage range was 15.71&#x000d7; to 216.4&#x000d7; (average, 79.72&#x000d7;; median, 71.55&#x000d7;); the limit of detection (LOD) for single nucleotide polymorphisms (SNPs) was 60&#x000d7;. The accuracy, reproducibility, and repeatability of base calling were &#x0003e;99.9%. The accuracy of phylogenetic analysis was 100%. The specificity and sensitivity inferred from multilocus sequence typing (MLST) and genome-wide SNP-based phylogenetic assays were 100%. The following objectives were accomplished: (i) the establishment of the performance specifications for WGS applications in PHLs according to CLIA guidelines, (ii) the development of quality assurance and quality control measures, (iii) the development of a reporting format for end users with or without WGS expertise, (iv) the availability of a validation set of microorganisms, and (v) the creation of a modular template for the validation of WGS processes in PHLs. The validation panel, sequencing analytics, and raw sequences could facilitate multilaboratory comparisons of WGS data. Additionally, the WGS performance specifications and modular template are adaptable for the validation of other platforms and reagent kits.</p></abstract><kwd-group><title>KEYWORDS</title><kwd>bacteria</kwd><kwd>whole-genome sequencing</kwd><kwd>performance specifications</kwd><kwd>laboratory-developed test</kwd><kwd>quality management</kwd><kwd>validation</kwd><kwd>CLIA</kwd><kwd>public health</kwd><kwd>bioinformatics pipeline</kwd><kwd>WGS</kwd></kwd-group><funding-group><award-group id="award1"><funding-source id="gs1">HHS | Centers for Disease Control and Prevention (CDC)
<named-content content-type="funder-id">https://doi.org/10.13039/100000030</named-content></funding-source><award-id rid="gs1">6NU50CK000410-03</award-id><principal-award-recipient>Vishnu Chaturvedi</principal-award-recipient></award-group></funding-group><counts><count count="1" count-type="supplementary-material"/><fig-count count="3"/><table-count count="4"/><equation-count count="0"/><ref-count count="59"/><page-count count="19"/><word-count count="12417"/></counts><custom-meta-group><custom-meta><meta-name>cover-date</meta-name><meta-value>August 2017</meta-value></custom-meta></custom-meta-group></article-meta></front><body><sec sec-type="intro" id="sec1"><title>INTRODUCTION</title><p>Clinical microbiology laboratories and public health microbiology laboratories (PHLs) are undergoing transformative changes with the adoption of whole-genome sequencing (WGS) (<xref rid="B1" ref-type="bibr">1</xref>, <xref rid="B2" ref-type="bibr">2</xref>). For several years, leading laboratories have reported proof-of-concept studies on WGS-enabled advances in the identification of pathogens, antibiotic resistance (ABR) detection, and disease outbreak investigations (<xref rid="B3" ref-type="bibr">3</xref><xref ref-type="bibr" rid="B4">&#x02013;</xref><xref rid="B6" ref-type="bibr">6</xref>). Technologies also referred to as next-generation sequencing (NGS) have yielded more detailed information about the microbial features than was possible by using a combination of other laboratory approaches. Further developments of WGS platforms allowed remarkable in-depth inquiry of pathogenic genomes for the discovery of genetic variants and genome rearrangements that could have been missed by using other DNA methods (<xref rid="B3" ref-type="bibr">3</xref>, <xref rid="B7" ref-type="bibr">7</xref>, <xref rid="B8" ref-type="bibr">8</xref>). Enhanced investigations of disease outbreaks have led to a new understanding of routes of transmission of infectious agents (<xref rid="B9" ref-type="bibr">9</xref><xref ref-type="bibr" rid="B10">&#x02013;</xref><xref rid="B11" ref-type="bibr">11</xref>). WGS-enabled metagenomics and microbiome discoveries have revealed a new appreciation for the role of microbes in health and disease (<xref rid="B12" ref-type="bibr">12</xref><xref ref-type="bibr" rid="B13">&#x02013;</xref><xref rid="B15" ref-type="bibr">15</xref>). The innovations are continuing at such an unprecedented pace that WGS is expected to become an alternative to culture-dependent approaches in clinical and public health microbiology laboratories (<xref rid="B16" ref-type="bibr">16</xref><xref ref-type="bibr" rid="B17">&#x02013;</xref><xref rid="B18" ref-type="bibr">18</xref>).</p><p>Notwithstanding its promises, several challenges remain for the adoption of WGS in microbiology laboratories (<xref rid="B19" ref-type="bibr">19</xref><xref ref-type="bibr" rid="B20">&#x02013;</xref><xref rid="B22" ref-type="bibr">22</xref>). The accelerated obsolescence of sequencing platforms presents several obstacles in bridging the gap between research and routine diagnostics, including standardization efforts (<xref rid="B23" ref-type="bibr">23</xref>). The downstream bioinformatics pipelines are also unique challenges for microbiology laboratories regarding both infrastructure and skilled operators (<xref rid="B24" ref-type="bibr">24</xref><xref ref-type="bibr" rid="B25">&#x02013;</xref><xref rid="B27" ref-type="bibr">27</xref>). Overall, WGS &#x0201c;wet-bench&#x0201d; and &#x0201c;dry-bench&#x0201d; workflows represent integrated processes, which are not easily amenable to the traditional quality metrics used by microbiology laboratories (<xref rid="B27" ref-type="bibr">27</xref><xref ref-type="bibr" rid="B28">&#x02013;</xref><xref rid="B29" ref-type="bibr">29</xref>). The capital investments and recurring costs of WGS for clinical laboratories, although rapidly declining, remain relatively high to allow multilaboratory comparisons for the standardization of analytical parameters. Finally, regulatory agencies have not yet proposed standard WGS guidelines for clinical microbiology (<xref rid="B30" ref-type="bibr">30</xref>), and external proficiency testing (PT) programs for clinical and public health microbiology laboratories are still in development (<xref rid="B31" ref-type="bibr">31</xref>, <xref rid="B32" ref-type="bibr">32</xref>).</p><p>There are a few notable developments toward the standardization and validation of next-generation sequencing in clinical laboratories. The U.S. Centers for Disease Control and Prevention (CDC) sponsored the Next-Generation Sequencing: Standardization of Clinical Testing (Nex-StoCT) workgroup to propose quality laboratory practices for the detection of DNA sequence variations associated with heritable human disorders (<xref rid="B33" ref-type="bibr">33</xref>, <xref rid="B34" ref-type="bibr">34</xref>). This workgroup developed principles and guidelines for test validation, quality control, proficiency testing, and reference materials. Although not focused on infectious diseases, these guidelines provide a valuable roadmap for the implementation of WGS in clinical microbiology and public health laboratories. The College of American Pathologists (CAP) reported 18 requirements in an accreditation checklist for next-generation sequencing analytic (wet-bench) and bioinformatics (dry-bench) processes as part of its molecular pathology checklist (<xref rid="B30" ref-type="bibr">30</xref>). These &#x0201c;foundational&#x0201d; accreditation requirements were designed to be broadly applicable to the testing of inheritable disorders, molecular oncology, and infectious diseases. Along the same lines, the feasibility of <italic>in silico</italic> proficiency testing has been demonstrated for NGS (<xref rid="B35" ref-type="bibr">35</xref>). Recently, high accuracy and reproducibility were shown for WGS-based microbial strain typing performed in a ring trial study involving five laboratories; the results suggested that a proficiency testing program for WGS is feasible in clinical microbiology laboratories (<xref rid="B36" ref-type="bibr">36</xref>). The Clinical and Laboratory Standards Institute (CLSI) updated its guidelines for nucleic acid sequencing methods in diagnostic laboratory medicine with considerations specific to the application of next-generation sequencing in microbiology (<xref rid="B37" ref-type="bibr">37</xref>). Thus, a broad technical framework is now available to design WGS validation protocols that will be most relevant for clinical and public health laboratories. Our aims for the present study were to establish performance metrics for the typical workflow in public health microbiology laboratories, design modular templates for the validation of different platforms and chemistries, finalize a user-friendly report format, and identify a set of bacterial pathogens that could be used for WGS validation and performance assessments.</p></sec><sec sec-type="results" id="sec2"><title>RESULTS</title><sec id="sec2-1"><title>Accuracy of WGS.</title><p>A number of Clinical Laboratory Improvements Act (CLIA)-required performance parameters were adopted, with modifications for the validation of WGS (<xref ref-type="table" rid="T1">Table 1</xref>). The modular validation template and a summary are presented in <xref ref-type="fig" rid="F1">Fig. 1</xref>. The quality assurance (QA) and quality control (QC) measures are described in detail later in Results.</p><table-wrap id="T1" orientation="portrait" position="float"><label>TABLE 1</label><caption><p>Performance characteristics, definitions, and formulas used for validation<xref ref-type="table-fn" rid="T1F1"><sup><italic>a</italic></sup></xref></p></caption><table frame="hsides" rules="groups"><colgroup span="1"><col align="left" span="1"/><col align="left" span="1"/><col align="left" span="1"/><col align="left" span="1"/><col align="left" span="1"/><col align="left" span="1"/><col align="left" span="1"/><col align="left" span="1"/></colgroup><thead><tr valign="bottom"><th rowspan="2" align="left" colspan="1">Performance characteristic for WGS applications</th><th rowspan="2" align="left" colspan="1">Definition of performance characteristic for WGS applications</th><th rowspan="2" align="left" colspan="1">Formula used for calculation</th><th rowspan="2" align="left" colspan="1">Assay-specific definition</th><th colspan="4" align="left" rowspan="1">Result of assay used for validation of parameter<hr/></th></tr><tr valign="bottom"><th align="left" rowspan="1" colspan="1">hqSNP-based genotyping</th><th align="left" rowspan="1" colspan="1">MLST</th><th align="left" rowspan="1" colspan="1">16S</th><th align="left" rowspan="1" colspan="1">Antibiotic resistance gene detection</th></tr></thead><tbody><tr valign="top"><td rowspan="1" colspan="1">Accuracy</td><td rowspan="1" colspan="1">Degree of agreement between the nucleic acid sequences derived from the assay (measured value) and those from a reference sequence (true value)</td><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/></tr><tr valign="top"><td rowspan="1" colspan="1">&#x02003;&#x02003;&#x02003;&#x02003;Accuracy of platform</td><td rowspan="1" colspan="1">Accuracy of base calling against the reference sequence; the accuracy of the platform was established as the agreement between base calling made by the MiSeq sequencer (measured value) and the NCBI/CDC reference sequence (true value)</td><td rowspan="1" colspan="1">% agreement with reference = [(covered genome length) &#x02212; (total no. of SNPs differing from reference)]/(covered genome length) &#x000d7; 100</td><td rowspan="1" colspan="1">Accuracy of the platform</td><td rowspan="1" colspan="1">99.999378%</td><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/></tr><tr valign="top"><td rowspan="1" colspan="1">&#x02003;&#x02003;&#x02003;&#x02003;Accuracy of assay</td><td rowspan="1" colspan="1">Accuracy of assay is determined as an agreement of the assay result for validation sequences generated by the PHL with the assay result for reference sequences of the same strains</td><td rowspan="1" colspan="1">Accuracy = (no. of correct results)/(total no. of results) &#x000d7; 100</td><td rowspan="1" colspan="1">Definition of correct result</td><td rowspan="1" colspan="1">Congruence of phylogenetic trees built using reference sequences and validation sequences</td><td rowspan="1" colspan="1">Detection and correct identification of each of the MLST alleles</td><td rowspan="1" colspan="1">ID of the 16S rRNA sequence of the validation sample matches the ID of the 16S rRNA sequence of the reference sequence</td><td rowspan="1" colspan="1">Presence of ABR genes characteristic of the reference strain, absence of any other ABR genes</td></tr><tr valign="top"><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1">Single test unit</td><td rowspan="1" colspan="1">Individual sample clustering</td><td rowspan="1" colspan="1">Allele</td><td rowspan="1" colspan="1">16S rRNA ID result</td><td rowspan="1" colspan="1">Antibiotic resistance gene</td></tr><tr valign="top"><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1">Accuracy of assay</td><td rowspan="1" colspan="1">100%</td><td rowspan="1" colspan="1">100%</td><td rowspan="1" colspan="1">100%</td><td rowspan="1" colspan="1">100%</td></tr><tr valign="top"><td rowspan="1" colspan="1">&#x02003;&#x02003;&#x02003;&#x02003;Accuracy of bioinformatics pipeline</td><td rowspan="1" colspan="1">Agreement of the clustering suggested by previous investigators with the clustering achieved by analysis using PHL validation bioinformatics pipeline</td><td rowspan="1" colspan="1">% agreement = (no. of outbreak isolates clustered correctly in validation tree)/(total no. of outbreak isolates clustered together in the study tree) &#x000d7; 100</td><td rowspan="1" colspan="1">Accuracy of bioinformatics pipeline</td><td rowspan="1" colspan="1">100%</td><td rowspan="1" colspan="1">NA</td><td rowspan="1" colspan="1">NA</td><td rowspan="1" colspan="1">NA</td></tr><tr valign="top"><td rowspan="1" colspan="1">Precision</td><td rowspan="1" colspan="1">Degree to which repeated sequence analyses give the same result repeatably (within-run precision) and reproducibly (between-run precision)</td><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/></tr><tr valign="top"><td rowspan="1" colspan="1">&#x02003;&#x02003;&#x02003;&#x02003;Repeatability</td><td rowspan="1" colspan="1">Repeatability was established by sequencing the same samples multiple times under the same conditions and evaluating the concordance of the assay results and performance</td><td rowspan="1" colspan="1">Repeatability = (no. of within-run replicates in agreement)/(total no. of tests performed for within-run replicates) &#x000d7; 100</td><td rowspan="1" colspan="1">Definition of correct result</td><td rowspan="1" colspan="1">Repeatability of single nucleotide variant detection</td><td rowspan="1" colspan="1">Repeatability of allele detection</td><td rowspan="1" colspan="1">Repeatability of 16S ID</td><td rowspan="1" colspan="1">NA</td></tr><tr valign="top"><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1">Single test unit</td><td rowspan="1" colspan="1">SNP (precision per replicate), SNP (precision per genome size)</td><td rowspan="1" colspan="1">Allele</td><td rowspan="1" colspan="1">16S rRNA ID</td><td rowspan="1" colspan="1">NA</td></tr><tr valign="top"><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1">Repeatability</td><td rowspan="1" colspan="1">99.02%, 99.9999997%</td><td rowspan="1" colspan="1">100%</td><td rowspan="1" colspan="1">100%</td><td rowspan="1" colspan="1">NA</td></tr><tr valign="top"><td rowspan="1" colspan="1">&#x02003;&#x02003;&#x02003;&#x02003;Reproducibility</td><td rowspan="1" colspan="1">Reproducibility was assessed as the consistency of the assay results and performance characteristics for the same sample sequenced under different conditions, such as between different runs, operators, and sample preparations</td><td rowspan="1" colspan="1">Reproducibility = (no. of between-run replicates in agreement)/(total no. of tests performed for between-run replicates) &#x000d7; 100</td><td rowspan="1" colspan="1">Definition of correct result</td><td rowspan="1" colspan="1">Reproducibility of single nucleotide variant detection</td><td rowspan="1" colspan="1">Reproducibility of allele detection</td><td rowspan="1" colspan="1">Reproducibility of 16S ID</td><td rowspan="1" colspan="1">NA</td></tr><tr valign="top"><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1">Single test unit</td><td rowspan="1" colspan="1">SNP (precision per replicate), SNP (precision per base pair)</td><td rowspan="1" colspan="1">Allele</td><td rowspan="1" colspan="1">16S rRNA ID</td><td rowspan="1" colspan="1">NA</td></tr><tr valign="top"><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1">Reproducibility</td><td rowspan="1" colspan="1">97.05%, 99.999998%</td><td rowspan="1" colspan="1">100%</td><td rowspan="1" colspan="1">100%</td><td rowspan="1" colspan="1">NA</td></tr><tr valign="top"><td rowspan="1" colspan="1">Analytical sensitivity (LOD)</td><td rowspan="1" colspan="1">Minimum coverage that allows accurate SNP detection (LOD<sub>SNP</sub>)</td><td rowspan="1" colspan="1">NA</td><td rowspan="1" colspan="1">LOD<sub>SNP</sub></td><td rowspan="1" colspan="1">60&#x000d7;</td><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1">NA</td></tr><tr valign="top"><td rowspan="1" colspan="1">Analytical specificity (interference)</td><td rowspan="1" colspan="1">Ability of an assay to detect only the intended target in the presence of potentially cross-reacting nucleotide sequences</td><td rowspan="1" colspan="1">NA</td><td rowspan="1" colspan="1">Interference/cross-reactivity</td><td rowspan="1" colspan="1">Cross-reactivity and interference from contaminating sequencing reads are possible</td><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1">NA</td></tr><tr valign="top"><td rowspan="1" colspan="1">Diagnostic sensitivity</td><td rowspan="1" colspan="1">Likelihood that a WGS assay will detect sequence variation when present within the analyzed genomic region (this value reflects the false-negative rate of the assay)</td><td rowspan="1" colspan="1">Diagnostic sensitivity = TP/(TP + FN) &#x000d7; 100</td><td rowspan="1" colspan="1">Definition of true-positive result</td><td rowspan="1" colspan="1">Clustering of related samples (no. of validation samples with clustering results matching the reference)</td><td rowspan="1" colspan="1">No. of correctly identified alleles</td><td rowspan="1" colspan="1">NA</td><td rowspan="1" colspan="1">NA</td></tr><tr valign="top"><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1">Definition of false-negative result</td><td rowspan="1" colspan="1">No. of validation samples that clustered together with samples genetically distant according to the reference tree</td><td rowspan="1" colspan="1">No. of unidentified or misidentified alleles in validation samples</td><td rowspan="1" colspan="1">NA</td><td rowspan="1" colspan="1">NA</td></tr><tr valign="top"><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1">Single test unit</td><td rowspan="1" colspan="1">Individual sample clustering</td><td rowspan="1" colspan="1">Allele</td><td rowspan="1" colspan="1">NA</td><td rowspan="1" colspan="1">NA</td></tr><tr valign="top"><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1">Diagnostic sensitivity</td><td rowspan="1" colspan="1">100%</td><td rowspan="1" colspan="1">100%</td><td rowspan="1" colspan="1">NA</td><td rowspan="1" colspan="1">NA</td></tr><tr valign="top"><td rowspan="1" colspan="1">Diagnostic specificity</td><td rowspan="1" colspan="1">Probability that a WGS assay will not detect sequence variations when none are present within the analyzed genomic region (this value reflects an assay's false-positive rate)</td><td rowspan="1" colspan="1">Diagnostic specificity = TN/(TN + FP) &#x000d7; 100</td><td rowspan="1" colspan="1">Definition of true-negative result</td><td rowspan="1" colspan="1">No clustering between unrelated samples (no. of validation samples with clustering results matching the reference)</td><td rowspan="1" colspan="1">No. of unidentified alleles in negative-control samples</td><td rowspan="1" colspan="1">NA</td><td rowspan="1" colspan="1">NA</td></tr><tr valign="top"><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1">Definition of false-positive result</td><td rowspan="1" colspan="1">No. of validation samples that failed to cluster together with samples genetically similar according to the reference tree</td><td rowspan="1" colspan="1">No. of identified alleles in negative-control samples</td><td rowspan="1" colspan="1">NA</td><td rowspan="1" colspan="1">NA</td></tr><tr valign="top"><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1">Single test unit</td><td rowspan="1" colspan="1">Individual sample clustering</td><td rowspan="1" colspan="1">Allele</td><td rowspan="1" colspan="1">NA</td><td rowspan="1" colspan="1">NA</td></tr><tr valign="top"><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1">Diagnostic specificity</td><td rowspan="1" colspan="1">100%</td><td rowspan="1" colspan="1">100%</td><td rowspan="1" colspan="1">NA</td><td rowspan="1" colspan="1">NA</td></tr><tr valign="top"><td rowspan="1" colspan="1">Reportable range</td><td rowspan="1" colspan="1">Region of the genome in which a sequence of an acceptable quality can be derived by the laboratory assay</td><td rowspan="1" colspan="1">NA</td><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1">Genome-wide hqSNPs</td><td rowspan="1" colspan="1">Housekeeping genes in MLST scheme</td><td rowspan="1" colspan="1">16S rRNA gene</td><td rowspan="1" colspan="1">Genes in ResFinder database</td></tr></tbody></table><table-wrap-foot><fn fn-type="other" id="T1F1"><label>a</label><p>See details in Document S1 in the supplemental material. Abbreviations: TP, true-positive results; TN, true-negative results; FP, false-positive results; FN, false-negative results; LOD, limit of detection; LOD<sub>SNP</sub>, limit of SNP detection; ID, identification; NA, the parameter was not defined for the given assay.</p></fn></table-wrap-foot></table-wrap><fig id="F1" orientation="portrait" position="float"><label>FIG 1</label><caption><p>Summary of WGS validation. The estimated performance parameters are shown in blue boxes. The components of WGS accuracy determined in this study are shown in purple boxes. The WGS assays evaluated in order to deduce the corresponding performance parameters are shown in green boxes. Percentages alongside the boxes represent values measured during this validation for the corresponding parameters.</p></caption><graphic xlink:href="zjm9990956030001"/></fig><p>The accuracy of WGS was divided into three components: platform accuracy, assay accuracy, and bioinformatics pipeline accuracy.</p></sec><sec id="sec2-2"><title>Platform accuracy.</title><p>Platform accuracy was assessed as the accuracy of the identification of individual base pairs (&#x0201c;base calling&#x0201d;) in the bacterial genome. The accuracy of the platform was established by determining the proximity of agreement between base callings made by the MiSeq sequencer (measured value) and the NCBI/CDC reference sequence (the true value). We determined the accuracy of the MiSeq Illumina platform by mapping generated reads to the corresponding reference sequence and identifying single nucleotide polymorphisms (SNPs) throughout the genome. A few validation samples differed from the reference genome by several SNPs. However, 99% (324 out of 327) of these SNPs were reproducible among all five replicates sequenced for each sample. Since amplification and sequencing errors were random between different library preparations, it was unlikely that the same erroneous SNP would occur in all five replicates. Therefore, we concluded that these discrepancies were not caused by sequencing errors but most likely were the result of the accumulation of mutations in the reference strains or previous sequencing mistakes in the reference sequence. Sanger sequencing confirmed the SNPs found in WGS results for the selected isolates (see Appendix 14 in Document S1 in the supplemental material). However, in several instances, it was impossible to design specific primers within reach of Sanger sequencing read lengths because SNPs were found in the repeat regions. In one case, the results of Sanger sequencing matched the results of mapping, but upon comparison with <italic>de novo</italic> assemblies, the same sequence was found in two variants, with and without the SNP detected by mapping. This highlighted the limitation of the short-read sequencing technology and the underlying caution that must be exercised in the interpretation of the SNPs called in the repeat regions of the genomes. In both cases, whether we took into account all SNPs detected between validation and reference sequences or only those SNPs that did not appear in all of the replicates (true sequencing errors), we observed &#x0003e;99.999% agreement of the generated whole-genome sequences with the reference sequences for each tested sample.</p></sec><sec id="sec2-3"><title>Assay accuracy.</title><p>Assay accuracy was defined as the agreement of the assay results for the validation sequences with the assay results for the reference sequences of the same strains. Four applications of WGS were used to validate the accuracy of the assay: an <italic>in silico</italic>
<underline>m</underline>ulti<underline>l</underline>ocus <underline>s</underline>equence <underline>t</underline>yping (MLST) assay, a 16S rRNA gene species identification assay, an assay for the detection of ABR genes, and a genotyping assay using high-quality SNPs (hqSNPs).</p><p>The definition for a correct result for MLST corresponds to the correct identification of each of the MLST alleles in the validation sequence as well as in the reference sequence analyzed in parallel. For all validation samples, each of the sequences of the seven housekeeping genes used in the typing scheme (or six genes for <named-content content-type="genus-species">Aeromonas hydrophila</named-content>) were identified correctly, resulting in 100% allele identification accuracy.</p><p>For ABR gene detection, the sequences of the ATCC susceptibility control strains generated during validation were analyzed by using ResFinder. Data analysis using ResFinder implies comparison of the sequence against each entry in the ResFinder database, which at the moment of validation contained sequences of 1,719 antibiotic resistance genes, resulting in a total of 1,719 tests performed for each validation sample. The results of ResFinder detection were compared to the resistance genes known to be present in the ATCC strains. Two of the sequenced isolates contained one resistance gene each, which were also detected by ResFinder, with no additional genes being identified. No resistance genes were detected by ResFinder in the samples suggested to be negative susceptibility controls and lacking resistance determinants. Thus, the accuracy of the assay for ABR gene detection was 100% (see the section on detection of resistance genes in ATCC strains using ResFinder in Document S1 in the supplemental material). Moreover, 13 reference sequences of Gram-negative (7 sequences) and Gram-positive (6 sequences) bacterial isolates with various resistance genes from the U.S. Food and Drug Administration (FDA)-CDC Antimicrobial Resistance Isolate (AR) Bank were used for <italic>in silico</italic> testing of ABR gene detection accuracy using ResFinder. The resistance genes in the AR Isolate Bank isolates were previously detected by the CDC using PCR-based methods (for primary resistance types) and by ResFinder (database last updated 2 June 2016). Our analysis of reference sequences with ResFinder (last updated 17 February 2017) confirmed the presence of all genes that were detected by PCR-based methods (<italic>n</italic> = 8), resulting in 100% accuracy. At the moment of analysis, ResFinder did not have the ability to detect the truncation of porin genes; therefore, porin-related resistance mechanisms mentioned in the CDC database could not be detected with ResFinder. Upon ResFinder analysis at the CDC, the isolates harbored a total of 83 resistance genes (representative of 57 different alleles). The detection of the genes by the CDC using ResFinder was replicated in this study, with few discrepancies (see the section on <italic>in silico</italic> detection of resistance genes in FDA-CDC isolates using ResFinder and Appendix 15 in Document S1 in the supplemental material). In the case of Gram-negative bacteria, all discrepancies were represented by additional genes detected in comparison with CDC ResFinder results, and these discrepancies were most likely caused by the database update; e.g., the <italic>aph(3&#x02032;)-Ia</italic> gene was not on the list of the genes detected with ResFinder by the CDC but was detected in the same sequences using ResFinder in this study. This particular gene discrepancy could be explained by the use of a later version of ResFinder, which was updated with additional aminoglycoside resistance genes. In the case of Gram-positive bacteria, several discrepancies were also caused by the additional genes detected in this study, but two genes that were present in the CDC results were missing from our results. These false-negative genes were detected by us but possessed &#x0003c;99% identity or an incomplete sequence and therefore were excluded from the final result. Nevertheless, the agreement between CDC ResFinder results and our ResFinder results was 99.97%.</p><p>For the 16S rRNA identification assay, variations in one gene were detected, so the species identification results as a whole (e.g., &#x0201c;<named-content content-type="genus-species">Escherichia coli</named-content>&#x0201d;) were considered a single test. The identity of the 16S rRNA sequences extracted from validation samples showed 100% matches with the 16S rRNA sequences extracted from the reference sequence.</p><p>To assess the accuracy of the genome-wide hqSNP-based genotyping assay, phylogenetic trees were built by using reference sequences and validation sequences, and the resulting trees were compared. For better comparison, we used at least five strains of the same species in the phylogenetic tree. The accuracy of the genotyping assay was determined by using two approaches: (i) topological similarity between the reference tree and the validation tree using Compare2Trees software and (ii) comparison of the clustering patterns of the validation tree and the reference tree. Therefore, in addition to the detection of SNP differences between the validation sequences and reference sequences, their effect on the final tree topology was also estimated to determine the accuracy of the genotyping assay. Phylogenetic trees were generated for five bacterial species (<named-content content-type="genus-species">Escherichia coli</named-content>, <named-content content-type="genus-species">Salmonella enterica</named-content>, <named-content content-type="genus-species">Staphylococcus aureus</named-content>, <named-content content-type="genus-species">Enterococcus faecalis</named-content>, and <named-content content-type="genus-species">Stenotrophomonas maltophilia</named-content>). All five validation trees had matching clustering patterns and 100% topological similarity with the corresponding reference trees (see Table S2 in the supplemental material).</p></sec><sec id="sec2-4"><title>Accuracy of the bioinformatics pipeline.</title><p>The accuracy of the bioinformatics pipeline for hqSNP-based genotyping by itself was assessed by the recapitulation of previously reported results using WGS raw reads of bacterial isolates included in two previous studies (<xref ref-type="table" rid="T2">Table 2</xref>). This was an additional assessment of the accuracy of the genotyping pipeline apart from the accuracy of base calling from in-house sequencing of the isolates. Phylogenetic analyses of isolates associated with outbreaks caused by a Gram-positive pathogen in the first study (MRSA [methicillin-resistant <named-content content-type="genus-species">Staphylococcus aureus</named-content>] study) (<xref rid="B38" ref-type="bibr">38</xref>) and a Gram-negative pathogen in the second study (<named-content content-type="genus-species">Salmonella</named-content> study) (<xref rid="B39" ref-type="bibr">39</xref>) were performed for the validation of the bioinformatics pipeline (<xref ref-type="fig" rid="F2">Fig. 2</xref>). The clustering of the validation tree completely replicated the clustering of the tree from the MRSA study (<xref rid="B38" ref-type="bibr">38</xref>) (<xref ref-type="fig" rid="F2">Fig. 2A</xref> and <xref ref-type="fig" rid="F2">B</xref>); e.g., isolates 4 and 5 were identical and clustered together according to the MRSA study, and the same results were shown in the validation tree, with isolates 4 and 5 sharing the same node. All conclusions with regard to the genetic relatedness of the isolates that can be drawn from the tree reported in the MRSA study could also be made for the analysis of the corresponding validation tree. The group of outbreak isolates from the MRSA study was compared with epidemiologically unrelated isolates suggested by the same study (no tree was available from that report). Phylogenetic analysis using the in-house bioinformatics pipeline showed that epidemiologically unrelated isolates did not cluster with the group of outbreak isolates and appeared to be genetically distant (<xref ref-type="fig" rid="F2">Fig. 2C</xref>). Thus, the resulting phylogenetic tree produced by our bioinformatics pipeline showed complete concordance with the epidemiological data.</p><table-wrap id="T2" orientation="portrait" position="float"><label>TABLE 2</label><caption><p>Summary of data from previous studies used for validation of the bioinformatics pipeline</p></caption><table frame="hsides" rules="groups"><colgroup span="1"><col align="left" span="1"/><col align="left" span="1"/><col align="left" span="1"/></colgroup><thead><tr valign="bottom"><th rowspan="2" align="left" colspan="1">Study parameter</th><th colspan="2" align="left" rowspan="1">Value for study<hr/></th></tr><tr valign="bottom"><th align="left" rowspan="1" colspan="1">MRSA study<xref ref-type="table-fn" rid="T2F1"><sup><italic>a</italic></sup></xref></th><th align="left" rowspan="1" colspan="1"><named-content content-type="genus-species">Salmonella</named-content> study<xref ref-type="table-fn" rid="T2F2"><sup><italic>b</italic></sup></xref></th></tr></thead><tbody><tr valign="top"><td rowspan="1" colspan="1">Microorganism</td><td rowspan="1" colspan="1">Methicillin-resistant <named-content content-type="genus-species">Staphylococcus aureus</named-content></td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Salmonella enterica</named-content> serovar Typhimurium</td></tr><tr valign="top"><td rowspan="1" colspan="1">Source of isolates</td><td rowspan="1" colspan="1">Human</td><td rowspan="1" colspan="1">Human</td></tr><tr valign="top"><td rowspan="1" colspan="1">No. of isolates analyzed</td><td rowspan="1" colspan="1">7 outbreak isolates (1 outbreak cluster) + 2 epidemiologically unrelated isolates</td><td rowspan="1" colspan="1">9 outbreak isolates (4 outbreak clusters) + 2 epidemiologically unrelated isolates</td></tr><tr valign="top"><td rowspan="1" colspan="1">Type of outbreak</td><td rowspan="1" colspan="1">Hospital-associated outbreak</td><td rowspan="1" colspan="1">Foodborne outbreaks</td></tr><tr valign="top"><td rowspan="1" colspan="1">Samples used for validation</td><td rowspan="1" colspan="1">P1, P2, P3, P4, P16, P21, and P25; an isolate identified by infectious control investigation as belonging to nonoutbreak ST1; a MRSA isolate identified by searching a microbiology database as belonging to nonoutbreak ST772</td><td rowspan="1" colspan="1">0803T57157, 0808S61603, 0808F31478, 0903R11327, 0811R10987, 0804R9234, 0810R10649, 0901M16079, 0110T17035, 1005R12913, and 1006R12965</td></tr><tr valign="top"><td rowspan="1" colspan="1">GenBank accession no. of corresponding samples</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="ERR070045">ERR070045</ext-link>, <ext-link ext-link-type="gen" xlink:href="ERR070042">ERR070042</ext-link>, <ext-link ext-link-type="gen" xlink:href="ERR070043">ERR070043</ext-link>, <ext-link ext-link-type="gen" xlink:href="ERR070044">ERR070044</ext-link>, <ext-link ext-link-type="gen" xlink:href="ERR124429">ERR124429</ext-link>, <ext-link ext-link-type="gen" xlink:href="ERR124433">ERR124433</ext-link>, <ext-link ext-link-type="gen" xlink:href="ERR128708">ERR128708</ext-link>, <ext-link ext-link-type="gen" xlink:href="ERR070041">ERR070041</ext-link>, <ext-link ext-link-type="gen" xlink:href="ERR072248">ERR072248</ext-link></td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="ERR277220">ERR277220</ext-link>, <ext-link ext-link-type="gen" xlink:href="ERR277226">ERR277226</ext-link>, <ext-link ext-link-type="gen" xlink:href="ERR277223">ERR277223</ext-link>, <ext-link ext-link-type="gen" xlink:href="ERR277222">ERR277222</ext-link>, <ext-link ext-link-type="gen" xlink:href="ERR277224">ERR277224</ext-link>, <ext-link ext-link-type="gen" xlink:href="ERR277221">ERR277221</ext-link>, <ext-link ext-link-type="gen" xlink:href="ERR277227">ERR277227</ext-link>, <ext-link ext-link-type="gen" xlink:href="ERR277228">ERR277228</ext-link>, <ext-link ext-link-type="gen" xlink:href="ERR277203">ERR277203</ext-link>, <ext-link ext-link-type="gen" xlink:href="ERR277233">ERR277233</ext-link>, <ext-link ext-link-type="gen" xlink:href="ERR277234">ERR277234</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">No. of clusters in the study tree</td><td rowspan="1" colspan="1">1</td><td rowspan="1" colspan="1">4</td></tr><tr valign="top"><td rowspan="1" colspan="1">No. of clusters in the validation tree</td><td rowspan="1" colspan="1">1</td><td rowspan="1" colspan="1">4</td></tr><tr valign="top"><td rowspan="1" colspan="1">No. of outbreak isolates in each cluster in the study tree</td><td rowspan="1" colspan="1">7 for cluster 1</td><td rowspan="1" colspan="1">2 for cluster 1, 3 for cluster 2, 2 for cluster 3, and 2 for cluster 4</td></tr><tr valign="top"><td rowspan="1" colspan="1">No. of outbreak isolates in each cluster in the validation tree</td><td rowspan="1" colspan="1">7 for cluster 1</td><td rowspan="1" colspan="1">2 for cluster 1, 3 for cluster 2, 2 for cluster 3, and 2 for cluster 4</td></tr><tr valign="top"><td rowspan="1" colspan="1">No. of epidemiologically unrelated isolates in the set</td><td rowspan="1" colspan="1">2</td><td rowspan="1" colspan="1">2</td></tr><tr valign="top"><td rowspan="1" colspan="1">No. of epidemiologically unrelated isolates that clustered with outbreak isolates</td><td rowspan="1" colspan="1">0</td><td rowspan="1" colspan="1">0</td></tr><tr valign="top"><td rowspan="1" colspan="1">% agreement {[(no. of outbreak isolates clustered correctly in the validation tree) &#x000d7; 100]/(total no. of outbreak isolates that clustered together in the study tree)}</td><td rowspan="1" colspan="1">(7 &#x000d7; 100/7) = 100</td><td rowspan="1" colspan="1">(9 &#x000d7; 100/9) = 100</td></tr></tbody></table><table-wrap-foot><fn fn-type="other" id="T2F1"><label>a</label><p>See reference <xref rid="B38" ref-type="bibr">38</xref>. ST1, sequence type 1; ST772, sequence type 772.</p></fn><fn fn-type="other" id="T2F2"><label>b</label><p>See reference <xref rid="B39" ref-type="bibr">39</xref>.</p></fn></table-wrap-foot></table-wrap><fig id="F2" orientation="portrait" position="float"><label>FIG 2</label><caption><p>Bioinformatics pipeline validation with outbreak isolates from two previously published studies. (A) Phylogenetic tree of outbreak isolates reported in the &#x0201c;MRSA study&#x0201d; by Harris et al. (<xref rid="B38" ref-type="bibr">38</xref>). The isolates from the MRSA study that were picked for validation are indicated by arrows and numbers assigned for purposes of validation (1 to 7). (B) Phylogenetic tree validation using the samples from the MRSA study and the validation bioinformatics pipeline. The same isolates in the original tree and the validation tree are marked with the same numbers. (C) Comparison of the group of related isolates (isolates 1 to 7) from the MRSA study with epidemiologically unrelated isolates from the same study using the validation bioinformatics pipeline. (D) Phylogenetic tree combining epidemiologically related and nonrelated isolates reported in the &#x0201c;<named-content content-type="genus-species">Salmonella</named-content> study&#x0201d; by Leekitcharoenphon et al. (<xref rid="B39" ref-type="bibr">39</xref>). The isolates from the <named-content content-type="genus-species">Salmonella</named-content> study that were picked for validation are marked with green node circles and have the numbers 1 to 11 assigned for purposes of validation. Epi, epidemiologically. (E) Validation phylogenetic tree generated for the samples from the <named-content content-type="genus-species">Salmonella</named-content> study using the in-house bioinformatics pipeline. The same isolates in the tree from the <named-content content-type="genus-species">Salmonella</named-content> study and the validation tree are marked with the same numbers.</p></caption><graphic xlink:href="zjm9990956030002"/></fig><p>From the <named-content content-type="genus-species">Salmonella</named-content> study (<xref rid="B39" ref-type="bibr">39</xref>), we selected nine isolates that were representative of four independent outbreaks and two isolates that were epidemiologically unrelated controls (<xref ref-type="fig" rid="F2">Fig. 2D</xref>). The clustering of the validation tree was identical to the clustering of the tree from the <named-content content-type="genus-species">Salmonella</named-content> study. For example, isolates 6 and 7 were part of the same outbreak, while isolate 8 was an epidemiologically unrelated control used in that study. In agreement with the epidemiological data and the tree from the <named-content content-type="genus-species">Salmonella</named-content> study, the validation tree showed that isolates 6 and 7 clustered together but not with isolate 8 (<xref ref-type="fig" rid="F2">Fig. 2E</xref>). All observations about the genetic relatedness of the isolates drawn from the tree from the <named-content content-type="genus-species">Salmonella</named-content> study could be replicated from the analysis of the validation tree. In summary, based on <italic>in silico</italic> analysis of data from both studies, 100% accuracy of our bioinformatics pipeline was established for phylogenetic analysis.</p></sec><sec id="sec2-5"><title>WGS repeatability and reproducibility.</title><p>Repeatability (within-run precision) was established as the concordance of the assay results and quality metrics obtained for a sample tested multiple times within the same sequencing run. Reproducibility (between-run precision) was assessed as the consistency of the assay results and quality metrics for the same sample sequenced on different occasions. Thirty-four validation samples each were sequenced three times in the same sequencing run (for repeatability) and three times in different runs (for reproducibility). For within-run replicates, one DNA extract was used, but independent library preparations were done, with the final samples being included in a single sequencing run. Therefore, for each sample, the numbers of within-run replicates and between-run replicates were 3 each, and the total numbers of repeated results were 5. All quality parameters (depth of coverage, uniformity of coverage, and accuracy of base calling [<italic>Q</italic> score], etc.) did not change significantly for within- and between-run replicates, as determined by a two-tailed <italic>t</italic> test. For the quality values for all sequenced samples, see the section on inter- and intra-assay agreement and Appendix 8 in Document S1 in the supplemental material.</p><p>The reproducibility and repeatability of the WGS assay were evaluated with two methods: evaluation of base calling reproducibility and repeatability per replicate and evaluation of base calling reproducibility and repeatability relative to genome size. All validation samples except C50 yielded identical whole-genome sequences for all three within-run replicates. One out of three within-run replicates for isolate C50 (<named-content content-type="genus-species">Pseudomonas aeruginosa</named-content> ATCC 27853) had one SNP difference from other within-run replicates (Table S3). The repeatability per replicate was 99.02%. Three validation samples had one out of three between-run replicates that differed from the other two between-run replicates. Sample C47 (<named-content content-type="genus-species">Staphylococcus epidermidis</named-content> ATCC 12228) had one between-run replicate with two SNPs that differed from the other replicates. Samples C49 (<named-content content-type="genus-species">Streptococcus pneumoniae</named-content> ATCC 6305) and C55 (<named-content content-type="genus-species">Escherichia coli</named-content> ATCC 25922) each had one between-run replicate that differed from other replicated sequences by one SNP. The reproducibility of base calling per replicate was 97.05%. Both the reproducibility and repeatability of base calling relative to the genome size (in relation to the total number of base pairs of the covered genome size) were &#x0003e;99.9999%.</p><p>We also estimated the reproducibility and repeatability of MLST and 16S rRNA identification assays. For MLST, a total of 441 alleles were analyzed for within- or between-run replicates. Each single allele in all validation samples was identified consistently among within- and between-run replicates. Within- and between-run replicates had repeatable/reproducible sequences of 16S rRNA genes and resulted in the consistent identification of the species. The reproducibility and repeatability of allele detection and species identification within and between runs were 100%.</p></sec><sec id="sec2-6"><title>WGS sensitivity and specificity.</title><p>The sensitivity of WGS was assessed as (i) analytical sensitivity (minimum coverage that allows accurate SNP detection) and (ii) diagnostic sensitivity (the likelihood that a WGS assay will detect sequence variation when it is present) (this value reflects the false-negative rate of the assay).</p><p>The specificity of WGS was determined as (i) analytical specificity (the ability of an assay to detect only the intended target in the presence of potentially cross-reacting nucleotide sequences) and (ii) diagnostic specificity (the probability that a WGS assay will not detect sequence variations when none are present) (this value reflects the false-positive rate of the assay).</p></sec><sec id="sec2-7"><title>Analytical sensitivity.</title><p>The limit of detection (LOD) is traditionally defined as &#x0201c;the lowest actual concentration of an analyte in a specimen that can be consistently detected &#x02026; with acceptable precision&#x0201d; (<xref rid="B40" ref-type="bibr">40</xref>). The LOD in this sense is not applicable to WGS, which utilizes pure bacterial cultures as starting material; the amount of the analyte (genomic DNA) that is added to the reaction mixture is strictly standardized, and the DNA concentration of each sample is measured by a fluorometric method before each assay. In our workflow, we established the minimum amount of the starting DNA input to be 1 ng (at a concentration of 0.2 ng/&#x003bc;l) and did not process DNA extracts with concentrations of &#x0003c;1 ng/&#x003bc;l. Instead, we determined the LOD of SNP detection (LOD<sub>SNP</sub>) by establishing the lowest coverage that allows accurate SNP calling. The LOD<sub>SNP</sub> was estimated by modeling different mapping coverages and estimating the number of SNPs called at each of the coverage values. Nine samples representative of different species were <italic>in silico</italic> downsampled to coverages of 60&#x000d7;, 50&#x000d7;, 40&#x000d7;, 30&#x000d7;, 20&#x000d7;, 15&#x000d7;, 10&#x000d7;, and 5&#x000d7;. The LOD<sub>SNP</sub> was established to be 60&#x000d7;, as it was the lowest coverage which yielded accurate SNP detection for all of the samples (see Table S4 and the section on analytical sensitivity of SNP detection in Document S1 in the supplemental material).</p></sec><sec id="sec2-8"><title>Analytical specificity.</title><p>Analytical specificity is referred to as the &#x0201c;ability of an assay to detect only the intended target and that quantification of the target is not affected by cross-reactivity from related or potentially interfering nucleic acids or specimen-related conditions&#x0201d; (<xref rid="B40" ref-type="bibr">40</xref>). Since our WGS pipeline is not intended for clinical specimens directly, the conditions interfering with fragmentation, amplification, and sequencing processes are minimized and are monitored via multiple QC steps, including DNA and library purity and concentration measurements. However, interference from contaminating nucleotide sequences is much more consequential. Analytical specificity (interference) was determined by creating sequencing files containing a mixture of the reads from two different samples <italic>in silico</italic>, thus mimicking contamination and demonstrating its effect on mapping metrics (percentage of reads mapped, percentage of the reference sequence covered, etc.) and SNP detection (see Table S5 and the section on the analytical specificity of SNP detection in Document S1 in the supplemental material). As expected, contaminating reads led to a decrease in the percentage of mapped reads and an increase in the portion of unmapped reads. The percentage of reads in pairs decreased for samples containing contaminating reads. Modeled contamination with <named-content content-type="genus-species">E. coli</named-content> C1 and <named-content content-type="genus-species">M. tuberculosis</named-content> C57 sequencing reads did not cause any change in called SNPs. Contamination with any of the other reads led to additional SNPs being called both between the compared samples and with the reference sequence. In the sample contaminated with <named-content content-type="genus-species">S. enterica</named-content> C75, in addition to nonspecific SNPs, one of the SNPs detected previously was missing. The bioinformatics pipeline had a certain tolerance of contaminating reads depending on the nature of the contamination.</p><p>Diagnostic sensitivity and specificity were estimated for genotyping and MLST assays.</p></sec><sec id="sec2-10"><title>Diagnostic sensitivity and specificity of genotyping.</title><p>To estimate the diagnostic sensitivity and specificity of WGS-based genotyping, the hqSNP phylogenetic trees generated from the validation sequences were compared to the trees generated from the reference sequences for the same strain. In the case of whole-genome sequence genotyping, the true sequence variation (SNP), which was not detected, represents a false-negative result. Changes introduced into the DNA sequence during library preparation/sequencing or data analysis errors could result in false-positive sequence variations. All generated validation trees repeated clustering and had 100% topological similarity to the corresponding reference trees, indicating the absence of false-negative or false-positive results of the genotyping assay. The hqSNP-based genotyping assay was 100% sensitive and specific.</p></sec><sec id="sec2-11"><title>Diagnostic sensitivity and specificity of MLST.</title><p>The sequence types of validation sequences and reference sequences were determined by using organism-specific MLST databases. For MLST, the number of true-positive results corresponds to the number of alleles correctly identified in the validation samples. For true-negative results, we performed a comparison of validation sequences against MLST databases for unmatched species, e.g., a search of alleles for the <named-content content-type="genus-species">Escherichia coli</named-content> C1 validation sample against the MLST database for <named-content content-type="genus-species">Salmonella enterica</named-content>. In the latter case, the MLST assay was expected not to identify any alleles. All alleles in the positive validation samples were identified correctly. None of the alleles were identified in the negative controls. Both the diagnostic sensitivity and analytical specificity of the <italic>in silico</italic> MLST assay were 100%.</p></sec><sec id="sec2-12"><title>Reportable range for WGS.</title><p>The following information about the sequenced genome was collected for the reportable range: genome-wide hqSNPs, housekeeping genes used in MLST schemes, 16S rRNA genes, and antibiotic resistance genes included in the ResFinder database. A reporting language was developed to assist in the interpretation of results by an end user with or without specific WGS knowledge. The appropriate template and examples are provided in Document S1 in the supplemental material.</p></sec><sec id="sec2-13"><title>Quality assurance and quality control for WGS.</title><p>QA and QC measures were developed to ensure high quality and consistency of routine testing using the MiSeq Illumina platform. QC was performed during the preanalytical (DNA isolation and library preparation), analytical (quality metrics of the sequencing run), and postanalytical (data analysis) steps of WGS. Five QC checkpoints were implemented throughout the WGS procedure: DNA template QC, library QC, sequencing run QC, raw data QC, and data analysis QC. The quality parameters evaluated at each of these QC checkpoints are summarized in <xref ref-type="fig" rid="F3">Fig. 3</xref>. The QA and QC manual established for WGS applications is presented in Document S1 in the supplemental material. Based on the preliminary quality thresholds, all runs passed from the first attempt, and none of the samples had to be resequenced. The final quality thresholds were set at the lower border of quality values, which still allowed the generation of accurate and reproducible assay results (phylogenetic analysis, MLST, 16S identification, or ABR detection). The WGS quality cutoff values are summarized in Table S6 in the supplemental material. We determined the optimal depth of coverage to be &#x02265;60&#x000d7; based on the accuracy of SNP detection at various simulated genome coverages. Table S6 does not reflect the optimal data quality parameters but merely the thresholds below which data will be rejected and the sample will have to be resequenced.</p><fig id="F3" orientation="portrait" position="float"><label>FIG 3</label><caption><p>WGS quality control scheme. The preanalytical, analytical, and postanalytical steps of WGS are shown on the left in orange boxes. The QC metrics that are being evaluated at each step are presented in light green boxes. Three vertical blocks represent the types of samples to which major QC metrics are being applied: test samples, positive controls, and negative controls. The overlap between the boxes designating QC metrics and the vertical blocks shows which QC metrics are applicable to which of the three types of samples.</p></caption><graphic xlink:href="zjm9990956030003"/></fig><p>We employed two levels of positive and negative controls. First, as an internal positive control, we used spiked-in PhiX Control (Illumina Inc., San Diego, CA, USA) and evaluated its sequencing error rate. As an internal negative control, at the demultiplexing step, we called index combinations, which did not correspond to any samples in the current sequencing run but were used in the previous run. This negative-index-combination control allowed us to capture carryover contamination with the library fragments generated in the previous runs. Second, as an external positive control, we included <named-content content-type="genus-species">E. coli</named-content> strain ATCC 25922 processed from the DNA extraction step all the way to the data analysis step. As an external negative control, a no-template control was processed through all WGS steps starting with DNA extraction. Additional positive controls representing one Gram-positive strain and one <named-content content-type="genus-species">M. tuberculosis</named-content> strain were introduced solely for the DNA extraction step in cases when the corresponding types of samples were processed (to control for differences in the DNA extraction protocols). The quality cutoff values for the controls can be found in Table S6.</p></sec><sec id="sec2-14"><title>Validation summary.</title><p>Analytical sensitivity (LOD of SNP detection) was established at a 60&#x000d7; depth of genome coverage. Analytical specificity in the presence of interfering sequencing reads was demonstrated for different types of contamination. The WGS assay was shown to have &#x0003e;99.9% accuracy, &#x0003e;99.9% reproducibility/repeatability, and 100% diagnostic specificity and sensitivity. These parameters met the CLIA requirements for laboratory-developed tests (LDTs).</p></sec></sec><sec sec-type="discussion" id="sec3"><title>DISCUSSION</title><p>This study established the workflow and reference materials for the validation of WGS for routine use in PHLs according to CLIA guidelines for LDTs. The validation panel, sequencing analytics, and raw sequences generated during this study could serve as resources for future multilaboratory comparisons of WGS data. Additionally, the WGS performance specifications and modular validation template developed in this study could be easily adopted for the validation of other platforms and reagent kits. These results could strengthen the concept of unified laboratory standards for WGS enunciated by some professional organizations, including the Global Microbial Identifier (GMI) initiative (<xref rid="B30" ref-type="bibr">30</xref>, <xref rid="B31" ref-type="bibr">31</xref>, <xref rid="B33" ref-type="bibr">33</xref>, <xref rid="B41" ref-type="bibr">41</xref>). A few other groups have also highlighted the challenges and solutions for the implementation of WGS in clinical and public health microbiology laboratories (<xref rid="B21" ref-type="bibr">21</xref>, <xref rid="B42" ref-type="bibr">42</xref>).</p><p>Using a combination of reference strains and corresponding publicly available genomes, we devised a framework of &#x0201c;best practices&#x0201d; for the quality management of the integrated &#x0201c;wet-lab&#x0201d; and &#x0201c;dry-lab&#x0201d; WGS workflows (&#x0201c;pipeline&#x0201d;). The importance of reference materials for the validation and QC of wet- and dry-lab WGS processes was noted previously (<xref rid="B28" ref-type="bibr">28</xref>, <xref rid="B31" ref-type="bibr">31</xref>, <xref rid="B33" ref-type="bibr">33</xref>). Unlike human genomics (<xref rid="B43" ref-type="bibr">43</xref>), there is no well-established resource of reference materials for the validation of WGS in public health microbiology laboratories. The main challenge for creating a customized validation set is a lack of reference materials, in other words, strains that can be easily acquired by PHLs and that have high-quality, well-characterized reference genomes available. While the use of genomic sequences of ATCC strains from the NCBI is an option, it is far from being perfect. The genome sequences available from public databases are generated by using different methods, chemistries, and platforms, which might yield different error rates; therefore, deposited sequences are not guaranteed to be free of such errors. With the ongoing development of new sequencing technologies and improvements in the quality of sequences, it is likely that the genomes sequenced with old methods might appear less accurate than the sequences generated by the laboratory during validation. Additionally, there is the possibility of mutation accumulation in control strains, e.g., ATCC cultures, after many rounds of subculture in different laboratories. Overall, there is no gold standard available yet for use as reference material for the validation of WGS for pathogenic bacteria. Nevertheless, NCBI, ENA, and similar public genome depositories remain the best resources for the genomic sequences of control strains that could be used for validation, such as the FDA Database for Regulatory-Grade Microbial Sequences (FDA-ARGOS) (<ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/bioproject/231221">https://www.ncbi.nlm.nih.gov/bioproject/231221</ext-link>). In the future, it would be helpful to have a network/agency/bank that could distribute panels of sequenced and curated isolates with genomic sequences available online for WGS validation. In the absence of such a resource, we developed a validation set of microorganisms, which can be used for future validations of WGS platforms. Bacterial genomes vary in size, GC content, the abundance of repeat regions, and other properties, which affect WGS results. We created a validation set that reflects the diversity of microorganisms with various genome sizes and GC contents that are routinely sequenced by PHLs. Different species of Gram-positive and Gram-negative microorganisms and <named-content content-type="genus-species">M. tuberculosis</named-content> were included to account for the differences in DNA extraction procedures as well.</p><p>Samples were validated based on four core elements reflected in the formal assay report: 16S rRNA-based species identification, <italic>in silico</italic> MLST, hqSNP phylogenetic analysis, and the presence of ABR determinants. Specific WGS assays should be validated, in addition to platform accuracy, to account for the ability to reach decent coverage in genomic areas of interest and tolerate certain base-calling error rates. Overall, we achieved high accuracy, reproducibility, repeatability, diagnostic sensitivity, and specificity for all assay analytes ranging from 99 to 100%, which exceeds the 90% threshold for LDT performance parameters per CLIA requirements. These findings are in agreement with several recent reports of 93% to 100% accuracies in WGS identification, subtyping, and antimicrobial resistance gene detection for a number of pathogens (<xref rid="B44" ref-type="bibr">44</xref><xref ref-type="bibr" rid="B45">&#x02013;</xref><xref rid="B47" ref-type="bibr">47</xref>). We determined SNP detection at coverages of 5&#x000d7; to 60&#x000d7; and established an LOD for SNP detection at 60&#x000d7;, which was the lowest coverage that yielded accurate SNP detection in all of the samples. We determined the effect of contaminating reads on the analytical specificity of WGS. We point out that CLIA LDT performance parameters are difficult to apply to WGS analysis. For example, in determining the accuracy of the platform, CLIA would allow for up to 10% of base calls to be incorrect, which, in the case of the &#x0223c;5-Mb <named-content content-type="genus-species">E. coli</named-content> genome, would mean 500,000 inaccurate SNPs, which is clearly an unacceptable error rate for any WGS application. Genome-wide hqSNP detection can be used as a way to validate platform accuracy since it allows assessment of the accuracy of base calling throughout the genome. Additionally, validation of the actual hqSNP genotyping pipeline would be required, as the phylogenetic assay takes into account not only the number of SNPs detected between isolates but also how it affects the tree topology, because one erroneous SNP is unlikely to change genotyping conclusions in most instances.</p><p>Successful CLIA integration for WGS would also require a laboratory to implement a continuous performance measurement plan via an internal or external PT program. Such PT programs are under active development, with the GMI network, the Genetic Testing Reference Materials Coordination Program (Get-RM), the Genome in a Bottle (GIAB) Consortium, and CDC PulseNet NextGen being the most prominent (<xref rid="B31" ref-type="bibr">31</xref>, <xref rid="B43" ref-type="bibr">43</xref>). A set of generic standards has been proposed by the CAP molecular pathology checklist (<xref rid="B30" ref-type="bibr">30</xref>). The proposed quality standards include both live cultures as well as &#x0201c;sequence-only&#x0201d; formats for a comprehensive assessment of the WGS pipeline. Our validation set of isolates is amenable to both internal and external quality assurance testing. In preliminary internal PT, we were able to successfully assess the entire workflow and personnel performance (details not shown).</p><p>Microbial WGS remains a dynamic technology, and therefore, any validated pipeline is unlikely to remain static. For this reason, the implementation of a modular validation template becomes crucial for the seamless and timely introduction of changes to the pipeline; e.g., we had to carry out several amendments to the protocol since its implementation in the laboratory. These amendments included a new processing algorithm for highly contagious pathogens and minor adjustments in the data analysis algorithm. The changes were accomplished via minor modifications of the pipeline with corroborative testing using a modular validation template developed by us. We also performed a two-sequencer agreement study to allow the processing of increased volumes of samples (see Document S1 in the supplemental material). The WGS report format could pose challenges for the end user. The report format in our study was designed to convey assay results to an end user with or without extensive knowledge of WGS. Additional disclaimers were used to avoid erroneous interpretations of results, for example, the disclaimer that the detection of antibiotic resistance genes by WGS does not guarantee the resistance of the strain <italic>in vivo</italic> and that phenotypic susceptibility testing is required to confirm antimicrobial resistance (see section on results reporting in Document S1 in the supplemental material).</p><p>This study possesses certain limitations. First, only a limited number of WGS-based assays were included in the validation study based on the most common PHL applications. Other types of WGS assays/analytics would have to be validated in a similar manner to determine the performance specifications, which are required to generate accurate and reproducible results, e.g., a threshold for the base-calling accuracy of the platform or a depth of coverage of specific genes. Second, not all validation set samples had available NCBI database entries to provide comparison sets. Third, the absence of any eukaryotic pathogens in the current validation scheme would require the implementation of a specialized pipeline for pathogenic fungi and parasites. Finally, the presence of extrachromosomal elements or polymorphic or repeat regions can negatively affect reference mapping and the <italic>de novo</italic> assembly of the reads generated on short-read platforms such as the Illumina platform; however, the direct effect of such genomic features on assay performance was not investigated in this study.</p><p>As the clinical and public microbiology community implements high-quality WGS, it would be opportune to consider the available models for the delivery of these services (<xref rid="B48" ref-type="bibr">48</xref>). Since their inception, most WGS activities have taken place in reference facilities with rather large supporting infrastructures. Although inevitable in the early stages, the centralization of services presents several challenges, such as turnaround time and access to specific expertise on the local population structure of a given pathogen, which is crucial for the management of infectious diseases at the local and regional levels. WGS services could be delivered locally and more easily with affordable sequencers, standardized reagents, and well-defined quality metrics. The local-delivery model would also be more responsive to the needs of the target clients and enhance the adoption of WGS across health care systems. Another alternative is a hybrid model with complementary central and local services to balance the need for speed with advanced expertise and resources (<xref rid="B48" ref-type="bibr">48</xref>). Two prominent examples of hybrid models in the United States are the FDA GenomeTrakr network for the tracking of foodborne pathogens and the CDC Advanced Molecular Detection (AMD) initiative for the improved surveillance of infectious diseases (<xref rid="B49" ref-type="bibr">49</xref>, <xref rid="B50" ref-type="bibr">50</xref>). The AMD and GenomeTrakr frameworks rely on a participatory model with enhanced analysis, curation, and data storage at a central site. However, these resource-intensive networks focus on few selected pathogens at present. Notably, there are still significant challenges for the implementation of comprehensive WGS services at the local level (<xref rid="B42" ref-type="bibr">42</xref>, <xref rid="B51" ref-type="bibr">51</xref>). We hope that the quality framework proposed in the present study will advance the localization of comprehensive WGS services in clinical and public health laboratories.</p><p>In summary, the salient achievements of this study included (i) the establishment of performance specifications for WGS applications in PHLs according to CLIA guidelines, (ii) development of quality assurance and quality control measures, (iii) a reporting format for end users with or without WGS expertise, (iv) availability of a validation set of microorganisms to be used for future validations, and (v) creation of a modular template for the validation of WGS processes in PHLs.</p></sec><sec sec-type="materials|methods" id="sec4"><title>MATERIALS AND METHODS</title><sec id="sec4-1"><title>Bacterial isolates and sequences.</title><p>A set of 34 bacterial isolates representing the typical workflow in PHLs was used for validation and quality control of WGS. These isolates included 10 <named-content content-type="genus-species">Enterobacteriaceae</named-content> isolates, 5 Gram-positive bacterial pathogens, 5 Gram-negative nonfermenting bacterial pathogens, 9 <named-content content-type="genus-species">Mycobacterium tuberculosis</named-content> isolates, and 5 miscellaneous bacterial pathogens (<xref ref-type="table" rid="T3">Tables 3</xref> and <xref ref-type="table" rid="T4">4</xref>). The bacterial pathogens for the validation set were selected to represent various genome sizes and GC contents and to account for differences in the DNA extraction protocols. We selected ATCC strains with whole-genome sequences available from the NCBI; the NCBI sequences served as references for the validation sequencing results. Several isolates with reference sequences available from the CDC were also included in the validation set.</p><table-wrap id="T3" orientation="portrait" position="float"><label>TABLE 3</label><caption><p>List of strains used for validation and corresponding reference materials<xref ref-type="table-fn" rid="T3F1"><sup><italic>a</italic></sup></xref></p></caption><table frame="hsides" rules="groups"><colgroup span="1"><col align="left" span="1"/><col align="left" span="1"/><col align="left" span="1"/><col align="left" span="1"/></colgroup><thead><tr valign="bottom"><th rowspan="2" align="left" colspan="1">MDL ID</th><th rowspan="2" align="left" colspan="1">Species</th><th colspan="2" align="left" rowspan="1">Reference genome<hr/></th></tr><tr valign="bottom"><th align="left" rowspan="1" colspan="1">NCBI strain</th><th align="left" rowspan="1" colspan="1">NCBI accession no.</th></tr></thead><tbody><tr valign="top"><td rowspan="1" colspan="1">C1</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Escherichia coli</named-content> O157:H7 CDC EDL 933</td><td rowspan="1" colspan="1"><bold>O157:H7 CDC EDL 933</bold></td><td rowspan="1" colspan="1"><bold><ext-link ext-link-type="gen" xlink:href="NZ_CP008957.1">NZ_CP008957.1</ext-link></bold></td></tr><tr valign="top"><td rowspan="1" colspan="1">C3</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Escherichia coli</named-content> ATCC 8739</td><td rowspan="1" colspan="1"><bold>ATCC 8739</bold></td><td rowspan="1" colspan="1"><bold><ext-link ext-link-type="gen" xlink:href="NC_010468.1">NC_010468.1</ext-link></bold></td></tr><tr valign="top"><td rowspan="1" colspan="1">C55</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Escherichia coli</named-content> ATCC 25922</td><td rowspan="1" colspan="1"><bold>ATCC 25922</bold></td><td rowspan="1" colspan="1"><bold><ext-link ext-link-type="gen" xlink:href="NZ_CP009072.1">NZ_CP009072.1</ext-link></bold></td></tr><tr valign="top"><td rowspan="1" colspan="1">C4</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Enterobacter cloacae</named-content> ATCC 13047</td><td rowspan="1" colspan="1"><bold>ATCC 13047</bold></td><td rowspan="1" colspan="1"><bold><ext-link ext-link-type="gen" xlink:href="NC_014121">NC_014121</ext-link></bold></td></tr><tr valign="top"><td rowspan="1" colspan="1">C6</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Salmonella enterica</named-content> serovar Typhimurium ATCC 14028</td><td rowspan="1" colspan="1"><bold>14028S</bold></td><td rowspan="1" colspan="1"><bold><ext-link ext-link-type="gen" xlink:href="NC_016856">NC_016856</ext-link></bold></td></tr><tr valign="top"><td rowspan="1" colspan="1">C5</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Staphylococcus aureus</named-content> ATCC 25923</td><td rowspan="1" colspan="1"><bold>ATCC 25923</bold></td><td rowspan="1" colspan="1"><bold><ext-link ext-link-type="gen" xlink:href="NZ_CP009361">NZ_CP009361</ext-link></bold></td></tr><tr valign="top"><td rowspan="1" colspan="1">C46</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Enterococcus faecalis</named-content> ATCC 29212</td><td rowspan="1" colspan="1"><bold>ATCC 29212</bold></td><td rowspan="1" colspan="1"><bold><ext-link ext-link-type="gen" xlink:href="NZ_CP008816">NZ_CP008816</ext-link></bold></td></tr><tr valign="top"><td rowspan="1" colspan="1">C47</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Staphylococcus epidermidis</named-content> ATCC 12228</td><td rowspan="1" colspan="1"><bold>ATCC 12228</bold></td><td rowspan="1" colspan="1"><bold><ext-link ext-link-type="gen" xlink:href="NC_004461">NC_004461</ext-link></bold></td></tr><tr valign="top"><td rowspan="1" colspan="1">C48</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Staphylococcus saprophyticus</named-content> ATCC 15305</td><td rowspan="1" colspan="1"><bold>ATCC 15305</bold></td><td rowspan="1" colspan="1"><bold><ext-link ext-link-type="gen" xlink:href="NC_007350">NC_007350</ext-link></bold></td></tr><tr valign="top"><td rowspan="1" colspan="1">C49</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Streptococcus pneumoniae</named-content> ATCC 6305</td><td rowspan="1" colspan="1">ATCC 700669</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="FM211187">FM211187</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">C50</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Pseudomonas aeruginosa</named-content> ATCC 27853</td><td rowspan="1" colspan="1">FRD1</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NZ_CP010555">NZ_CP010555</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">C51</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Stenotrophomonas maltophilia</named-content> ATCC 13637</td><td rowspan="1" colspan="1"><bold>ATCC 13637</bold></td><td rowspan="1" colspan="1"><bold><ext-link ext-link-type="gen" xlink:href="NZ_CP008838">NZ_CP008838</ext-link></bold></td></tr><tr valign="top"><td rowspan="1" colspan="1">C52</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Legionella pneumophila</named-content> SG-12 ATCC 43290</td><td rowspan="1" colspan="1"><bold>ATCC 43290</bold></td><td rowspan="1" colspan="1"><bold><ext-link ext-link-type="gen" xlink:href="NC_016811">NC_016811</ext-link></bold></td></tr><tr valign="top"><td rowspan="1" colspan="1">C53</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Moraxella catarrhalis</named-content> 87A-3084</td><td rowspan="1" colspan="1">ATCC 25240</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NZ_CP008804">NZ_CP008804</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">C54</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Acinetobacter baumannii</named-content> ATCC 17945</td><td rowspan="1" colspan="1">PKAB07</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NZ_CP006963">NZ_CP006963</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">C103</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Bacteroides fragilis</named-content> ATCC 25285</td><td rowspan="1" colspan="1">638R</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NC_016776">NC_016776</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">C104</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Haemophilus influenzae</named-content> ATCC 10211</td><td rowspan="1" colspan="1">KR494</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NC_022356">NC_022356</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">C2</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Aeromonas hydrophila</named-content> ATCC 7966</td><td rowspan="1" colspan="1"><bold>ATCC 7966</bold></td><td rowspan="1" colspan="1"><bold><ext-link ext-link-type="gen" xlink:href="NC_008570">NC_008570</ext-link></bold></td></tr><tr valign="top"><td rowspan="1" colspan="1">C105</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Corynebacterium jeikeium</named-content> ATCC 43734</td><td rowspan="1" colspan="1"><bold>ATCC 43734</bold></td><td rowspan="1" colspan="1"><bold><ext-link ext-link-type="gen" xlink:href="GG700813">GG700813</ext-link></bold>, <bold><ext-link ext-link-type="gen" xlink:href="GG700833">GG700833</ext-link></bold></td></tr><tr valign="top"><td rowspan="1" colspan="1">C106</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Neisseria gonorrhoeae</named-content> ATCC 49226</td><td rowspan="1" colspan="1">MS11</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NC_022240">NC_022240</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">C56</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Mycobacterium tuberculosis</named-content></td><td rowspan="1" colspan="1">H37Rv</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NC_000962.3">NC_000962.3</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">C57</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Mycobacterium tuberculosis</named-content></td><td rowspan="1" colspan="1">H37Rv</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NC_000962.3">NC_000962.3</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">C58</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Mycobacterium tuberculosis</named-content></td><td rowspan="1" colspan="1">H37Rv</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NC_000962.3">NC_000962.3</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">C59</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Mycobacterium tuberculosis</named-content></td><td rowspan="1" colspan="1">H37Rv</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NC_000962.3">NC_000962.3</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">C61</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Mycobacterium tuberculosis</named-content></td><td rowspan="1" colspan="1">H37Rv</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NC_000962.3">NC_000962.3</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">C65</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Mycobacterium tuberculosis</named-content></td><td rowspan="1" colspan="1">H37Rv</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NC_000962.3">NC_000962.3</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">C67</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Mycobacterium tuberculosis</named-content></td><td rowspan="1" colspan="1">H37Rv</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NC_000962.3">NC_000962.3</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">C68</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Mycobacterium tuberculosis</named-content></td><td rowspan="1" colspan="1">H37Rv</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NC_000962.3">NC_000962.3</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">C69</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Mycobacterium tuberculosis</named-content></td><td rowspan="1" colspan="1">H37Rv</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NC_000962.3">NC_000962.3</ext-link></td></tr></tbody></table><table-wrap-foot><fn fn-type="other" id="T3F1"><label>a</label><p>Boldface type indicates reference strains for which genomes are available from the NCBI database. Lightface type indicates cases where the genome is not available from the NCBI database and an alternative reference genome was used for mapping. MDL, Microbial Diseases Laboratory, California Department of Public Health, Richmond, CA.</p></fn></table-wrap-foot></table-wrap><table-wrap id="T4" orientation="portrait" position="float"><label>TABLE 4</label><caption><p>List of strains used for validation and corresponding reference materials available from the CDC<xref ref-type="table-fn" rid="T4F1"><sup><italic>a</italic></sup></xref></p></caption><table frame="hsides" rules="groups"><colgroup span="1"><col align="left" span="1"/><col align="left" span="1"/><col align="left" span="1"/><col align="left" span="1"/><col align="left" span="1"/><col align="left" span="1"/></colgroup><thead><tr valign="bottom"><th rowspan="2" align="left" colspan="1">MDL ID</th><th rowspan="2" align="left" colspan="1">Species</th><th colspan="2" align="left" rowspan="1">Reference raw reads generated by the CDC<hr/></th><th colspan="2" align="left" rowspan="1">Reference genome used for mapping<hr/></th></tr><tr valign="bottom"><th align="left" rowspan="1" colspan="1">CDC strain</th><th align="left" rowspan="1" colspan="1">GenBank accession no.</th><th align="left" rowspan="1" colspan="1">NCBI strain</th><th align="left" rowspan="1" colspan="1">NCBI accession no.</th></tr></thead><tbody><tr valign="top"><td rowspan="1" colspan="1">C72</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Escherichia coli</named-content> O121:H19</td><td rowspan="1" colspan="1"><bold>2014C-3857</bold></td><td rowspan="1" colspan="1"><bold><ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/sra/?term=SRR1610033">SRR1610033</ext-link></bold></td><td rowspan="1" colspan="1">2011C-3493</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NC_018658">NC_018658</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">C73</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Salmonella enterica</named-content> serovar Enteritidis</td><td rowspan="1" colspan="1"><bold>CDC_2010K-1543</bold></td><td rowspan="1" colspan="1"><bold><ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/sra/?term=SRR518749">SRR518749</ext-link></bold></td><td rowspan="1" colspan="1">P125109</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NC_011294.1">NC_011294.1</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">C74</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Salmonella enterica</named-content> serovar Infantis</td><td rowspan="1" colspan="1"><bold>2014K-0434</bold></td><td rowspan="1" colspan="1"><bold><ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/sra/?term=SRR1616809">SRR1616809</ext-link></bold></td><td rowspan="1" colspan="1">1326/28</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NZ_LN649235">NZ_LN649235</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">C75</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Salmonella enterica</named-content> serovar Adelaide</td><td rowspan="1" colspan="1"><bold>2014K-0941</bold></td><td rowspan="1" colspan="1"><bold><ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/sra/?term=SRR1686419">SRR1686419</ext-link></bold></td><td rowspan="1" colspan="1">P125109</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NC_011294.1">NC_011294.1</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">C76</td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Salmonella enterica</named-content> serovar Worthington</td><td rowspan="1" colspan="1"><bold>2012K-1219</bold></td><td rowspan="1" colspan="1"><bold><ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/sra/?term=SRR1614868">SRR1614868</ext-link></bold></td><td rowspan="1" colspan="1">P125109</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NC_011294.1">NC_011294.1</ext-link></td></tr><tr valign="top"><td rowspan="1" colspan="1">C77<xref ref-type="table-fn" rid="T4F2"><sup><italic>b</italic></sup></xref></td><td rowspan="1" colspan="1"><named-content content-type="genus-species">Salmonella enterica</named-content> serovar Saintpaul</td><td rowspan="1" colspan="1"><bold>2014K-0875</bold></td><td rowspan="1" colspan="1"><bold><ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/sra/?term=SRR1640105">SRR1640105</ext-link></bold></td><td rowspan="1" colspan="1">14028S</td><td rowspan="1" colspan="1"><ext-link ext-link-type="gen" xlink:href="NC_016856">NC_016856</ext-link></td></tr></tbody></table><table-wrap-foot><fn fn-type="other" id="T4F1"><label>a</label><p>Boldface type indicates the reference strains for which genomes are available from the NCBI database. MDL, Microbial Diseases Laboratory, California Department of Public Health, Richmond, CA.</p></fn><fn fn-type="other" id="T4F2"><label>b</label><p>Sample C77 was sequenced by the MDL only for genotyping assay accuracy validation. No replicates were done.</p></fn></table-wrap-foot></table-wrap></sec><sec id="sec4-2"><title>Reference whole genomes.</title><p>The genome sequences of ATCC strains, isolates characterized by the CDC, and other representative isolates were downloaded from the NCBI database (<ext-link ext-link-type="uri" xlink:href="http://www.ncbi.nlm.nih.gov/genome/">http://www.ncbi.nlm.nih.gov/genome/</ext-link>) to be used as a reference according to CLSI guidelines (<xref rid="B37" ref-type="bibr">37</xref>) (<xref ref-type="table" rid="T3">Tables 3</xref> and <xref ref-type="table" rid="T4">4</xref>).</p></sec><sec id="sec4-3"><title>WGS wet-bench workflow.</title><p>Whole-genome sequencing was performed on an Illumina MiSeq sequencer (see Fig. S1 in the supplemental material). The Nextera XT library preparation procedure and 2 &#x000d7; 300-cycle MiSeq sequencing kits were used (Illumina Inc., San Diego, CA, USA). Illumina Nextera XT indexes were used for barcoding. Bacterial DNA was extracted by using the Wizard Genomic DNA kit (Promega, Madison, WI, USA). The bacterial DNA concentrations were measured by using Qubit fluorometric quantitation with a Qubit double-stranded RNA (dsDNA) BR assay kit (Thermo Fisher Scientific, Waltham, MA, USA). DNA purity was estimated by using a NanoDrop 2000 UV-visible (UV-Vis) spectrophotometer (NanoDrop Products, Wilmington, DE, USA). The Mastercycler nexus was used for incubation for tagmentation reaction and PCR (Eppendorf North America, Hauppauge, NY, USA). The library concentration was measured by using a Qubit HS kit. The DNA library size distribution was estimated by using a 2100 BioAnalyzer instrument and a High Sensitivity DNA analysis kit (Agilent Technologies, Santa Clara, CA, USA). Ampure beads were used for size selection. Manual normalization of libraries was performed. Genome sizes of samples pooled in one run were taken into account to ensure the equal representation of each genome per flow cell. The number of samples was kept at 16 per run (except for one run with 19 samples), resulting in a total genome load of 48 to 84 Mb/run (average, 69 Mb/run). The PhiX Control V3 sequencing control at 1% was spiked in for every sequencing run (Illumina Inc., San Diego, CA, USA). Genomes were generated with a depth of coverage (estimated for trimmed mapped reads) in the range of 15.71&#x000d7; to 216.4&#x000d7; (average, 79.72&#x000d7;; median, 71.55&#x000d7;).</p><p>The preliminary run and raw data acceptance criteria were set for (i) the percentage of bases with a &#x02265;Q30 quality score for the run of &#x02265;50%, with the requirement that the Q30 score for the generated genome sequences must be &#x02265;75% for at least 80 bp of the read length; (ii) a minimum average coverage of the genome of 10&#x000d7;; (iii) a PhiX error rate that must be &#x0003c;6%; and (iv) the negative control meeting the following parameters: &#x0003c;10,000 reads after trimming, an N50 value of &#x0003c;1,000, and the highest coverage of <italic>de novo</italic>-assembled contigs of &#x0003c;10&#x000d7;. Data which did not meet these quality parameters were rejected. The preliminary quality thresholds were further adjusted based on the quality ranges observed during validation.</p></sec><sec id="sec4-4"><title>Bioinformatics pipeline.</title><p>Paired-end reads were quality trimmed with the threshold of Q30 and then used for mapping to the reference and <italic>de novo</italic> assemblies on CLCbio Genomic Workbench 8.0.2 (Qiagen, Aarhus, Denmark). The BAM files generated after mapping to the reference genome were taken through series of software suites to generate the phylogenetic tree. A customized shell script was created to automate the subsequent steps after mapping, which included (i) calculating the genotype likelihood using SAMtools mpileup (v.1.2) (<xref rid="B52" ref-type="bibr">52</xref>), (ii) conversion to a VCF matrix using bcftools view (v0.1.19; <ext-link ext-link-type="uri" xlink:href="http://samtools.github.io/bcftools/">http://samtools.github.io/bcftools/</ext-link>), (iii) single nucleotide polymorphism calling in coding and noncoding genome areas using bcftools call &#x02014;c (v0.1.19), (iv) variant parsing using vcftools (v.0.1.12b) (<xref rid="B53" ref-type="bibr">53</xref>) to include only hqSNPs with a mean coverage of &#x02265;30&#x000d7; and a minimum SNP quality (min<italic>Q</italic>) of &#x02265;200 with indels and heterozygote calls excluded and the allele frequency defining a homozygous call at &#x02265;93%, and (v) conversion of the SNP matrix to a FASTA alignment file for export back to CLCbio Genomic Workbench 8.0.2 for the generation of the phylogenetic tree.</p></sec><sec id="sec4-5"><title>hqSNP-based genotyping.</title><p>Maximum likelihood phylogenetic trees were generated based on hqSNPs under the Jukes-Cantor nucleotide substitution model; bootstrapping included 100 replicates.</p></sec><sec id="sec4-6"><title>16S rRNA gene-based identification.</title><p>Genomes were annotated with prokka v1.1 (<xref rid="B54" ref-type="bibr">54</xref>), and species identification was performed by comparing 16S rRNA gene sequences against data in the Ribosomal Database Project (RDP) database (<xref rid="B55" ref-type="bibr">55</xref>).</p></sec><sec id="sec4-7"><title><italic>In silico</italic> MLST.</title><p><italic>In silico</italic> MLST was performed by using the Center for Genomic Epidemiology (CGE) online tool (<xref rid="B56" ref-type="bibr">56</xref>).</p></sec><sec id="sec4-8"><title>Detection of antibiotic resistance genes.</title><p>ABR gene detection was performed by using the CGE ResFinder online resource (<xref rid="B57" ref-type="bibr">57</xref>) with 99% identity and 100% query length coverage thresholds. Two sets of sequences were analyzed. For the first set, ATCC reference bacterial strains designated for use as antibiotic susceptibility controls were sequenced by the laboratory performing the validation. Negative controls were chosen among strains described as being susceptible, with no known antibiotic resistance genes, according to CLSI document M100-S25 (<xref rid="B58" ref-type="bibr">58</xref>). Positive controls were chosen among strains that possessed resistance determinants, according to CLSI document M100-S25. For the second set, reference sequences were acquired from the FDA-CDC Antimicrobial Resistance Isolate Bank (<ext-link ext-link-type="uri" xlink:href="https://www.cdc.gov/drugresistance/resistance-bank/">https://www.cdc.gov/drugresistance/resistance-bank/</ext-link>) for <italic>in silico</italic> testing. Thirteen isolates (7 Gram-negative and 6 Gram-positive organisms) with various resistance genes were analyzed.</p></sec><sec id="sec4-9"><title>Validation plan.</title><p>Thirty-four bacterial isolates were sequenced in triplicate. The WGS pipeline, including wet- and dry-bench processes, was validated by assessing the performances of the platform, specific WGS-based assays, and the bioinformatics analysis pipeline (<xref ref-type="fig" rid="F1">Fig. 1</xref>). Validated WGS assays included genome-wide SNP-based genotyping, MLST, 16S rRNA species identification, and antibiotic resistance gene detection. The following performance characteristics were assessed: accuracy, reproducibility (between-run precision), repeatability (within-run precision), analytical and diagnostic sensitivity, and analytical and diagnostic specificity.</p><p>For reproducibility assessments, all between-run replicates were generated starting from fresh cultures except for <named-content content-type="genus-species">M. tuberculosis</named-content>, where DNA samples were used. Between-run replicates were processed on three different days, alternating between two operators, as recommended in CLSI document MM11A (<xref rid="B59" ref-type="bibr">59</xref>). For repeatability, within-run replicates were prepared from one DNA extract, but independent library preparations were done, with the final samples being included in one sequencing run. The reproducibility and repeatability of the genotyping assay were evaluated with two methods: (i) evaluation of the reproducibility and repeatability of SNP calling per replicate takes into account the whole genome of one replicate as a single test, meaning that any number of single nucleotide changes in one out of three of the replicates was regarded as 33.3% disagreement for that validation sample, and (ii) the reproducibility and repeatability of SNP calling relative to the genome size were calculated by taking a discordant SNP into account as a percentage of the number of base pairs in the genome (genome size); e.g., a 10-SNP difference among the replicates with 5-Mb genomes equals 99.9998% reproducibility.</p></sec><sec id="sec4-10"><title>Analytical sensitivity.</title><p>The LOD of SNP calling was estimated by modeling different mapping coverages and estimating the minimum coverage that allowed accurate SNP calling (LOD<sub>SNP</sub>). Mapping BAM files were downsampled in order to achieve different coverage values (60&#x000d7;, 50&#x000d7;, 40&#x000d7;, 30&#x000d7;, 20&#x000d7;, 15&#x000d7;, 10&#x000d7;, and 5&#x000d7;) for each of the nine samples representative of the different species. The original sequence mapping coverage was estimated from the BAM file by using the following command: samtools depth bamfile_sorted.bam &#x02223; awk &#x02018;{sum+ = $3} END {print &#x0201c;Average =&#x0201d;,sum/NR}&#x02019; The bam files were downsampled to the desired coverage by using the command samtools view &#x02014;h &#x02014;s F bamfile_sorted.bam &#x0003e;bamfile_sorted_30x.bam, where F is a fraction of the desired coverage in relation to the original coverage. Downsampled BAM files were compared to other replicates for the same sample at their original coverage.</p></sec><sec id="sec4-11"><title>Analytical specificity.</title><p>To determine analytical specificity (interference), we generated <italic>in silico</italic> sequencing files containing a mixture of reads from two different samples, thus mimicking contamination. The effect of potentially interfering sequencing reads on mapping metrics (percentage of reads mapped/not mapped, percentage of the reference sequence covered, etc.) and SNP detection was estimated. Sample C3 (<named-content content-type="genus-species">Escherichia coli</named-content> ATCC 8739) was selected as a &#x0201c;control, not-contaminated sample,&#x0201d; and equal parts of reads from different species were merged with it to generate mixed fastq files. This was done by using the concatenate command, e.g., cat file1 file2 &#x0003e; mixed file. To ensure that the correct file was generated, the numbers of lines in both files and the mixed file were counted. The number of lines in the mixed file was the sum of the numbers of lines in the two parent files. Also, the column headings were checked to ensure that the headers of the mixed file were similar to those of the parent file. &#x0201c;Contaminated&#x0201d; and &#x0201c;not-contaminated&#x0201d; reads were mapped to the same reference genome of <named-content content-type="genus-species">E. coli</named-content> ATCC 8739 from the NCBI. Further SNP calling analysis was performed by using the standardized pipeline under validation. The specificity of SNP calling in the samples containing potentially interfering reads was estimated by comparing (i) the number of SNPs detected between the contaminated sequence and the reference sequence and (ii) the number of SNPs detected between the contaminated and not-contaminated sequences included in the same SNP calling analysis (vcf file after filtering).</p></sec><sec id="sec4-12"><title>Accession number(s).</title><p>Data from this whole-genome shotgun project have been deposited in GenBank under raw read accession numbers <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/sra/?term=SRR4114366">SRR4114366</ext-link> to <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/sra/?term=SRR4114399">SRR4114399</ext-link> and assembly accession numbers <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/MTFS00000000">MTFS00000000</ext-link> to <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/MTGZ00000000">MTGZ00000000</ext-link> (see Table S1 in the supplemental material). All WGS data, including sequences of the replicates, are associated with BioProject accession number <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA341407">PRJNA341407</ext-link>.</p></sec></sec>
<sec sec-type="supplementary-material">
<title>Supplementary Material</title>
<supplementary-material id="PMC_1" content-type="local-data">
<caption>
<title>Supplemental material</title>
</caption>
<media mimetype="text" mime-subtype="html" xlink:href="supp_55_8_2502__index.html"/>
<media xlink:role="associated-file" mimetype="application" mime-subtype="pdf"
xlink:href="JCM.00361-17_zjm999095603s1.pdf"/>
</supplementary-material>
</sec>
</body><back><fn-group><fn fn-type="supplementary-material"><p>Supplemental material for this article may be found at <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1128/JCM.00361-17">https://doi.org/10.1128/JCM.00361-17</ext-link>.</p></fn></fn-group><ack><title>ACKNOWLEDGMENTS</title><p>This study was supported in part by grant 6NU50CK000410-03 from the HHS, U.S. Centers for Disease Control and Prevention.</p><p>We are thankful to Eija Trees from the Centers for Disease Control and Prevention for assistance in acquiring reference sequences for the comparative study.</p><p>V.K.K. and V.C. were responsible for conceptualization, data analysis, and the original draft; V.K.K., C.-L.T., A.L.G., and J.C. performed data collection and analysis; R.M. performed software coding; V.K.K., C.-L.T., R.M., A.L.G., J.C., and V.C. reviewed the draft; and V.C. was responsible for project administration and supervision.</p></ack><ref-list><title>REFERENCES</title><ref id="B1"><label>1.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Didelot</surname><given-names>X</given-names></name>, <name name-style="western"><surname>Bowden</surname><given-names>R</given-names></name>, <name name-style="western"><surname>Wilson</surname><given-names>DJ</given-names></name>, <name name-style="western"><surname>Peto</surname><given-names>TEA</given-names></name>, <name name-style="western"><surname>Crook</surname><given-names>DW</given-names></name></person-group>
<year>2012</year>
<article-title>Transforming clinical microbiology with bacterial genome sequencing</article-title>. <source>Nat Rev Genet</source>
<volume>13</volume>:<fpage>601</fpage>&#x02013;<lpage>612</lpage>. doi:<pub-id pub-id-type="doi">10.1038/nrg3226</pub-id>.<pub-id pub-id-type="pmid">22868263</pub-id></mixed-citation></ref><ref id="B2"><label>2.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>K&#x000f6;ser</surname><given-names>CU</given-names></name>, <name name-style="western"><surname>Ellington</surname><given-names>MJ</given-names></name>, <name name-style="western"><surname>Cartwright</surname><given-names>EJ</given-names></name>, <name name-style="western"><surname>Gillespie</surname><given-names>SH</given-names></name>, <name name-style="western"><surname>Brown</surname><given-names>NM</given-names></name>, <name name-style="western"><surname>Farrington</surname><given-names>M</given-names></name>, <name name-style="western"><surname>Holden</surname><given-names>MT</given-names></name>, <name name-style="western"><surname>Dougan</surname><given-names>G</given-names></name>, <name name-style="western"><surname>Bentley</surname><given-names>SD</given-names></name>, <name name-style="western"><surname>Parkhill</surname><given-names>J</given-names></name></person-group>
<year>2012</year>
<article-title>Routine use of microbial whole genome sequencing in diagnostic and public health microbiology</article-title>. <source>PLoS Pathog</source>
<volume>8</volume>:<elocation-id>e1002824</elocation-id>. doi:<pub-id pub-id-type="doi">10.1371/journal.ppat.1002824</pub-id>.<pub-id pub-id-type="pmid">22876174</pub-id></mixed-citation></ref><ref id="B3"><label>3.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Grad</surname><given-names>YH</given-names></name>, <name name-style="western"><surname>Lipsitch</surname><given-names>M</given-names></name>, <name name-style="western"><surname>Feldgarden</surname><given-names>M</given-names></name>, <name name-style="western"><surname>Arachchi</surname><given-names>HM</given-names></name>, <name name-style="western"><surname>Cerqueira</surname><given-names>GC</given-names></name>, <name name-style="western"><surname>FitzGerald</surname><given-names>M</given-names></name>, <name name-style="western"><surname>Godfrey</surname><given-names>P</given-names></name>, <name name-style="western"><surname>Haas</surname><given-names>BJ</given-names></name>, <name name-style="western"><surname>Murphy</surname><given-names>CI</given-names></name>, <name name-style="western"><surname>Russ</surname><given-names>C</given-names></name>, <name name-style="western"><surname>Sykes</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Walker</surname><given-names>BJ</given-names></name>, <name name-style="western"><surname>Wortman</surname><given-names>JR</given-names></name>, <name name-style="western"><surname>Young</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Zeng</surname><given-names>Q</given-names></name>, <name name-style="western"><surname>Abouelleil</surname><given-names>A</given-names></name>, <name name-style="western"><surname>Bochicchio</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Chauvin</surname><given-names>S</given-names></name>, <name name-style="western"><surname>DeSmet</surname><given-names>T</given-names></name>, <name name-style="western"><surname>Gujja</surname><given-names>S</given-names></name>, <name name-style="western"><surname>McCowan</surname><given-names>C</given-names></name>, <name name-style="western"><surname>Montmayeur</surname><given-names>A</given-names></name>, <name name-style="western"><surname>Steelman</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Frimodt-M&#x000f8;ller</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Petersen</surname><given-names>AM</given-names></name>, <name name-style="western"><surname>Struve</surname><given-names>C</given-names></name>, <name name-style="western"><surname>Krogfelt</surname><given-names>KA</given-names></name>, <name name-style="western"><surname>Bingen</surname><given-names>E</given-names></name>, <name name-style="western"><surname>Weill</surname><given-names>F-X</given-names></name>, <name name-style="western"><surname>Lander</surname><given-names>ES</given-names></name>, <name name-style="western"><surname>Nusbaum</surname><given-names>C</given-names></name>, <name name-style="western"><surname>Birren</surname><given-names>BW</given-names></name>, <name name-style="western"><surname>Hung</surname><given-names>DT</given-names></name>, <name name-style="western"><surname>Hanage</surname><given-names>WP</given-names></name></person-group>
<year>2012</year>
<article-title>Genomic epidemiology of the <italic>Escherichia coli</italic> O104:H4 outbreaks in Europe, 2011</article-title>. <source>Proc Natl Acad Sci U S A</source>
<volume>109</volume>:<fpage>3065</fpage>&#x02013;<lpage>3070</lpage>. doi:<pub-id pub-id-type="doi">10.1073/pnas.1121491109</pub-id>.<pub-id pub-id-type="pmid">22315421</pub-id></mixed-citation></ref><ref id="B4"><label>4.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>McGann</surname><given-names>P</given-names></name>, <name name-style="western"><surname>Bunin</surname><given-names>JL</given-names></name>, <name name-style="western"><surname>Snesrud</surname><given-names>E</given-names></name>, <name name-style="western"><surname>Singh</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Maybank</surname><given-names>R</given-names></name>, <name name-style="western"><surname>Ong</surname><given-names>AC</given-names></name>, <name name-style="western"><surname>Kwak</surname><given-names>YI</given-names></name>, <name name-style="western"><surname>Seronello</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Clifford</surname><given-names>RJ</given-names></name>, <name name-style="western"><surname>Hinkle</surname><given-names>M</given-names></name>, <name name-style="western"><surname>Yamada</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Barnhill</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Lesho</surname><given-names>E</given-names></name></person-group>
<year>2016</year>
<article-title>Real time application of whole genome sequencing for outbreak investigation&#x02014;what is an achievable turnaround time?</article-title>
<source>Diagn Microbiol Infect Dis</source>
<volume>85</volume>:<fpage>277</fpage>&#x02013;<lpage>282</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.diagmicrobio.2016.04.020</pub-id>.<pub-id pub-id-type="pmid">27185645</pub-id></mixed-citation></ref><ref id="B5"><label>5.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Gordon</surname><given-names>NC</given-names></name>, <name name-style="western"><surname>Price</surname><given-names>JR</given-names></name>, <name name-style="western"><surname>Cole</surname><given-names>K</given-names></name>, <name name-style="western"><surname>Everitt</surname><given-names>R</given-names></name>, <name name-style="western"><surname>Morgan</surname><given-names>M</given-names></name>, <name name-style="western"><surname>Finney</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Kearns</surname><given-names>AM</given-names></name>, <name name-style="western"><surname>Pichon</surname><given-names>B</given-names></name>, <name name-style="western"><surname>Young</surname><given-names>B</given-names></name>, <name name-style="western"><surname>Wilson</surname><given-names>DJ</given-names></name>, <name name-style="western"><surname>Llewelyn</surname><given-names>MJ</given-names></name>, <name name-style="western"><surname>Paul</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Peto</surname><given-names>TEA</given-names></name>, <name name-style="western"><surname>Crook</surname><given-names>DW</given-names></name>, <name name-style="western"><surname>Walker</surname><given-names>AS</given-names></name>, <name name-style="western"><surname>Golubchik</surname><given-names>T</given-names></name></person-group>
<year>2014</year>
<article-title>Prediction of <italic>Staphylococcus aureus</italic> antimicrobial resistance by whole-genome sequencing</article-title>. <source>J Clin Microbiol</source>
<volume>52</volume>:<fpage>1182</fpage>&#x02013;<lpage>1191</lpage>. doi:<pub-id pub-id-type="doi">10.1128/JCM.03117-13</pub-id>.<pub-id pub-id-type="pmid">24501024</pub-id></mixed-citation></ref><ref id="B6"><label>6.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Leopold</surname><given-names>SR</given-names></name>, <name name-style="western"><surname>Goering</surname><given-names>RV</given-names></name>, <name name-style="western"><surname>Witten</surname><given-names>A</given-names></name>, <name name-style="western"><surname>Harmsen</surname><given-names>D</given-names></name>, <name name-style="western"><surname>Mellmann</surname><given-names>A</given-names></name></person-group>
<year>2014</year>
<article-title>Bacterial whole-genome sequencing revisited: portable, scalable, and standardized analysis for typing and detection of virulence and antibiotic resistance genes</article-title>. <source>J Clin Microbiol</source>
<volume>52</volume>:<fpage>2365</fpage>&#x02013;<lpage>2370</lpage>. doi:<pub-id pub-id-type="doi">10.1128/JCM.00262-14</pub-id>.<pub-id pub-id-type="pmid">24759713</pub-id></mixed-citation></ref><ref id="B7"><label>7.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Goodwin</surname><given-names>S</given-names></name>, <name name-style="western"><surname>McPherson</surname><given-names>JD</given-names></name>, <name name-style="western"><surname>McCombie</surname><given-names>WR</given-names></name></person-group>
<year>2016</year>
<article-title>Coming of age: ten years of next-generation sequencing technologies</article-title>. <source>Nat Rev Genet</source>
<volume>17</volume>:<fpage>333</fpage>&#x02013;<lpage>351</lpage>. doi:<pub-id pub-id-type="doi">10.1038/nrg.2016.49</pub-id>.<pub-id pub-id-type="pmid">27184599</pub-id></mixed-citation></ref><ref id="B8"><label>8.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Roetzer</surname><given-names>A</given-names></name>, <name name-style="western"><surname>Diel</surname><given-names>R</given-names></name>, <name name-style="western"><surname>Kohl</surname><given-names>TA</given-names></name>, <name name-style="western"><surname>R&#x000fc;ckert</surname><given-names>C</given-names></name>, <name name-style="western"><surname>N&#x000fc;bel</surname><given-names>U</given-names></name>, <name name-style="western"><surname>Blom</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Wirth</surname><given-names>T</given-names></name>, <name name-style="western"><surname>Jaenicke</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Schuback</surname><given-names>S</given-names></name>, <name name-style="western"><surname>R&#x000fc;sch-Gerdes</surname><given-names>S</given-names></name></person-group>
<year>2013</year>
<article-title>Whole genome sequencing versus traditional genotyping for investigation of a <italic>Mycobacterium tuberculosis</italic> outbreak: a longitudinal molecular epidemiological study</article-title>. <source>PLoS Med</source>
<volume>10</volume>:<elocation-id>e1001387</elocation-id>. doi:<pub-id pub-id-type="doi">10.1371/journal.pmed.1001387</pub-id>.<pub-id pub-id-type="pmid">23424287</pub-id></mixed-citation></ref><ref id="B9"><label>9.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Gardy</surname><given-names>JL</given-names></name>, <name name-style="western"><surname>Johnston</surname><given-names>JC</given-names></name>, <name name-style="western"><surname>Sui</surname><given-names>SJH</given-names></name>, <name name-style="western"><surname>Cook</surname><given-names>VJ</given-names></name>, <name name-style="western"><surname>Shah</surname><given-names>L</given-names></name>, <name name-style="western"><surname>Brodkin</surname><given-names>E</given-names></name>, <name name-style="western"><surname>Rempel</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Moore</surname><given-names>R</given-names></name>, <name name-style="western"><surname>Zhao</surname><given-names>Y</given-names></name>, <name name-style="western"><surname>Holt</surname><given-names>R</given-names></name>, <name name-style="western"><surname>Varhol</surname><given-names>R</given-names></name>, <name name-style="western"><surname>Birol</surname><given-names>I</given-names></name>, <name name-style="western"><surname>Lem</surname><given-names>M</given-names></name>, <name name-style="western"><surname>Sharma</surname><given-names>MK</given-names></name>, <name name-style="western"><surname>Elwood</surname><given-names>K</given-names></name>, <name name-style="western"><surname>Jones</surname><given-names>SJM</given-names></name>, <name name-style="western"><surname>Brinkman</surname><given-names>FSL</given-names></name>, <name name-style="western"><surname>Brunham</surname><given-names>RC</given-names></name>, <name name-style="western"><surname>Tang</surname><given-names>P</given-names></name></person-group>
<year>2011</year>
<article-title>Whole-genome sequencing and social-network analysis of a tuberculosis outbreak</article-title>. <source>N Engl J Med</source>
<volume>364</volume>:<fpage>730</fpage>&#x02013;<lpage>739</lpage>. doi:<pub-id pub-id-type="doi">10.1056/NEJMoa1003176</pub-id>.<pub-id pub-id-type="pmid">21345102</pub-id></mixed-citation></ref><ref id="B10"><label>10.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Eyre</surname><given-names>DW</given-names></name>, <name name-style="western"><surname>Cule</surname><given-names>ML</given-names></name>, <name name-style="western"><surname>Wilson</surname><given-names>DJ</given-names></name>, <name name-style="western"><surname>Griffiths</surname><given-names>D</given-names></name>, <name name-style="western"><surname>Vaughan</surname><given-names>A</given-names></name>, <name name-style="western"><surname>O'Connor</surname><given-names>L</given-names></name>, <name name-style="western"><surname>Ip</surname><given-names>CLC</given-names></name>, <name name-style="western"><surname>Golubchik</surname><given-names>T</given-names></name>, <name name-style="western"><surname>Batty</surname><given-names>EM</given-names></name>, <name name-style="western"><surname>Finney</surname><given-names>JM</given-names></name>, <name name-style="western"><surname>Wyllie</surname><given-names>DH</given-names></name>, <name name-style="western"><surname>Didelot</surname><given-names>X</given-names></name>, <name name-style="western"><surname>Piazza</surname><given-names>P</given-names></name>, <name name-style="western"><surname>Bowden</surname><given-names>R</given-names></name>, <name name-style="western"><surname>Dingle</surname><given-names>KE</given-names></name>, <name name-style="western"><surname>Harding</surname><given-names>RM</given-names></name>, <name name-style="western"><surname>Crook</surname><given-names>DW</given-names></name>, <name name-style="western"><surname>Wilcox</surname><given-names>MH</given-names></name>, <name name-style="western"><surname>Peto</surname><given-names>TEA</given-names></name>, <name name-style="western"><surname>Walker</surname><given-names>AS</given-names></name></person-group>
<year>2013</year>
<article-title>Diverse sources of <italic>C. difficile</italic> infection identified on whole-genome sequencing</article-title>. <source>N Engl J Med</source>
<volume>369</volume>:<fpage>1195</fpage>&#x02013;<lpage>1205</lpage>. doi:<pub-id pub-id-type="doi">10.1056/NEJMoa1216064</pub-id>.<pub-id pub-id-type="pmid">24066741</pub-id></mixed-citation></ref><ref id="B11"><label>11.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Etienne</surname><given-names>KA</given-names></name>, <name name-style="western"><surname>Roe</surname><given-names>CC</given-names></name>, <name name-style="western"><surname>Smith</surname><given-names>RM</given-names></name>, <name name-style="western"><surname>Vallabhaneni</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Duarte</surname><given-names>C</given-names></name>, <name name-style="western"><surname>Escand&#x000f3;n</surname><given-names>P</given-names></name></person-group>, <string-name><surname>Casta&#x000f1;eda</surname>
<given-names>E</given-names></string-name>, <string-name><surname>G&#x000f3;mez</surname>
<given-names>BL</given-names></string-name>, <string-name><surname>de Bedout</surname>
<given-names>C</given-names></string-name>, <string-name><surname>L&#x000f3;pez</surname>
<given-names>LF</given-names></string-name>, <string-name><surname>Salas</surname>
<given-names>V</given-names></string-name>, <string-name><surname>Hederra</surname>
<given-names>LM</given-names></string-name>, <string-name><surname>Fern&#x000e1;ndez</surname>
<given-names>J</given-names></string-name>, <string-name><surname>Pidal</surname>
<given-names>P</given-names></string-name>, <string-name><surname>Hormazabel</surname>
<given-names>JC</given-names></string-name>, <string-name><surname>Ota&#x000ed;za-O'Ryan</surname>
<given-names>F</given-names></string-name>, <string-name><surname>Vannberg</surname>
<given-names>FO</given-names></string-name>, <string-name><surname>Gillece</surname>
<given-names>J</given-names></string-name>, <string-name><surname>Lemmer</surname>
<given-names>D</given-names></string-name>, <string-name><surname>Driebe</surname>
<given-names>EM</given-names></string-name>, <string-name><surname>Engelthaler</surname>
<given-names>DM</given-names></string-name>, <string-name><surname>Litvintseva</surname>
<given-names>AP</given-names></string-name>. <year>2016</year>
<article-title>Whole-genome sequencing to determine origin of multinational outbreak of <italic>Sarocladium kiliense</italic> bloodstream infections</article-title>. <source>Emerg Infect Dis</source>
<volume>22</volume>:<fpage>476</fpage>&#x02013;<lpage>481</lpage>. doi:<pub-id pub-id-type="doi">10.3201/eid2203.151193</pub-id>.<pub-id pub-id-type="pmid">26891230</pub-id></mixed-citation></ref><ref id="B12"><label>12.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Pallen</surname><given-names>MJ</given-names></name></person-group>
<year>2014</year>
<article-title>Diagnostic metagenomics: potential applications to bacterial, viral and parasitic infections</article-title>. <source>Parasitology</source>
<volume>141</volume>:<fpage>1856</fpage>&#x02013;<lpage>1862</lpage>. doi:<pub-id pub-id-type="doi">10.1017/S0031182014000134</pub-id>.<pub-id pub-id-type="pmid">24576467</pub-id></mixed-citation></ref><ref id="B13"><label>13.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Onderdonk</surname><given-names>AB</given-names></name>, <name name-style="western"><surname>Delaney</surname><given-names>ML</given-names></name>, <name name-style="western"><surname>Fichorova</surname><given-names>RN</given-names></name></person-group>
<year>2016</year>
<article-title>The human microbiome during bacterial vaginosis</article-title>. <source>Clin Microbiol Rev</source>
<volume>29</volume>:<fpage>223</fpage>&#x02013;<lpage>238</lpage>. doi:<pub-id pub-id-type="doi">10.1128/CMR.00075-15</pub-id>.<pub-id pub-id-type="pmid">26864580</pub-id></mixed-citation></ref><ref id="B14"><label>14.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Pearce</surname><given-names>MM</given-names></name>, <name name-style="western"><surname>Hilt</surname><given-names>EE</given-names></name>, <name name-style="western"><surname>Rosenfeld</surname><given-names>AB</given-names></name>, <name name-style="western"><surname>Zilliox</surname><given-names>MJ</given-names></name>, <name name-style="western"><surname>Thomas-White</surname><given-names>K</given-names></name>, <name name-style="western"><surname>Fok</surname><given-names>C</given-names></name>, <name name-style="western"><surname>Kliethermes</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Schreckenberger</surname><given-names>PC</given-names></name>, <name name-style="western"><surname>Brubaker</surname><given-names>L</given-names></name>, <name name-style="western"><surname>Gai</surname><given-names>X</given-names></name>, <name name-style="western"><surname>Wolfe</surname><given-names>AJ</given-names></name></person-group>
<year>2014</year>
<article-title>The female urinary microbiome: a comparison of women with and without urgency urinary incontinence</article-title>. <source>mBio</source>
<volume>5</volume>:<elocation-id>e01283-14</elocation-id>. doi:<pub-id pub-id-type="doi">10.1128/mBio.01283-14</pub-id>.<pub-id pub-id-type="pmid">25006228</pub-id></mixed-citation></ref><ref id="B15"><label>15.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Schubert</surname><given-names>AM</given-names></name>, <name name-style="western"><surname>Rogers</surname><given-names>MAM</given-names></name>, <name name-style="western"><surname>Ring</surname><given-names>C</given-names></name>, <name name-style="western"><surname>Mogle</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Petrosino</surname><given-names>JP</given-names></name>, <name name-style="western"><surname>Young</surname><given-names>VB</given-names></name>, <name name-style="western"><surname>Aronoff</surname><given-names>DM</given-names></name>, <name name-style="western"><surname>Schloss</surname><given-names>PD</given-names></name></person-group>
<year>2014</year>
<article-title>Microbiome data distinguish patients with <italic>Clostridium difficile</italic> infection and non-<italic>C. difficile</italic>-associated diarrhea from healthy controls</article-title>. <source>mBio</source>
<volume>5</volume>:<elocation-id>e01021-14</elocation-id>. doi:<pub-id pub-id-type="doi">10.1128/mBio.01021-14</pub-id>.<pub-id pub-id-type="pmid">24803517</pub-id></mixed-citation></ref><ref id="B16"><label>16.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Naccache</surname><given-names>SN</given-names></name>, <name name-style="western"><surname>Federman</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Veeraraghavan</surname><given-names>N</given-names></name>, <name name-style="western"><surname>Zaharia</surname><given-names>M</given-names></name>, <name name-style="western"><surname>Lee</surname><given-names>D</given-names></name>, <name name-style="western"><surname>Samayoa</surname><given-names>E</given-names></name>, <name name-style="western"><surname>Bouquet</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Greninger</surname><given-names>AL</given-names></name>, <name name-style="western"><surname>Luk</surname><given-names>K-C</given-names></name>, <name name-style="western"><surname>Enge</surname><given-names>B</given-names></name>, <name name-style="western"><surname>Wadford</surname><given-names>DA</given-names></name>, <name name-style="western"><surname>Messenger</surname><given-names>SL</given-names></name>, <name name-style="western"><surname>Genrich</surname><given-names>GL</given-names></name>, <name name-style="western"><surname>Pellegrino</surname><given-names>K</given-names></name>, <name name-style="western"><surname>Grard</surname><given-names>G</given-names></name>, <name name-style="western"><surname>Leroy</surname><given-names>E</given-names></name>, <name name-style="western"><surname>Schneider</surname><given-names>BS</given-names></name>, <name name-style="western"><surname>Fair</surname><given-names>JN</given-names></name>, <name name-style="western"><surname>Mart&#x000ed;nez</surname><given-names>MA</given-names></name>, <name name-style="western"><surname>Isa</surname><given-names>P</given-names></name>, <name name-style="western"><surname>Crump</surname><given-names>JA</given-names></name>, <name name-style="western"><surname>DeRisi</surname><given-names>JL</given-names></name>, <name name-style="western"><surname>Sittler</surname><given-names>T</given-names></name>, <name name-style="western"><surname>Hackett</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Miller</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Chiu</surname><given-names>CY</given-names></name></person-group>
<year>2014</year>
<article-title>A cloud-compatible bioinformatics pipeline for ultrarapid pathogen identification from next-generation sequencing of clinical samples</article-title>. <source>Genome Res</source>
<volume>24</volume>:<fpage>1180</fpage>&#x02013;<lpage>1192</lpage>. doi:<pub-id pub-id-type="doi">10.1101/gr.171934.113</pub-id>.<pub-id pub-id-type="pmid">24899342</pub-id></mixed-citation></ref><ref id="B17"><label>17.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Hasman</surname><given-names>H</given-names></name>, <name name-style="western"><surname>Saputra</surname><given-names>D</given-names></name>, <name name-style="western"><surname>Sicheritz-Ponten</surname><given-names>T</given-names></name>, <name name-style="western"><surname>Lund</surname><given-names>O</given-names></name>, <name name-style="western"><surname>Svendsen</surname><given-names>CA</given-names></name>, <name name-style="western"><surname>Frimodt-M&#x000f8;ller</surname><given-names>N</given-names></name>, <name name-style="western"><surname>Aarestrup</surname><given-names>FM</given-names></name></person-group>
<year>2014</year>
<article-title>Rapid whole genome sequencing for the detection and characterization of microorganisms directly from clinical samples</article-title>. <source>J Clin Microbiol</source>
<volume>52</volume>:<fpage>139</fpage>&#x02013;<lpage>146</lpage>. doi:<pub-id pub-id-type="doi">10.1128/JCM.02452-13</pub-id>.<pub-id pub-id-type="pmid">24172157</pub-id></mixed-citation></ref><ref id="B18"><label>18.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Loman</surname><given-names>NJ</given-names></name>, <name name-style="western"><surname>Constantinidou</surname><given-names>C</given-names></name>, <name name-style="western"><surname>Christner</surname><given-names>M</given-names></name>, <name name-style="western"><surname>Rohde</surname><given-names>H</given-names></name>, <name name-style="western"><surname>Chan</surname><given-names>JZM</given-names></name>, <name name-style="western"><surname>Quick</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Weir</surname><given-names>JC</given-names></name>, <name name-style="western"><surname>Quince</surname><given-names>C</given-names></name>, <name name-style="western"><surname>Smith</surname><given-names>GP</given-names></name>, <name name-style="western"><surname>Betley</surname><given-names>JR</given-names></name>, <name name-style="western"><surname>Aepfelbacher</surname><given-names>M</given-names></name></person-group>
<year>2013</year>
<article-title>A culture-independent sequence-based metagenomics approach to the investigation of an outbreak of Shiga-toxigenic <italic>Escherichia coli</italic> O104:H4</article-title>. <source>JAMA</source>
<volume>309</volume>:<fpage>1502</fpage>&#x02013;<lpage>1510</lpage>. doi:<pub-id pub-id-type="doi">10.1001/jama.2013.3231</pub-id>.<pub-id pub-id-type="pmid">23571589</pub-id></mixed-citation></ref><ref id="B19"><label>19.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Endrullat</surname><given-names>C</given-names></name>, <name name-style="western"><surname>Gl&#x000f6;kler</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Franke</surname><given-names>P</given-names></name>, <name name-style="western"><surname>Frohme</surname><given-names>M</given-names></name></person-group>
<year>2016</year>
<article-title>Standardization and quality management in next-generation sequencing</article-title>. <source>Appl Transl Genom</source>
<volume>10</volume>:<fpage>2</fpage>&#x02013;<lpage>9</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.atg.2016.06.001</pub-id>.<pub-id pub-id-type="pmid">27668169</pub-id></mixed-citation></ref><ref id="B20"><label>20.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Salto-Tellez</surname><given-names>M</given-names></name>, <name name-style="western"><surname>Gonzalez de Castro</surname><given-names>D</given-names></name></person-group>
<year>2014</year>
<article-title>Next-generation sequencing: a change of paradigm in molecular diagnostic validation</article-title>. <source>J Pathol</source>
<volume>234</volume>:<fpage>5</fpage>&#x02013;<lpage>10</lpage>. doi:<pub-id pub-id-type="doi">10.1002/path.4365</pub-id>.<pub-id pub-id-type="pmid">24756835</pub-id></mixed-citation></ref><ref id="B21"><label>21.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Goldberg</surname><given-names>B</given-names></name>, <name name-style="western"><surname>Sichtig</surname><given-names>H</given-names></name>, <name name-style="western"><surname>Geyer</surname><given-names>C</given-names></name>, <name name-style="western"><surname>Ledeboer</surname><given-names>N</given-names></name>, <name name-style="western"><surname>Weinstock</surname><given-names>GM</given-names></name></person-group>
<year>2015</year>
<article-title>Making the leap from research laboratory to clinic: challenges and opportunities for next-generation sequencing in infectious disease diagnostics</article-title>. <source>mBio</source>
<volume>6</volume>:<elocation-id>e01888-15</elocation-id>. doi:<pub-id pub-id-type="doi">10.1128/mBio.01888-15</pub-id>.<pub-id pub-id-type="pmid">26646014</pub-id></mixed-citation></ref><ref id="B22"><label>22.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Kwong</surname><given-names>JC</given-names></name>, <name name-style="western"><surname>McCallum</surname><given-names>N</given-names></name>, <name name-style="western"><surname>Sintchenko</surname><given-names>V</given-names></name>, <name name-style="western"><surname>Howden</surname><given-names>BP</given-names></name></person-group>
<year>2015</year>
<article-title>Whole genome sequencing in clinical and public health microbiology</article-title>. <source>Pathology</source>
<volume>47</volume>:<fpage>199</fpage>&#x02013;<lpage>210</lpage>. doi:<pub-id pub-id-type="doi">10.1097/PAT.0000000000000235</pub-id>.<pub-id pub-id-type="pmid">25730631</pub-id></mixed-citation></ref><ref id="B23"><label>23.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Luheshi</surname><given-names>LM</given-names></name>, <name name-style="western"><surname>Raza</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Peacock</surname><given-names>SJ</given-names></name></person-group>
<year>2015</year>
<article-title>Moving pathogen genomics out of the lab and into the clinic: what will it take?</article-title>
<source>Genome Med</source>
<volume>7</volume>:<fpage>132</fpage>. doi:<pub-id pub-id-type="doi">10.1186/s13073-015-0254-z</pub-id>.<pub-id pub-id-type="pmid">26719100</pub-id></mixed-citation></ref><ref id="B24"><label>24.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Oliver</surname><given-names>GR</given-names></name>, <name name-style="western"><surname>Hart</surname><given-names>SN</given-names></name>, <name name-style="western"><surname>Klee</surname><given-names>EW</given-names></name></person-group>
<year>2015</year>
<article-title>Bioinformatics for clinical next generation sequencing</article-title>. <source>Clin Chem</source>
<volume>61</volume>:<fpage>124</fpage>&#x02013;<lpage>135</lpage>. doi:<pub-id pub-id-type="doi">10.1373/clinchem.2014.224360</pub-id>.<pub-id pub-id-type="pmid">25451870</pub-id></mixed-citation></ref><ref id="B25"><label>25.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Fricke</surname><given-names>WF</given-names></name>, <name name-style="western"><surname>Rasko</surname><given-names>DA</given-names></name></person-group>
<year>2014</year>
<article-title>Bacterial genome sequencing in the clinic: bioinformatic challenges and solutions</article-title>. <source>Nat Rev Genet</source>
<volume>15</volume>:<fpage>49</fpage>&#x02013;<lpage>55</lpage>. doi:<pub-id pub-id-type="doi">10.1038/nrg3624</pub-id>.<pub-id pub-id-type="pmid">24281148</pub-id></mixed-citation></ref><ref id="B26"><label>26.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Wyres</surname><given-names>KL</given-names></name>, <name name-style="western"><surname>Conway</surname><given-names>TC</given-names></name>, <name name-style="western"><surname>Garg</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Queiroz</surname><given-names>C</given-names></name>, <name name-style="western"><surname>Reumann</surname><given-names>M</given-names></name>, <name name-style="western"><surname>Holt</surname><given-names>K</given-names></name>, <name name-style="western"><surname>Rusu</surname><given-names>LI</given-names></name></person-group>
<year>2014</year>
<article-title>WGS analysis and interpretation in clinical and public health microbiology laboratories: what are the requirements and how do existing tools compare?</article-title>
<source>Pathogens</source>
<volume>3</volume>:<fpage>437</fpage>&#x02013;<lpage>458</lpage>. doi:<pub-id pub-id-type="doi">10.3390/pathogens3020437</pub-id>.<pub-id pub-id-type="pmid">25437808</pub-id></mixed-citation></ref><ref id="B27"><label>27.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Rhoads</surname><given-names>DD</given-names></name>, <name name-style="western"><surname>Sintchenko</surname><given-names>V</given-names></name>, <name name-style="western"><surname>Rauch</surname><given-names>CA</given-names></name>, <name name-style="western"><surname>Pantanowitz</surname><given-names>L</given-names></name></person-group>
<year>2014</year>
<article-title>Clinical microbiology informatics</article-title>. <source>Clin Microbiol Rev</source>
<volume>27</volume>:<fpage>1025</fpage>&#x02013;<lpage>1047</lpage>. doi:<pub-id pub-id-type="doi">10.1128/CMR.00049-14</pub-id>.<pub-id pub-id-type="pmid">25278581</pub-id></mixed-citation></ref><ref id="B28"><label>28.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Gargis</surname><given-names>AS</given-names></name>, <name name-style="western"><surname>Kalman</surname><given-names>L</given-names></name>, <name name-style="western"><surname>Lubin</surname><given-names>IM</given-names></name></person-group>
<year>2016</year>
<article-title>Assuring the quality of next-generation sequencing in clinical microbiology and public health laboratories</article-title>. <source>J Clin Microbiol</source>
<volume>54</volume>:<fpage>2857</fpage>&#x02013;<lpage>2865</lpage>. doi:<pub-id pub-id-type="doi">10.1128/JCM.00949-16</pub-id>.<pub-id pub-id-type="pmid">27510831</pub-id></mixed-citation></ref><ref id="B29"><label>29.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Lefterova</surname><given-names>MI</given-names></name>, <name name-style="western"><surname>Suarez</surname><given-names>CJ</given-names></name>, <name name-style="western"><surname>Banaei</surname><given-names>N</given-names></name>, <name name-style="western"><surname>Pinsky</surname><given-names>BA</given-names></name></person-group>
<year>2015</year>
<article-title>Next-generation sequencing for infectious disease diagnosis and management: a report of the Association for Molecular Pathology</article-title>. <source>J Mol Diagn</source>
<volume>17</volume>:<fpage>623</fpage>&#x02013;<lpage>634</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.jmoldx.2015.07.004</pub-id>.<pub-id pub-id-type="pmid">26433313</pub-id></mixed-citation></ref><ref id="B30"><label>30.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Aziz</surname><given-names>N</given-names></name>, <name name-style="western"><surname>Zhao</surname><given-names>Q</given-names></name>, <name name-style="western"><surname>Bry</surname><given-names>L</given-names></name>, <name name-style="western"><surname>Driscoll</surname><given-names>DK</given-names></name>, <name name-style="western"><surname>Funke</surname><given-names>B</given-names></name>, <name name-style="western"><surname>Gibson</surname><given-names>JS</given-names></name>, <name name-style="western"><surname>Grody</surname><given-names>WW</given-names></name>, <name name-style="western"><surname>Hegde</surname><given-names>MR</given-names></name>, <name name-style="western"><surname>Hoeltge</surname><given-names>GA</given-names></name>, <name name-style="western"><surname>Leonard</surname><given-names>DGB</given-names></name>, <name name-style="western"><surname>Merker</surname><given-names>JD</given-names></name>, <name name-style="western"><surname>Nagarajan</surname><given-names>R</given-names></name>, <name name-style="western"><surname>Palicki</surname><given-names>LA</given-names></name>, <name name-style="western"><surname>Robetorye</surname><given-names>RS</given-names></name>, <name name-style="western"><surname>Schrijver</surname><given-names>I</given-names></name>, <name name-style="western"><surname>Weck</surname><given-names>KE</given-names></name>, <name name-style="western"><surname>Voelkerding</surname><given-names>KV</given-names></name></person-group>
<year>2014</year>
<article-title>College of American Pathologists' laboratory standards for next-generation sequencing clinical tests</article-title>. <source>Arch Pathol Lab Med</source>
<volume>139</volume>:<fpage>481</fpage>&#x02013;<lpage>493</lpage>. doi:<pub-id pub-id-type="doi">10.5858/arpa.2014-0250-CP</pub-id>.<pub-id pub-id-type="pmid">25152313</pub-id></mixed-citation></ref><ref id="B31"><label>31.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Moran-Gilad</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Sintchenko</surname><given-names>V</given-names></name>, <name name-style="western"><surname>Pedersen</surname><given-names>SK</given-names></name>, <name name-style="western"><surname>Wolfgang</surname><given-names>WJ</given-names></name>, <name name-style="western"><surname>Pettengill</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Strain</surname><given-names>E</given-names></name>, <name name-style="western"><surname>Hendriksen</surname><given-names>RS</given-names></name></person-group>
<year>2015</year>
<article-title>Proficiency testing for bacterial whole genome sequencing: an end-user survey of current capabilities, requirements and priorities</article-title>. <source>BMC Infect Dis</source>
<volume>15</volume>:<fpage>174</fpage>. doi:<pub-id pub-id-type="doi">10.1186/s12879-015-0902-3</pub-id>.<pub-id pub-id-type="pmid">25887164</pub-id></mixed-citation></ref><ref id="B32"><label>32.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Olson</surname><given-names>ND</given-names></name>, <name name-style="western"><surname>Jackson</surname><given-names>SA</given-names></name>, <name name-style="western"><surname>Lin</surname><given-names>NJ</given-names></name></person-group>
<year>2015</year>
<article-title>Report from the Standards for Pathogen Identification via Next-Generation Sequencing (SPIN) Workshop</article-title>. <source>Stand Genomic Sci</source>
<volume>10</volume>:<fpage>119</fpage>. doi:<pub-id pub-id-type="doi">10.1186/s40793-015-0112-z</pub-id>.</mixed-citation></ref><ref id="B33"><label>33.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Gargis</surname><given-names>AS</given-names></name>, <name name-style="western"><surname>Kalman</surname><given-names>L</given-names></name>, <name name-style="western"><surname>Berry</surname><given-names>MW</given-names></name>, <name name-style="western"><surname>Bick</surname><given-names>DP</given-names></name>, <name name-style="western"><surname>Dimmock</surname><given-names>DP</given-names></name>, <name name-style="western"><surname>Hambuch</surname><given-names>T</given-names></name>, <name name-style="western"><surname>Lu</surname><given-names>F</given-names></name>, <name name-style="western"><surname>Lyon</surname><given-names>E</given-names></name>, <name name-style="western"><surname>Voelkerding</surname><given-names>KV</given-names></name>, <name name-style="western"><surname>Zehnbauer</surname><given-names>BA</given-names></name></person-group>
<year>2012</year>
<article-title>Assuring the quality of next-generation sequencing in clinical laboratory practice</article-title>. <source>Nat Biotechnol</source>
<volume>30</volume>:<fpage>1033</fpage>&#x02013;<lpage>1036</lpage>. doi:<pub-id pub-id-type="doi">10.1038/nbt.2403</pub-id>.<pub-id pub-id-type="pmid">23138292</pub-id></mixed-citation></ref><ref id="B34"><label>34.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Gargis</surname><given-names>AS</given-names></name>, <name name-style="western"><surname>Kalman</surname><given-names>L</given-names></name>, <name name-style="western"><surname>Bick</surname><given-names>DP</given-names></name>, <name name-style="western"><surname>da Silva</surname><given-names>C</given-names></name>, <name name-style="western"><surname>Dimmock</surname><given-names>DP</given-names></name>, <name name-style="western"><surname>Funke</surname><given-names>BH</given-names></name>, <name name-style="western"><surname>Gowrisankar</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Hegde</surname><given-names>MR</given-names></name>, <name name-style="western"><surname>Kulkarni</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Mason</surname><given-names>CE</given-names></name>, <name name-style="western"><surname>Nagarajan</surname><given-names>R</given-names></name>, <name name-style="western"><surname>Voelkerding</surname><given-names>KV</given-names></name>, <name name-style="western"><surname>Worthey</surname><given-names>EA</given-names></name>, <name name-style="western"><surname>Aziz</surname><given-names>N</given-names></name>, <name name-style="western"><surname>Barnes</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Bennett</surname><given-names>SF</given-names></name>, <name name-style="western"><surname>Bisht</surname><given-names>H</given-names></name>, <name name-style="western"><surname>Church</surname><given-names>DM</given-names></name>, <name name-style="western"><surname>Dimitrova</surname><given-names>Z</given-names></name>, <name name-style="western"><surname>Gargis</surname><given-names>SR</given-names></name>, <name name-style="western"><surname>Hafez</surname><given-names>N</given-names></name>, <name name-style="western"><surname>Hambuch</surname><given-names>T</given-names></name>, <name name-style="western"><surname>Hyland</surname><given-names>FC</given-names></name>, <name name-style="western"><surname>Luna</surname><given-names>RA</given-names></name>, <name name-style="western"><surname>MacCannell</surname><given-names>D</given-names></name>, <name name-style="western"><surname>Mann</surname><given-names>T</given-names></name>, <name name-style="western"><surname>McCluskey</surname><given-names>MR</given-names></name>, <name name-style="western"><surname>McDaniel</surname><given-names>TK</given-names></name>, <name name-style="western"><surname>Ganova-Raeva</surname><given-names>LM</given-names></name>, <name name-style="western"><surname>Rehm</surname><given-names>HL</given-names></name>, <name name-style="western"><surname>Reid</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Campo</surname><given-names>DS</given-names></name>, <name name-style="western"><surname>Resnick</surname><given-names>RB</given-names></name>, <name name-style="western"><surname>Ridge</surname><given-names>PG</given-names></name>, <name name-style="western"><surname>Salit</surname><given-names>ML</given-names></name>, <name name-style="western"><surname>Skums</surname><given-names>P</given-names></name>, <name name-style="western"><surname>Wong</surname><given-names>LJ</given-names></name>, <name name-style="western"><surname>Zehnbauer</surname><given-names>BA</given-names></name>, <name name-style="western"><surname>Zook</surname><given-names>JM</given-names></name>, <name name-style="western"><surname>Lubin</surname><given-names>IM</given-names></name></person-group>
<year>2015</year>
<article-title>Good laboratory practice for clinical next-generation sequencing informatics pipelines</article-title>. <source>Nat Biotechnol</source>
<volume>33</volume>:<fpage>689</fpage>&#x02013;<lpage>693</lpage>. doi:<pub-id pub-id-type="doi">10.1038/nbt.3237</pub-id>.<pub-id pub-id-type="pmid">26154004</pub-id></mixed-citation></ref><ref id="B35"><label>35.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Duncavage</surname><given-names>EJ</given-names></name>, <name name-style="western"><surname>Abel</surname><given-names>HJ</given-names></name>, <name name-style="western"><surname>Merker</surname><given-names>JD</given-names></name>, <name name-style="western"><surname>Bodner</surname><given-names>JB</given-names></name>, <name name-style="western"><surname>Zhao</surname><given-names>Q</given-names></name>, <name name-style="western"><surname>Voelkerding</surname><given-names>KV</given-names></name>, <name name-style="western"><surname>Pfeifer</surname><given-names>JD</given-names></name></person-group>
<year>2016</year>
<article-title>A model study of in silico proficiency testing for clinical next-generation sequencing</article-title>. <source>Arch Pathol Lab Med</source>
<volume>140</volume>:<fpage>1085</fpage>&#x02013;<lpage>1091</lpage>. doi:<pub-id pub-id-type="doi">10.5858/arpa.2016-0194-CP</pub-id>.<pub-id pub-id-type="pmid">27388684</pub-id></mixed-citation></ref><ref id="B36"><label>36.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Mellmann</surname><given-names>A</given-names></name>, <name name-style="western"><surname>Andersen</surname><given-names>PS</given-names></name>, <name name-style="western"><surname>Bletz</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Friedrich</surname><given-names>AW</given-names></name>, <name name-style="western"><surname>Kohl</surname><given-names>TA</given-names></name>, <name name-style="western"><surname>Lilje</surname><given-names>B</given-names></name>, <name name-style="western"><surname>Niemann</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Prior</surname><given-names>K</given-names></name>, <name name-style="western"><surname>Rossen</surname><given-names>JW</given-names></name>, <name name-style="western"><surname>Harmsen</surname><given-names>D</given-names></name></person-group>
<year>2017</year>
<article-title>High interlaboratory reproducibility and accuracy of next-generation-sequencing-based bacterial genotyping in a ring trial</article-title>. <source>J Clin Microbiol</source>
<volume>55</volume>:<fpage>908</fpage>&#x02013;<lpage>913</lpage>. doi:<pub-id pub-id-type="doi">10.1128/JCM.02242-16</pub-id>.<pub-id pub-id-type="pmid">28053217</pub-id></mixed-citation></ref><ref id="B37"><label>37.</label><mixed-citation publication-type="book"><collab>CLSI</collab>. <year>2014</year>
<article-title>Nucleic acid sequencing methods in diagnostic laboratory medicine: approved guideline</article-title>, <edition>2nd ed</edition>
<source>MM09-A2</source>. <publisher-name>CLSI</publisher-name>, <publisher-loc>Wayne, PA</publisher-loc>.</mixed-citation></ref><ref id="B38"><label>38.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Harris</surname><given-names>SR</given-names></name>, <name name-style="western"><surname>Cartwright</surname><given-names>EJ</given-names></name>, <name name-style="western"><surname>Torok</surname><given-names>ME</given-names></name>, <name name-style="western"><surname>Holden</surname><given-names>MT</given-names></name>, <name name-style="western"><surname>Brown</surname><given-names>NM</given-names></name>, <name name-style="western"><surname>Ogilvy-Stuart</surname><given-names>AL</given-names></name>, <name name-style="western"><surname>Ellington</surname><given-names>MJ</given-names></name>, <name name-style="western"><surname>Quail</surname><given-names>MA</given-names></name>, <name name-style="western"><surname>Bentley</surname><given-names>SD</given-names></name>, <name name-style="western"><surname>Parkhill</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Peacock</surname><given-names>SJ</given-names></name></person-group>
<year>2013</year>
<article-title>Whole-genome sequencing for analysis of an outbreak of meticillin-resistant <italic>Staphylococcus aureus</italic>: a descriptive study</article-title>. <source>Lancet Infect Dis</source>
<volume>13</volume>:<fpage>130</fpage>&#x02013;<lpage>136</lpage>. doi:<pub-id pub-id-type="doi">10.1016/S1473-3099(12)70268-2</pub-id>.<pub-id pub-id-type="pmid">23158674</pub-id></mixed-citation></ref><ref id="B39"><label>39.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Leekitcharoenphon</surname><given-names>P</given-names></name>, <name name-style="western"><surname>Nielsen</surname><given-names>EM</given-names></name>, <name name-style="western"><surname>Kaas</surname><given-names>RS</given-names></name>, <name name-style="western"><surname>Lund</surname><given-names>O</given-names></name>, <name name-style="western"><surname>Aarestrup</surname><given-names>FM</given-names></name></person-group>
<year>2014</year>
<article-title>Evaluation of whole genome sequencing for outbreak detection of <italic>Salmonella enterica</italic></article-title>. <source>PLoS One</source>
<volume>9</volume>:<elocation-id>e87991</elocation-id>. doi:<pub-id pub-id-type="doi">10.1371/journal.pone.0087991</pub-id>.<pub-id pub-id-type="pmid">24505344</pub-id></mixed-citation></ref><ref id="B40"><label>40.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Burd</surname><given-names>EM</given-names></name></person-group>
<year>2010</year>
<article-title>Validation of laboratory-developed molecular assays for infectious diseases</article-title>. <source>Clin Microbiol Rev</source>
<volume>23</volume>:<fpage>550</fpage>&#x02013;<lpage>576</lpage>. doi:<pub-id pub-id-type="doi">10.1128/CMR.00074-09</pub-id>.<pub-id pub-id-type="pmid">20610823</pub-id></mixed-citation></ref><ref id="B41"><label>41.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Richards</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Aziz</surname><given-names>N</given-names></name>, <name name-style="western"><surname>Bale</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Bick</surname><given-names>D</given-names></name>, <name name-style="western"><surname>Das</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Gastier-Foster</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Grody</surname><given-names>WW</given-names></name>, <name name-style="western"><surname>Hegde</surname><given-names>M</given-names></name>, <name name-style="western"><surname>Lyon</surname><given-names>E</given-names></name>, <name name-style="western"><surname>Spector</surname><given-names>E</given-names></name>, <name name-style="western"><surname>Voelkerding</surname><given-names>K</given-names></name>, <name name-style="western"><surname>Rehm</surname><given-names>HL</given-names></name></person-group>
<year>2015</year>
<article-title>Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology</article-title>. <source>Genet Med</source>
<volume>17</volume>:<fpage>405</fpage>&#x02013;<lpage>423</lpage>. doi:<pub-id pub-id-type="doi">10.1038/gim.2015.30</pub-id>.<pub-id pub-id-type="pmid">25741868</pub-id></mixed-citation></ref><ref id="B42"><label>42.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Lesho</surname><given-names>E</given-names></name>, <name name-style="western"><surname>Clifford</surname><given-names>R</given-names></name>, <name name-style="western"><surname>Onmus-Leone</surname><given-names>F</given-names></name>, <name name-style="western"><surname>Appalla</surname><given-names>L</given-names></name>, <name name-style="western"><surname>Snesrud</surname><given-names>E</given-names></name>, <name name-style="western"><surname>Kwak</surname><given-names>Y</given-names></name>, <name name-style="western"><surname>Ong</surname><given-names>A</given-names></name>, <name name-style="western"><surname>Maybank</surname><given-names>R</given-names></name>, <name name-style="western"><surname>Waterman</surname><given-names>P</given-names></name>, <name name-style="western"><surname>Rohrbeck</surname><given-names>P</given-names></name>, <name name-style="western"><surname>Julius</surname><given-names>M</given-names></name>, <name name-style="western"><surname>Roth</surname><given-names>A</given-names></name>, <name name-style="western"><surname>Martinez</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Nielsen</surname><given-names>L</given-names></name>, <name name-style="western"><surname>Steele</surname><given-names>E</given-names></name>, <name name-style="western"><surname>McGann</surname><given-names>P</given-names></name>, <name name-style="western"><surname>Hinkle</surname><given-names>M</given-names></name></person-group>
<year>2016</year>
<article-title>The challenges of implementing next generation sequencing across a large healthcare system, and the molecular epidemiology and antibiotic susceptibilities of carbapenemase-producing bacteria in the healthcare system of the U.S. Department of Defense</article-title>. <source>PLoS One</source>
<volume>11</volume>:<elocation-id>e0155770</elocation-id>. doi:<pub-id pub-id-type="doi">10.1371/journal.pone.0155770</pub-id>.<pub-id pub-id-type="pmid">27196272</pub-id></mixed-citation></ref><ref id="B43"><label>43.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Kalman</surname><given-names>LV</given-names></name>, <name name-style="western"><surname>Datta</surname><given-names>V</given-names></name>, <name name-style="western"><surname>Williams</surname><given-names>M</given-names></name>, <name name-style="western"><surname>Zook</surname><given-names>JM</given-names></name>, <name name-style="western"><surname>Salit</surname><given-names>ML</given-names></name>, <name name-style="western"><surname>Han</surname><given-names>J-Y</given-names></name></person-group>
<year>2016</year>
<article-title>Development and characterization of reference materials for genetic testing: focus on public partnerships</article-title>. <source>Ann Lab Med</source>
<volume>36</volume>:<fpage>513</fpage>&#x02013;<lpage>520</lpage>. doi:<pub-id pub-id-type="doi">10.3343/alm.2016.36.6.513</pub-id>.<pub-id pub-id-type="pmid">27578503</pub-id></mixed-citation></ref><ref id="B44"><label>44.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Lindsey</surname><given-names>RL</given-names></name>, <name name-style="western"><surname>Pouseele</surname><given-names>H</given-names></name>, <name name-style="western"><surname>Chen</surname><given-names>JC</given-names></name>, <name name-style="western"><surname>Strockbine</surname><given-names>NA</given-names></name>, <name name-style="western"><surname>Carleton</surname><given-names>HA</given-names></name></person-group>
<year>2016</year>
<article-title>Implementation of whole genome sequencing (WGS) for identification and characterization of Shiga toxin-producing <italic>Escherichia coli</italic> (STEC) in the United States</article-title>. <source>Front Microbiol</source>
<volume>7</volume>:<fpage>766</fpage>. doi:<pub-id pub-id-type="doi">10.3389/fmicb.2016.00766</pub-id>.<pub-id pub-id-type="pmid">27242777</pub-id></mixed-citation></ref><ref id="B45"><label>45.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Pankhurst</surname><given-names>LJ</given-names></name>, <name name-style="western"><surname>del Ojo Elias</surname><given-names>C</given-names></name>, <name name-style="western"><surname>Votintseva</surname><given-names>AA</given-names></name>, <name name-style="western"><surname>Walker</surname><given-names>TM</given-names></name>, <name name-style="western"><surname>Cole</surname><given-names>K</given-names></name>, <name name-style="western"><surname>Davies</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Fermont</surname><given-names>JM</given-names></name>, <name name-style="western"><surname>Gascoyne-Binzi</surname><given-names>DM</given-names></name>, <name name-style="western"><surname>Kohl</surname><given-names>TA</given-names></name>, <name name-style="western"><surname>Kong</surname><given-names>C</given-names></name>, <name name-style="western"><surname>Lemaitre</surname><given-names>N</given-names></name>, <name name-style="western"><surname>Niemann</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Paul</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Rogers</surname><given-names>TR</given-names></name>, <name name-style="western"><surname>Roycroft</surname><given-names>E</given-names></name>, <name name-style="western"><surname>Smith</surname><given-names>EG</given-names></name>, <name name-style="western"><surname>Supply</surname><given-names>P</given-names></name>, <name name-style="western"><surname>Tang</surname><given-names>P</given-names></name>, <name name-style="western"><surname>Wilcox</surname><given-names>MH</given-names></name>, <name name-style="western"><surname>Wordsworth</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Wyllie</surname><given-names>D</given-names></name>, <name name-style="western"><surname>Xu</surname><given-names>L</given-names></name>, <name name-style="western"><surname>Crook</surname><given-names>DW</given-names></name></person-group>
<year>2016</year>
<article-title>Rapid, comprehensive, and affordable mycobacterial diagnosis with whole-genome sequencing: a prospective study</article-title>. <source>Lancet Respir Med</source>
<volume>4</volume>:<fpage>49</fpage>&#x02013;<lpage>58</lpage>. doi:<pub-id pub-id-type="doi">10.1016/S2213-2600(15)00466-X</pub-id>.<pub-id pub-id-type="pmid">26669893</pub-id></mixed-citation></ref><ref id="B46"><label>46.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Walker</surname><given-names>TM</given-names></name>, <name name-style="western"><surname>Kohl</surname><given-names>TA</given-names></name>, <name name-style="western"><surname>Omar</surname><given-names>SV</given-names></name>, <name name-style="western"><surname>Hedge</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Del Ojo Elias</surname><given-names>C</given-names></name>, <name name-style="western"><surname>Bradley</surname><given-names>P</given-names></name>, <name name-style="western"><surname>Iqbal</surname><given-names>Z</given-names></name>, <name name-style="western"><surname>Feuerriegel</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Niehaus</surname><given-names>KE</given-names></name>, <name name-style="western"><surname>Wilson</surname><given-names>DJ</given-names></name>, <name name-style="western"><surname>Clifton</surname><given-names>DA</given-names></name>, <name name-style="western"><surname>Kapatai</surname><given-names>G</given-names></name>, <name name-style="western"><surname>Ip</surname><given-names>CLC</given-names></name>, <name name-style="western"><surname>Bowden</surname><given-names>R</given-names></name>, <name name-style="western"><surname>Drobniewski</surname><given-names>FA</given-names></name>, <name name-style="western"><surname>Allix-B&#x000e9;guec</surname><given-names>C</given-names></name>, <name name-style="western"><surname>Gaudin</surname><given-names>C</given-names></name>, <name name-style="western"><surname>Parkhill</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Diel</surname><given-names>R</given-names></name>, <name name-style="western"><surname>Supply</surname><given-names>P</given-names></name>, <name name-style="western"><surname>Crook</surname><given-names>DW</given-names></name>, <name name-style="western"><surname>Smith</surname><given-names>EG</given-names></name>, <name name-style="western"><surname>Walker</surname><given-names>AS</given-names></name>, <name name-style="western"><surname>Ismail</surname><given-names>N</given-names></name>, <name name-style="western"><surname>Niemann</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Peto</surname><given-names>TEA</given-names></name></person-group>
<year>2015</year>
<article-title>Whole-genome sequencing for prediction of <italic>Mycobacterium tuberculosis</italic> drug susceptibility and resistance: a retrospective cohort study</article-title>. <source>Lancet Infect Dis</source>
<volume>15</volume>:<fpage>1193</fpage>&#x02013;<lpage>1202</lpage>. doi:<pub-id pub-id-type="doi">10.1016/S1473-3099(15)00062-6</pub-id>.<pub-id pub-id-type="pmid">26116186</pub-id></mixed-citation></ref><ref id="B47"><label>47.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Zankari</surname><given-names>E</given-names></name>, <name name-style="western"><surname>Hasman</surname><given-names>H</given-names></name>, <name name-style="western"><surname>Cosentino</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Vestergaard</surname><given-names>M</given-names></name>, <name name-style="western"><surname>Rasmussen</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Lund</surname><given-names>O</given-names></name>, <name name-style="western"><surname>Aarestrup</surname><given-names>FM</given-names></name>, <name name-style="western"><surname>Larsen</surname><given-names>MV</given-names></name></person-group>
<year>2012</year>
<article-title>Identification of acquired antimicrobial resistance genes</article-title>. <source>J Antimicrob Chemother</source>
<volume>67</volume>:<fpage>2640</fpage>&#x02013;<lpage>2644</lpage>. doi:<pub-id pub-id-type="doi">10.1093/jac/dks261</pub-id>.<pub-id pub-id-type="pmid">22782487</pub-id></mixed-citation></ref><ref id="B48"><label>48.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Arnold</surname><given-names>C</given-names></name></person-group>
<year>2016</year>
<article-title>Considerations in centralizing whole genome sequencing for microbiology in a public health setting</article-title>. <source>Expert Rev Mol Diagn</source>
<volume>16</volume>:<fpage>619</fpage>&#x02013;<lpage>621</lpage>. doi:<pub-id pub-id-type="doi">10.1586/14737159.2016.1164039</pub-id>.<pub-id pub-id-type="pmid">26953799</pub-id></mixed-citation></ref><ref id="B49"><label>49.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Auffray</surname><given-names>C</given-names></name>, <name name-style="western"><surname>Caulfield</surname><given-names>T</given-names></name>, <name name-style="western"><surname>Griffin</surname><given-names>JL</given-names></name>, <name name-style="western"><surname>Khoury</surname><given-names>MJ</given-names></name>, <name name-style="western"><surname>Lupski</surname><given-names>JR</given-names></name>, <name name-style="western"><surname>Schwab</surname><given-names>M</given-names></name></person-group>
<year>2016</year>
<article-title>From genomic medicine to precision medicine: highlights of 2015</article-title>. <source>Genome Med</source>
<volume>8</volume>:<fpage>12</fpage>. doi:<pub-id pub-id-type="doi">10.1186/s13073-016-0265-4</pub-id>.<pub-id pub-id-type="pmid">26825779</pub-id></mixed-citation></ref><ref id="B50"><label>50.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Allard</surname><given-names>MW</given-names></name>, <name name-style="western"><surname>Strain</surname><given-names>E</given-names></name>, <name name-style="western"><surname>Melka</surname><given-names>D</given-names></name>, <name name-style="western"><surname>Bunning</surname><given-names>K</given-names></name>, <name name-style="western"><surname>Musser</surname><given-names>SM</given-names></name>, <name name-style="western"><surname>Brown</surname><given-names>EW</given-names></name>, <name name-style="western"><surname>Timme</surname><given-names>R</given-names></name></person-group>
<year>2016</year>
<article-title>Practical value of food pathogen traceability through building a whole-genome sequencing network and database</article-title>. <source>J Clin Microbiol</source>
<volume>54</volume>:<fpage>1975</fpage>&#x02013;<lpage>1983</lpage>. doi:<pub-id pub-id-type="doi">10.1128/JCM.00081-16</pub-id>.<pub-id pub-id-type="pmid">27008877</pub-id></mixed-citation></ref><ref id="B51"><label>51.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Robilotti</surname><given-names>E</given-names></name>, <name name-style="western"><surname>Kamboj</surname><given-names>M</given-names></name></person-group>
<year>2015</year>
<article-title>Integration of whole-genome sequencing into infection control practices: the potential and the hurdles</article-title>. <source>J Clin Microbiol</source>
<volume>53</volume>:<fpage>1054</fpage>&#x02013;<lpage>1055</lpage>. doi:<pub-id pub-id-type="doi">10.1128/JCM.00349-15</pub-id>.<pub-id pub-id-type="pmid">25673795</pub-id></mixed-citation></ref><ref id="B52"><label>52.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Li</surname><given-names>H</given-names></name>, <name name-style="western"><surname>Handsaker</surname><given-names>B</given-names></name>, <name name-style="western"><surname>Wysoker</surname><given-names>A</given-names></name>, <name name-style="western"><surname>Fennell</surname><given-names>T</given-names></name>, <name name-style="western"><surname>Ruan</surname><given-names>J</given-names></name>, <name name-style="western"><surname>Homer</surname><given-names>N</given-names></name>, <name name-style="western"><surname>Marth</surname><given-names>G</given-names></name>, <name name-style="western"><surname>Abecasis</surname><given-names>G</given-names></name>, <name name-style="western"><surname>Durbin</surname><given-names>R</given-names></name></person-group>, <collab>Genome Project Data Processing Subgroup</collab>. <year>2009</year>
<article-title>The Sequence Alignment/Map format and SAMtools</article-title>. <source>Bioinformatics</source>
<volume>25</volume>:<fpage>2078</fpage>&#x02013;<lpage>2079</lpage>. doi:<pub-id pub-id-type="doi">10.1093/bioinformatics/btp352</pub-id>.<pub-id pub-id-type="pmid">19505943</pub-id></mixed-citation></ref><ref id="B53"><label>53.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Danecek</surname><given-names>P</given-names></name>, <name name-style="western"><surname>Auton</surname><given-names>A</given-names></name>, <name name-style="western"><surname>Abecasis</surname><given-names>G</given-names></name>, <name name-style="western"><surname>Albers</surname><given-names>CA</given-names></name>, <name name-style="western"><surname>Banks</surname><given-names>E</given-names></name>, <name name-style="western"><surname>DePristo</surname><given-names>MA</given-names></name>, <name name-style="western"><surname>Handsaker</surname><given-names>RE</given-names></name>, <name name-style="western"><surname>Lunter</surname><given-names>G</given-names></name>, <name name-style="western"><surname>Marth</surname><given-names>GT</given-names></name>, <name name-style="western"><surname>Sherry</surname><given-names>ST</given-names></name>, <name name-style="western"><surname>McVean</surname><given-names>G</given-names></name>, <name name-style="western"><surname>Durbin</surname><given-names>R</given-names></name></person-group>, <collab>Genomes Project Analysis Group</collab>. <year>2011</year>
<article-title>The variant call format and VCFtools</article-title>. <source>Bioinformatics</source>
<volume>27</volume>:<fpage>2156</fpage>&#x02013;<lpage>2158</lpage>. doi:<pub-id pub-id-type="doi">10.1093/bioinformatics/btr330</pub-id>.<pub-id pub-id-type="pmid">21653522</pub-id></mixed-citation></ref><ref id="B54"><label>54.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Seemann</surname><given-names>T</given-names></name></person-group>
<year>2014</year>
<article-title>Prokka: rapid prokaryotic genome annotation</article-title>. <source>Bioinformatics</source>
<volume>30</volume>:<fpage>2068</fpage>&#x02013;<lpage>2069</lpage>. doi:<pub-id pub-id-type="doi">10.1093/bioinformatics/btu153</pub-id>.<pub-id pub-id-type="pmid">24642063</pub-id></mixed-citation></ref><ref id="B55"><label>55.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Cole</surname><given-names>JR</given-names></name>, <name name-style="western"><surname>Wang</surname><given-names>Q</given-names></name>, <name name-style="western"><surname>Fish</surname><given-names>JA</given-names></name>, <name name-style="western"><surname>Chai</surname><given-names>B</given-names></name>, <name name-style="western"><surname>McGarrell</surname><given-names>DM</given-names></name>, <name name-style="western"><surname>Sun</surname><given-names>Y</given-names></name>, <name name-style="western"><surname>Brown</surname><given-names>CT</given-names></name>, <name name-style="western"><surname>Porras-Alfaro</surname><given-names>A</given-names></name>, <name name-style="western"><surname>Kuske</surname><given-names>CR</given-names></name>, <name name-style="western"><surname>Tiedje</surname><given-names>JM</given-names></name></person-group>
<year>2014</year>
<article-title>Ribosomal Database Project: data and tools for high throughput rRNA analysis</article-title>. <source>Nucleic Acids Res</source>
<volume>42</volume>:<fpage>D633</fpage>&#x02013;<lpage>D642</lpage>. doi:<pub-id pub-id-type="doi">10.1093/nar/gkt1244</pub-id>.<pub-id pub-id-type="pmid">24288368</pub-id></mixed-citation></ref><ref id="B56"><label>56.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Larsen</surname><given-names>MV</given-names></name>, <name name-style="western"><surname>Cosentino</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Rasmussen</surname><given-names>S</given-names></name>, <name name-style="western"><surname>Friis</surname><given-names>C</given-names></name>, <name name-style="western"><surname>Hasman</surname><given-names>H</given-names></name>, <name name-style="western"><surname>Marvig</surname><given-names>RL</given-names></name>, <name name-style="western"><surname>Jelsbak</surname><given-names>L</given-names></name>, <name name-style="western"><surname>Sicheritz-Ponten</surname><given-names>T</given-names></name>, <name name-style="western"><surname>Ussery</surname><given-names>DW</given-names></name>, <name name-style="western"><surname>Aarestrup</surname><given-names>FM</given-names></name>, <name name-style="western"><surname>Lund</surname><given-names>O</given-names></name></person-group>
<year>2012</year>
<article-title>Multilocus sequence typing of total-genome-sequenced bacteria</article-title>. <source>J Clin Microbiol</source>
<volume>50</volume>:<fpage>1355</fpage>&#x02013;<lpage>1361</lpage>. doi:<pub-id pub-id-type="doi">10.1128/JCM.06094-11</pub-id>.<pub-id pub-id-type="pmid">22238442</pub-id></mixed-citation></ref><ref id="B57"><label>57.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name name-style="western"><surname>Joensen</surname><given-names>KG</given-names></name>, <name name-style="western"><surname>Scheutz</surname><given-names>F</given-names></name>, <name name-style="western"><surname>Lund</surname><given-names>O</given-names></name>, <name name-style="western"><surname>Hasman</surname><given-names>H</given-names></name>, <name name-style="western"><surname>Kaas</surname><given-names>RS</given-names></name>, <name name-style="western"><surname>Nielsen</surname><given-names>EM</given-names></name>, <name name-style="western"><surname>Aarestrup</surname><given-names>FM</given-names></name></person-group>
<year>2014</year>
<article-title>Real-time whole-genome sequencing for routine typing, surveillance, and outbreak detection of verotoxigenic <italic>Escherichia coli</italic></article-title>. <source>J Clin Microbiol</source>
<volume>52</volume>:<fpage>1501</fpage>&#x02013;<lpage>1510</lpage>. doi:<pub-id pub-id-type="doi">10.1128/JCM.03617-13</pub-id>.<pub-id pub-id-type="pmid">24574290</pub-id></mixed-citation></ref><ref id="B58"><label>58.</label><mixed-citation publication-type="book"><collab>CLSI</collab>. <year>2015</year>
<article-title>Performance standards for antimicrobial susceptibility testing; twenty-fifth informational supplement</article-title>. <source>CLSI document M100-S25</source>. <publisher-name>CLSI</publisher-name>, <publisher-loc>Wayne, PA</publisher-loc>.</mixed-citation></ref><ref id="B59"><label>59.</label><mixed-citation publication-type="book"><collab>CLSI</collab>. <year>2007</year>
<source>Molecular methods for bacterial strain typing; approved guideline, MM11-A</source>. <publisher-name>CLSI</publisher-name>, <publisher-loc>Wayne, PA</publisher-loc>.</mixed-citation></ref></ref-list></back></article>