HGTector: an automated method facilitating genome-wide discovery of putative horizontal gene transfers

Supporting Files Public Domain


Details

  • Alternative Title:
    BMC Genomics
  • Description:
    Background

    First-pass methods based on BLAST matches are commonly used as an initial step to separate the different phylogenetic histories of genes in microbial genomes and to target putative horizontal gene transfer (HGT) events. Such methods will remain necessary given the rapid growth of genomic data and the technical difficulties in conducting large-scale explicit phylogenetic analyses. However, they often produce misleading results due to their inability to resolve indirect phylogenetic links and their vulnerability to stochastic events.

    Results

    A new computational method for rapid, exhaustive, genome-wide detection of HGT was developed. It systematically analyzes BLAST hit distribution patterns in the context of a priori defined hierarchical evolutionary categories. Genes that fall beyond a series of statistically determined thresholds are identified as not adhering to the typical vertical history of the organisms in question and instead as having a putative horizontal origin. Tests on simulated genomic data suggest that this approach effectively targets atypically distributed genes that are highly likely to be HGT-derived, and that it performs robustly compared to conventional BLAST-based approaches. The method was further tested on real genomic datasets, including Rickettsia genomes, and was compared to previous studies. Results show consistency with currently employed categories of HGT prediction methods. In-depth analysis of both simulated and real genomic data suggests that the method is notably insensitive to stochastic events such as gene loss, rate variation and database error, which are common challenges to the current methodology. An automated pipeline was created to implement this approach and was made publicly available at: https://github.com/DittmarLab/HGTector. The program is versatile, easily deployed, and has a low requirement for computational resources.
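    The idea of scoring each gene by how its BLAST hits distribute across predefined evolutionary categories can be illustrated with a minimal Python sketch. This is not HGTector's implementation. The category names ("self", "close", "distal"), the use of summed bit scores as weights, the nearest-rank percentile cutoffs, and all function and variable names are assumptions made for illustration; the abstract states only that the thresholds are statistically determined.

```python
import math
from collections import defaultdict

# Hypothetical category labels; the source only says the categories are
# hierarchical and defined a priori.
CATEGORIES = ("self", "close", "distal")


def group_weights(hits, group_of):
    """Sum BLAST bit scores per category for each query gene.

    hits: iterable of (gene, subject_taxon, bit_score) tuples.
    group_of: dict mapping a subject taxon to one of CATEGORIES.
    Hits to taxa absent from group_of are ignored.
    """
    weights = defaultdict(lambda: dict.fromkeys(CATEGORIES, 0.0))
    for gene, taxon, score in hits:
        group = group_of.get(taxon)
        if group is not None:
            weights[gene][group] += score
    return dict(weights)


def percentile(values, q):
    """Nearest-rank percentile of a non-empty sequence, q in (0, 100]."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile() needs at least one value")
    rank = max(1, math.ceil(q / 100 * len(ordered)))
    return ordered[rank - 1]


def flag_candidates(weights, close_q=25, distal_q=75):
    """Flag genes with atypically low 'close' and high 'distal' weight.

    The percentile cutoffs are an assumed stand-in for the statistically
    determined thresholds mentioned in the abstract.
    """
    close_cut = percentile([w["close"] for w in weights.values()], close_q)
    distal_cut = percentile([w["distal"] for w in weights.values()], distal_q)
    return sorted(
        gene
        for gene, w in weights.items()
        if w["close"] <= close_cut and w["distal"] >= distal_cut
    )


# Toy data: taxon names and scores are invented for illustration only.
GROUP_OF = {"taxA": "self", "taxB": "close", "taxC": "distal"}
HITS = [
    ("g1", "taxA", 1000.0), ("g1", "taxB", 900.0), ("g1", "taxC", 100.0),
    ("g2", "taxB", 850.0), ("g2", "taxC", 120.0),
    ("g3", "taxB", 800.0), ("g3", "taxC", 90.0),
    ("g4", "taxB", 50.0), ("g4", "taxC", 700.0),
    ("g4", "taxZ", 999.0),  # unknown taxon, ignored
]

candidates = flag_candidates(group_weights(HITS, GROUP_OF))
print(candidates)
```

    In this toy input only g4 combines a weak signal from close relatives with a strong signal from distant taxa, so it is the only gene flagged. A fixed percentile is the simplest possible cutoff and was chosen here only to keep the sketch short.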

    Conclusions

    HGTector is an effective tool for initial or standalone large-scale discovery of candidate HGT-derived genes.

    Electronic supplementary material

    The online version of this article (doi:10.1186/1471-2164-15-717) contains supplementary material, which is available to authorized users.

  • Source:
    BMC Genomics. 15(1).
  • Volume:
    15
  • Issue:
    1
  • Main Document Checksum:
    urn:sha256:2a7f04faf5753b1f3ec69ad790b8c4267491ae8f0e7c6fda692065d4b6e844bc
  • File Type:
    PDF - 1.81 MB

CDC STACKS serves as an archival repository of CDC-published products including scientific findings, journal articles, guidelines, recommendations, or other public health information authored or co-authored by CDC or funded partners.

As a repository, CDC STACKS retains documents in their original published format to ensure public access to scientific information.