Welcome to CDC Stacks | Robust Regression Analysis of Copy Number Variation Data based on a Univariate Score - 27853 | CDC Public Access
Stacks Logo
Advanced Search
Select up to three search categories and corresponding keywords using the fields to the right. Refer to the Help section for more detailed instructions.
 
 
Help
Clear All Simple Search
Advanced Search
Robust Regression Analysis of Copy Number Variation Data based on a Univariate Score
Filetype[PDF - 539.42 KB]


Details:
  • Funding:
    HL077663/HL/NHLBI NIH HHS/United States
    MH080129/MH/NIMH NIH HHS/United States
    MH080583/MH/NIMH NIH HHS/United States
    MH083722/MH/NIMH NIH HHS/United States
    MH084680/MH/NIMH NIH HHS/United States
  • Document Type:
  • Collection(s):
  • Description:
    Motivation

    The discovery that copy number variants (CNVs) are widespread in the human genome has motivated development of numerous algorithms that attempt to detect CNVs from intensity data. However, all approaches are plagued by high false discovery rates. Further, because CNVs are characterized by two dimensions (length and intensity) it is unclear how to order called CNVs to prioritize experimental validation.

    Results

    We developed a univariate score that correlates with the likelihood that a CNV is true. This score can be used to order CNV calls in such a way that calls having larger scores are more likely to overlap a true CNV. We developed cnv.beast, a computationally efficient algorithm for calling CNVs that uses robust backward elimination regression to keep CNV calls with scores that exceed a user-defined threshold. Using an independent dataset that was measured using a different platform, we validated our score and showed that our approach performed better than six other currently-available methods.

    Availability

    cnv.beast is available at http://www.duke.edu/~asallen/Software.html.