Conceived and designed the experiments: CN AF MDF. Performed the experiments: AC DEA. Analyzed the data: DF KCP. Contributed reagents/materials/analysis tools: CLA AC MDF. Wrote the paper: DF KCP CLA CN AF MDF.
Autism is a common heritable neurodevelopmental disorder with complex etiology. Several genome-wide linkage and association scans have been carried out to identify regions harboring genes related to autism or autism spectrum disorders, with mixed results. Given the overlap in autism features with genetic abnormalities known to be associated with imprinting, one possible reason for lack of consistency would be the influence of parent-of-origin effects that may mask the ability to detect linkage and association.
We have performed a genome-wide linkage scan that accounts for potential parent-of-origin effects using 16,311 SNPs among families from the Autism Genetic Resource Exchange (AGRE) and the National Institute of Mental Health (NIMH) autism repository. We report parametric (GH, Genehunter) and allele-sharing linkage (Aspex) results using a broad spectrum disorder case definition. Paternal-origin genome-wide statistically significant linkage was observed on chromosomes 4 (LODGH = 3.79, empirical p<0.005 and LODAspex = 2.96, p = 0.008), 15 (LODGH = 3.09, empirical p<0.005 and LODAspex = 3.62, empirical p = 0.003) and 20 (LODGH = 3.36, empirical p<0.005 and LODAspex = 3.38, empirical p = 0.006).
These regions may harbor imprinted sites associated with the development of autism and offer fruitful domains for molecular investigation into the role of epigenetic mechanisms in autism.
Autism is a neurodevelopmental disorder that is manifested in early childhood and is characterized by impairments in reciprocal social interactions and language, and a restricted range of behaviors and interests. Autism is considered a spectrum disorder (ASD) with heterogeneity in symptom presentation. Inheritance clearly plays a major role in susceptibility to autism
Epigenetic factors, which are often heritable, yet not part of the DNA sequence, are one element which may contribute to this etiologic complexity. Imprinting is an epigenetic modification that is parental origin specific, leading to preferential expression of a specific parental allele in somatic cells of the offspring
If imprinting plays a role in the heritable etiology of ASD, the power of linkage analyses to identify susceptibility loci may be improved by accounting for allelic parent-of-origin. This has been observed for specific autism-implicated genomic regions such as 7q, where both paternal and maternal allele sharing have been observed to account for the linkage to an autism locus in this region
To date, however, no genome-scale parent-of-origin-specific linkage analysis has been reported for ASD. Here we apply parent-of-origin linkage analysis to the genome-wide SNP data recently reported by Weiss et al. in a common set of multiplex autism families
The samples used here were previously described by Weiss et al.
All samples used in this study arose from investigations approved by the appropriate Institutional Review Boards for institutions where participants were recruited, evaluated, or where genotype data were generated. Written informed consent was obtained for all adult study participants; for children under age 18, both the consent of the parents or guardians and the assent of the child were obtained. This secondary analysis of de-identified data was considered to be exempt from IRB review.
SNP genotyping was previously described
Parametric and non-parametric parent-of-origin linkage methods were applied. Parametric linkage analysis was conducted using GENEHUNTER-IMPRINTING 2.1 (GHI)
For non-parametric linkage analysis, maximum likelihood estimates of allele sharing at each locus were computed using the ASPEX “sib_ibd” command. The “sex_split” option was implemented to evaluate evidence for linkage based on maternal and paternal sharing separately. 1070 affected sibling pairs were informative (had ≥1 parental genotype) for these analyses. LOD scores indicate the log ratio of the likelihood of the marker data at this position with the MLE estimate of the sibling recurrence risk ratio, versus the likelihood assuming a sibling risk ratio of 1.
Empirical genome-wide p values were calculated via simulation. The program Merlin
Genome-wide results for maternal and paternal linkage analyses via both parametric and non-parametric methods are shown in
Panel A: Parametric results. Panel B: Allele sharing results. Maternal scores are shown in red; paternal scores are in blue; no-imprinting scores are shown in gray.
Maternal scores are shown in red; paternal scores are in blue.
| Parametric (GHI) | Allele-Sharing (Aspex) | |||||||||
| Chr | Map (cM) | No-imp HLOD | Pat. HLOD | Mat. HLOD | Pval | Map (cM) | No-imp LOD | Pat. LOD | Mat. LOD | Pval |
| 1 | 156.55 | 0.66 | 0 | 0.01 | 168.67 | 0.76 | 1.52 | −0.76 | 0.17 | |
| 2 | 82.90 | 0.43 | 0.12 | 1.00 | 1.00 | 15.77 | 0.85 | −0.66 | 1.52 | 0.21 |
| 3 | 189.11 | 1.03 | 0 | 1.64 | 0.22 | 192.74 | 1.67 | −0.1 | 1.79 | 0.13 |
| 4 | 67.85 | 0.01 | 74.81 | 2.01 | 2.96 | −0.95 | 0.008 | |||
| 5 | 182.85 | 0.49 | 1.85 | 0 | 0.08 | 205.55 | 1.82 | −0.42 | 2.24 | 0.05 |
| 6 | 177.24 | 3.46 | 1.01 | 0.01 | 188.29 | 3.87 | 1.15 | 2.72 | 0.02 | |
| 7 | 20.32 | 0.55 | 0 | 1.79 | 0.13 | 156.43 | 2.99 | 0.84 | 2.16 | 0.06 |
| 8 | NA | <1 | <1 | <1 | NA | 120.57 | 1.952 | 0.63 | 1.29 | 0.31 |
| 9 | 158.19 | 0.73 | 0 | 1.44 | 0.76 | 156.77 | 2.00 | −0.31 | 2.32 | 0.04 |
| 10 | 169.29 | 0.72 | 2.69 | 0 | 0.01 | 103.52 | 2.22 | 0.36 | 1.86 | 0.12 |
| 11 | 111.01 | 0.77 | 1.13 | 0 | 1.00 | 114.49 | 1.40 | 1.87 | −0.48 | 0.08 |
| 12 | 134.99 | 0.30 | 0 | 1.19 | 1.00 | 29.46 | 0.80 | 1.14 | −0.34 | 0.23 |
| 13 | 56.98 | 0.86 | 0.02 | 1.04 | 1.00 | 113.79 | 2.75 | 2.05 | 0.71 | 0.06 |
| 14 | 62.33 | 0.43 | 0 | 2.38 | 0.01 | 18.67 | 0.58 | 1.15 | −0.56 | 0.37 |
| 15 | 79.68 | 1.78 | 0.03 | 0.01 | 92.16 | 3.52 | 3.62 | −0.11 | 0.003 | |
| 16 | 121.30 | 0 | 1.16 | 0 | 1.00 | 78.81 | 1.41 | 1.77 | −0.35 | 0.10 |
| 17 | 45.23 | 2.15 | 2.03 | 0.58 | 0.01 | 81.45 | 1.56 | 2.02 | −0.46 | 0.07 |
| 18 | NA | <1 | <1 | <1 | NA | 91.97 | 1.33 | 1.27 | 0.06 | 0.3 |
| 19 | 92.91 | 0.16 | 1.27 | 0 | 0.92 | 37.73 | 2.75 | 0.99 | 1.77 | 0.13 |
| 20 | 3.38 | 3.04 | 0.83 | <0.005 | 3.20 | 4.13 | 3.38 | 0.75 | 0.006 | |
| 21 | 3.58 | 0.66 | 0.28 | 1.55 | 0.58 | 6.12 | 1.23 | 0.17 | 1.06 | 0.45 |
| 22 | NA | <1 | <1 | <1 | NA | 2.25 | 0.60 | −0.75 | 1.35 | 0.28 |
*Optimal parametric model for this linkage signal (see
**Empirical genome-wide p values based on 5000 simulations.
| Model | P(+/+) | P(d/+) | P(+/d) | P(d/d) | |
| 0.001 | 0.999 | 0.001 | 0.999 | ||
| 0.001 | 0.001 | 0.999 | 0.999 | ||
| 0.001 | 0.499 | 0.001 | 0.499 | ||
| 0.001 | 0.399 | 0.099 | 0.499 | ||
| 0.001 | 0.599 | 0.199 | 0.799 | ||
| 0.001 | 0.499 | 0.001 | 0.499 | ||
| 0.001 | 0.799 | 0.199 | 0.999 |
*Paternally inherited allele named first. +: wild-type allele; d: disease allele; P: penetrance.
**Parameter values for best linkage signal after exploratory sensitivity analysis.
Although significant maternal peaks were observed in both parametric and allele-sharing methods, no consistency in signal was seen. A significant maternal peak was observed on chromosome 14 in parametric analyses (rs923485:rs17177789, HLOD = 2.38, p = 0.01), although this was not observed in allele-sharing analysis. Maternal allele sharing peaks were observed on chromosomes 5, 6, 7, and 9 (
Given the vital role of imprinted genes in development, the fact that many known imprinted genes are expressed in the brain, and evidence of overlapping features in autism and imprinting disorders, we investigated the effect of incorporating allelic parent-of-origin into an autosomal linkage scan for autism. To our knowledge, this is the most extensive linkage analysis for parent-of-origin effects in autism to date. We found the strongest evidence for parent-of-origin effects on chromosomes 4, 20 and 15, implicating sites where imprinted loci related to autism may reside.
The section of chromosome 4 located between markers rs6826933 and rs17088473 showed several significant results in our analysis and spans the region between 4q12-4q13.2. Recently, Weiss et al. found an association between one SNP (rs17088254, p = 8.5×10−6) located on this region and autism using the same data without regard to parental origin. The strongest candidate gene in this region is
Analyses of a panel of microsatellite markers in 348 AGRE families from previously reported linkage analysis
A region of chromosome 15 (15q23-15q25.3) also shows paternal linkage. This region was previously implicated using traditional linkage analysis in these SNP data
We have also reported a strong paternal linkage on chromosome 20p, which was previously implicated in the linkage analyses reported by Weiss et al
Other suggestive parent-specific linkage regions are located throughout the genome (on chromosomes 1, 5, 6, 7, 8, 9, 10, 13, 14, 17 and 21). The paternally linked region on chromosome 1 (1q23-1q24.2) was previously associated with autism. Wassink
A maternally linked region was observed on chromosome 5, with the peak at 5p13.1. Recent genome-wide association studies have reported risk loci for autism at 5p14.1
The signal found on 6q25.3-6q27 region was previously linked to autism
Two other groups have previously reported parent-of-origin linkage with autism for closely located loci on chromosome 7; one was a paternal contribution in the region 7q31.33-7q34
Few previous studies have considered parent-or-origin effects in autism. Those that have used previous-generation marker sets and much smaller samples than the results presented here. Two previous studies observed parent-or-origin linkage on chromosome 7, but with different regions and types of parental sharing
In an attempt to detect loci with possible parent-of-origin effects, we used multiple statistical approaches, rather than relying on a single strategy. Consistent evidence of linkage across multiple methods increases support for a true linkage. However, the appropriate interpretation of inconsistent results across parametric and non-parametric analyses is not entirely clear. These may be due to chance findings in one analysis, or they may be true linkage that only one method was sufficiently powered to detect. For example, the chromosome 1 peak was significant only in the parametric analysis, a method which is more powerful given that the parameters are correctly specified. While it is unrealistic to believe that we could actually have specified the “correct” parameters given the complex nature of autism, those selected may have been sufficiently close. Peaks on chromosomes 6 and 9 were significant in the non-parametric ASPEX analysis; however, the parametric GHI analysis did not find significant peaks on these chromosomes, which may be due to selection of “incorrect” parameters for the models run.
These analyses considered as affected all children with an ASD, as defined by the ADI-R and ADOS in the NIMH sample. However, in AGRE, we included those with autistic disorder, as well as those with “not quite autism” and “broad spectrum” to encompass Asperger's and PDD-NOS. This may have contributed to some heterogeneity or misclassification in our data, but was considered more appropriately inclusive and comparable to the NIMH ASD families than excluding a larger number of AGRE families with an ASD other than autistic disorder.
Our results suggest the usefulness of genome-wide analysis with evaluation of parent-of-origin effects, although future studies are necessary to determine if these results can be replicated. Given the potential role for imprinting and other epigenetic mechanisms in neuropsychiatric disorders such as autism
Parent-of-Origin Linkage Analysis for Microsatellite Markers in 384 AGRE families. A: Parametric results. Panel B: Allele sharing results. Maternal scores are shown in red; paternal scores are in blue.
(0.68 MB TIF)
Click here for additional data file.
We would like to acknowledge the families who participated in the AGRE and NIMH repositories. The NIMH collection Principal Investigators and Co-Investigators were: Neil Risch, Richard M. Myers, Donna Spiker, Linda J. Lotspeich, Joachim F. Hallmayer, Helena C. Kraemer, Roland D. Ciaranello, Luigi Luca Cavalli-Sforza (Stanford University, Stanford); William M. McMahon and P. Brent Petersen (University of Utah, Salt Lake City). The Stanford team is indebted to the parent groups and clinician colleagues who referred families and extends their gratitude to the families with individuals with autism who were partners in this research. The collection data and biomaterials also come from the Autism Genetic Resource Exchange (AGRE) collection. The AGRE collection Principal Investigator is Daniel H. Geschwind (UCLA). The Co-Principal Investigators include Stanley F. Nelson and Rita M. Cantor (UCLA), Christa Lese Martin (Univ. Chicago), T. Conrad Gilliam (Columbia). Co-Investigators include Maricela Alarcon (UCLA), Kenneth Lange (UCLA), Sarah J. Spence (UCLA), David H. Ledbetter (Emory) and Hank Juo (Columbia). Scientific oversight of the AGRE program is provided by a steering committee (Chair: Daniel H. Geschwind; Members: W. Ted Brown, Maja Bucan, Joseph D. Buxbaum, T. Conrad Gilliam, David Greenberg, David H. Ledbetter, Bruce Miller, Stanley F. Nelson, Jonathan Pevsner, Carol Sprouse, Gerard D. Schellenberg and Rudolph Tanzi).