Open Research Newcastle
Browse

Leveraging genomic annotations and pleiotropic enrichment for improved replication rates in schizophrenia GWAS

Download (1.89 MB)
journal contribution
posted on 2025-05-08, 18:39 authored by Yunpeng Wang, Wesley K. Thompson, Anna Devor, Bipolar Disorder and Schizophrenia Working Group of the Psychiatric Genomics Consortium, Enhancing Neuro Imaging Genetics through Meta Analysis Consortium, Murray CairnsMurray Cairns, Frans HenskensFrans Henskens, Brian KellyBrian Kelly, Carmel LoughlandCarmel Loughland, Patricia MichiePatricia Michie, Ulrich A. Schall, Rodney ScottRodney Scott, Andrew J. Schork, Paul TooneyPaul Tooney, Jing Qin Wu, Dominic Holland, Chi-Hua Chen, Francesco Bettella, Rahul S. Desikan, Wen Li, Aree Witoelar, Verena Zuber
Most of the genetic architecture of schizophrenia (SCZ) has not yet been identified. Here, we apply a novel statistical algorithm called Covariate-Modulated Mixture Modeling (CM3), which incorporates auxiliary information (heterozygosity, total linkage disequilibrium, genomic annotations, pleiotropy) for each single nucleotide polymorphism (SNP) to enable more accurate estimation of replication probabilities, conditional on the observed test statistic (“z-score”) of the SNP. We use a multiple logistic regression on z-scores to combine information from auxiliary information to derive a “relative enrichment score” for each SNP. For each stratum of these relative enrichment scores, we obtain nonparametric estimates of posterior expected test statistics and replication probabilities as a function of discovery z-scores, using a resampling-based approach that repeatedly and randomly partitions meta-analysis sub-studies into training and replication samples. We fit a scale mixture of two Gaussians model to each stratum, obtaining parameter estimates that minimize the sum of squared differences of the scale-mixture model with the stratified nonparametric estimates. We apply this approach to the recent genome-wide association study (GWAS) of SCZ (n = 82,315), obtaining a good fit between the model-based and observed effect sizes and replication probabilities. We observed that SNPs with low enrichment scores replicate with a lower probability than SNPs with high enrichment scores even when both they are genome-wide significant (p < 5x10-8). There were 693 and 219 independent loci with model-based replication rates ≥80% and ≥90%, respectively. Compared to analyses not incorporating relative enrichment scores, CM3 increased out-of-sample yield for SNPs that replicate at a given rate. This demonstrates that replication probabilities can be more accurately estimated using prior enrichment information with CM3.

History

Journal title

PLOS Genetics

Volume

12

Issue

1

Publisher

Public Library of Science (PLoS)

Language

  • en, English

College/Research Centre

Faculty of Health and Medicine

School

School of Biomedical Sciences and Pharmacy

Rights statement

© 2016 Wang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Usage metrics

    Publications

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC