Opin vísindi

Polygenic risk prediction : why and when out-of-sample prediction R2 can exceed SNP-based heritability

Show simple item record

dc.contributor.author Major Depressive Disorder Working Group of the Psychiatric Genomics Consortium
dc.date.accessioned 2024-04-25T01:05:41Z
dc.date.available 2024-04-25T01:05:41Z
dc.date.issued 2023-07-06
dc.identifier.citation Major Depressive Disorder Working Group of the Psychiatric Genomics Consortium 2023 , ' Polygenic risk prediction : why and when out-of-sample prediction R 2 can exceed SNP-based heritability ' , American Journal of Human Genetics , vol. 110 , no. 7 , pp. 1207-1215 . https://doi.org/10.1016/j.ajhg.2023.06.006
dc.identifier.issn 0002-9297
dc.identifier.other 182433905
dc.identifier.other 1372cbbf-fb4b-4e7f-ab4c-5b477e23818d
dc.identifier.other 85164270154
dc.identifier.other 37379836
dc.identifier.other unpaywall: 10.1016/j.ajhg.2023.06.006
dc.identifier.uri https://hdl.handle.net/20.500.11815/4836
dc.description Copyright © 2023 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
dc.description.abstract In polygenic score (PGS) analysis, the coefficient of determination (R2) is a key statistic to evaluate efficacy. R2 is the proportion of phenotypic variance explained by the PGS, calculated in a cohort that is independent of the genome-wide association study (GWAS) that provided estimates of allelic effect sizes. The SNP-based heritability (hSNP2, the proportion of total phenotypic variances attributable to all common SNPs) is the theoretical upper limit of the out-of-sample prediction R2. However, in real data analyses R2 has been reported to exceed hSNP2, which occurs in parallel with the observation that hSNP2 estimates tend to decline as the number of cohorts being meta-analyzed increases. Here, we quantify why and when these observations are expected. Using theory and simulation, we show that if heterogeneities in cohort-specific hSNP2 exist, or if genetic correlations between cohorts are less than one, hSNP2 estimates can decrease as the number of cohorts being meta-analyzed increases. We derive conditions when the out-of-sample prediction R2 will be greater than hSNP2 and show the validity of our derivations with real data from a binary trait (major depression) and a continuous trait (educational attainment). Our research calls for a better approach to integrating information from multiple cohorts to address issues of between-cohort heterogeneity.
dc.format.extent 9
dc.format.extent 3218327
dc.format.extent 1207-1215
dc.language.iso en
dc.relation.ispartofseries American Journal of Human Genetics; 110(7)
dc.rights info:eu-repo/semantics/openAccess
dc.subject meta-analysis
dc.subject out-of-sample prediction R
dc.subject polygenic risk prediction
dc.subject SNP-based heritability
dc.subject Genome-Wide Association Study
dc.subject Polymorphism, Single Nucleotide/genetics
dc.subject Phenotype
dc.subject Humans
dc.subject Computer Simulation
dc.subject Multifactorial Inheritance/genetics
dc.subject Genetics (clinical)
dc.subject Genetics
dc.title Polygenic risk prediction : why and when out-of-sample prediction R2 can exceed SNP-based heritability
dc.type /dk/atira/pure/researchoutput/researchoutputtypes/contributiontojournal/article
dc.description.version Peer reviewed
dc.identifier.doi 10.1016/j.ajhg.2023.06.006
dc.relation.url http://www.scopus.com/inward/record.url?scp=85164270154&partnerID=8YFLogxK
dc.contributor.department Faculty of Medicine
dc.contributor.department Other departments


Files in this item

This item appears in the following Collection(s)

Show simple item record