{"id":192762,"date":"2025-09-30T17:25:09","date_gmt":"2025-09-30T17:25:09","guid":{"rendered":"https:\/\/www.newsbeep.com\/us\/192762\/"},"modified":"2025-09-30T17:25:09","modified_gmt":"2025-09-30T17:25:09","slug":"limited-overlap-between-genetic-effects-on-disease-susceptibility-and-disease-survival","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/us\/192762\/","title":{"rendered":"Limited overlap between genetic effects on disease susceptibility and disease survival"},"content":{"rendered":"<p>Ethics statement<\/p>\n<p>This study was conducted in compliance with the relevant ethical guidelines and approved by the appropriate ethics committees. Details of the ethics committees of each participating biobank are provided in the Acknowledgements.<\/p>\n<p>Selection of diseases<\/p>\n<p>We selected nine common complex diseases spanning various disease categories for the analyses. The diseases are selected to meet following criteria: (1) have high epidemiological HR on mortality, so that mortality can be viewed as a reasonable prognosis; (2) constitute high global disease burden in terms of disability adjusted life years<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 44\" title=\"Abbafati, C. et al. Global burden of 369 diseases and injuries in 204 countries and territories, 1990&#x2013;2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet 396, 1204&#x2013;1222 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#ref-CR44\" id=\"ref-link-section-d32773356e2478\" rel=\"nofollow noopener\" target=\"_blank\">44<\/a>; (3) be relatively common (\u2009&gt;\u20091% prevalence) in population and have reasonable patient bodies in all biobanks and (4) be heritable and have large-scale GWAS available to construct PGSs. All disease endpoints were defined as a composition of ICD-10 codes curated by the clinical expert groups from FinnGen, Institute for Molecular Medicine Finland and Finnish Institute for Health and Welfare<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 19\" title=\"Kurki, M. I. et al. FinnGen provides genetic insights from a well-phenotyped isolated population. Nature 613, 508&#x2013;518 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#ref-CR19\" id=\"ref-link-section-d32773356e2482\" rel=\"nofollow noopener\" target=\"_blank\">19<\/a>. The same disease definitions, in terms of ICD-10 codes, were adopted by all participating biobanks to the maximum possible extent. See Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a> for a list of diseases and relevant descriptive statistics.<\/p>\n<p>Progression definition<\/p>\n<p>For all selected diseases, we defined mortality as our outcome. Precisely, we were interested in both all-cause mortalities, namely simple death status of the patient regardless of relevance to the disease, and disease-specific mortalities, meaning the death caused directly or indirectly by disease of interest specifically. Disease progression was evaluated as patients\u2019 survival from each type of mortality after being diagnosed with the disease. For all mortality GWASs, we consider only disease-specific mortality whenever possible for each participating biobank, whereas for the PGS analysis, both all-cause and disease-specific mortalities were evaluated. Similar to the disease endpoints, cause of death linked to each disease was also curated by clinical expert groups and defined in terms of ICD-10 codes<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 45\" title=\"World Health Organization ICD-10: International Statistical Classification of Diseases and Related Health Problems: Tenth Revision. Second Edition (WHO, 2004).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#ref-CR45\" id=\"ref-link-section-d32773356e2497\" rel=\"nofollow noopener\" target=\"_blank\">45<\/a>. The same definitions were systematically applied to all biobanks to the possible extent. See Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a> for definitions of cause-specific mortality for each disease of interest and available sample sizes from each biobank.<\/p>\n<p>Within-patient mortality GWAS<\/p>\n<p>To achieve variant-level effect comparison, a within-patient mortality GWAS was carried out for each selected disease using GATE26 for all biobanks, except Generation Scotland, which used SPACox<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 27\" title=\"Bi, W., Fritsche, L. G., Mukherjee, B., Kim, S. &amp; Lee, S. A fast and accurate method for genome-wide time-to-event data analysis and its application to UK Biobank. Am. J. Hum. Genet. 107, 222&#x2013;233 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#ref-CR27\" id=\"ref-link-section-d32773356e2512\" rel=\"nofollow noopener\" target=\"_blank\">27<\/a> as an alternative. The event of interest in this GWAS was patients\u2019 survival after disease diagnosis. For each disease of interest, GWAS was carried out separately within each ancestry group for biobanks that have a cause-specific mortality event count of 50 at minimum after quality control. Eligible individuals were restricted to patients having a follow-up time after diagnosis of three months (0.25 years) at minimum. We used the model below to examine SNP association with patients\u2019 survival:<\/p>\n<p>surv(duration of follow-up after diagnosis\u2009|\u2009disease-specific mortality)\u2009~\u2009SNP\u2009+\u2009patient\u2019s age of diagnosis\u2009+\u2009patient\u2019s birth year\u2009+\u2009sex\u2009+\u2009PCs\u2009+\u2009study-specific covariates,<\/p>\n<p>where study-specific covariates included other available nonheritable biobank-specific covariates, such as genotyping chip or batch.<\/p>\n<p>For analyses in the UK Biobank, to minimize potential impact of survivor bias, only patients with disease diagnosed after enrollment were considered.<\/p>\n<p>Results quality control and meta-analysis<\/p>\n<p>After conducting mortality GWAS for selected diseases within each contributing biobank, we then filtered the resulting summary statistics by imputation INFO scores and minor allele counts. We retained only variants with an imputation INFO score &gt;0.7 and at least 20 minor allele counts for each summary statistic. For GWAS summary statistics with a different human genome build, we used the UCSC LiftOver tool<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 46\" title=\"Kuhn, R. M., Haussler, D. &amp; Kent, W. J. The UCSC genome browser and associated tools. Brief. Bioinform. 14, 144&#x2013;161 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#ref-CR46\" id=\"ref-link-section-d32773356e2534\" rel=\"nofollow noopener\" target=\"_blank\">46<\/a> to convert their genome coordinates into the hg38 assembly. Subsequently, for each disease, we meta-analyzed GWAS results from each biobank using fixed-effect meta-analysis implemented in METAL<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 47\" title=\"Willer, C., Li, Y. &amp; Abecasis, G. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190&#x2013;2191 (2010).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#ref-CR47\" id=\"ref-link-section-d32773356e2538\" rel=\"nofollow noopener\" target=\"_blank\">47<\/a>, with which we also scanned for heterogeneity in effect sizes across different biobanks using Cochran\u2019s Q test. We applied an inverse-variance weighted meta-analysis scheme whenever possible. However, since SPACox does not have effect size or s.e. output, in Generation Scotland, we estimated direction of effect under a logistic regression model using PLINK<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 48\" title=\"Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559&#x2013;575 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#ref-CR48\" id=\"ref-link-section-d32773356e2545\" rel=\"nofollow noopener\" target=\"_blank\">48<\/a> and subsequently proceeded with a sample-size weighted meta-analysis using the Z-scores. This was done for four of the nine diseases for which Generation Scotland was one of the data sources: atrial fibrillation, breast cancer, coronary artery disease and type 2 diabetes.<\/p>\n<p>Variant-level effect size comparison<\/p>\n<p>We compared our mortality GWAS results for each disease of interest with large-scale published GWAS on diagnosis of the same disease. For disease diagnosis GWAS, we extracted SNP effects of reported genome-wide significant leading SNPs at independently associated loci from each study. For chronic kidney disease, a large GWAS on estimated glomerular filtration rate was considered<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 34\" title=\"Wuttke, M. et al. A catalog of genetic loci associated with kidney function from analyses of a million individuals. Nat. Genet. 51, 957&#x2013;972 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#ref-CR34\" id=\"ref-link-section-d32773356e2560\" rel=\"nofollow noopener\" target=\"_blank\">34<\/a>. Specifically, we examined the effect sizes of independent lead SNPs on the binary diagnosis of chronic kidney disease reported in the study, ensuring a more comparable scale of measurement. For our meta-analyzed mortality GWAS, we identify independent genome-wide loci using summary statistics based on conditional analysis implemented in GCTA-COJO. We merged 5,000 Finnish genomes, which is one of the largest GWAS cohorts in this study, with EUR from Human Genome Diversity Project as linkage disequilibrium (LD) reference for this step. To carry out the effect size comparison for all diseases, we reran the meta-analysis of mortality GWAS, excluding results from Generation Scotland due to the use of an incomparable GWAS approach for the cohort.<\/p>\n<p>Comparison of genetic architectures<\/p>\n<p>We compared genetic architectures between disease diagnosis and mortality in terms of SNP heritability estimated from the meta-analyzed mortality GWAS summary statistics using LD score regression<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 49\" title=\"Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291&#x2013;295 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#ref-CR49\" id=\"ref-link-section-d32773356e2572\" rel=\"nofollow noopener\" target=\"_blank\">49<\/a>. For eligible traits, that is, traits with nonzero estimated SNP heritability, we further analyzed genetic correlation across disease diagnosis, mortality, and general longevity GWAS using the same tool.<\/p>\n<p>Down-sampled GWAS on age of diagnosis<\/p>\n<p>To ensure heritability comparison between disease susceptibility and progression endpoints not being subject to power issues resulting from difference in sample sizes and GWAS models, for each disease of interest, we also ran time-to-event GWAS to find SNP association with age of diagnosis using a randomly down-sampled cohort which had comparable number of total individuals and event counts as what was available for the within-patient mortality GWAS. The down-sampled GWAS was carried out under the model below:<\/p>\n<p>surv(follow-up from birth until diagnosis\u2009|\u2009disease diagnosis)\u2009~\u2009SNP\u2009+\u2009patient\u2019s birth year\u2009+\u2009sex\u2009+\u2009PCs\u2009+\u2009study-specific covariates.<\/p>\n<p>This analysis was also carried out using GATE<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 26\" title=\"Dey, R. et al. Efficient and accurate frailty model approach for genome-wide survival association analysis in large-scale biobanks. Nat. Commun. 13, 5437 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#ref-CR26\" id=\"ref-link-section-d32773356e2590\" rel=\"nofollow noopener\" target=\"_blank\">26<\/a> but in FinnGen and UK Biobank only, which are two of the largest participating biobanks in this study (see Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a> for sample sizes).<\/p>\n<p>Computation of individual-level PGS<\/p>\n<p>For each selected disease, we derived variant weights for PGSs from GWAS summary statistics listed in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a> using MegaPRS<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 50\" title=\"Zhang, Q., Priv&#xE9;, F., Vilhj&#xE1;lmsson, B. &amp; Speed, D. Improved genetic prediction of complex traits from individual-level data or summary statistics. Nat. Commun. 12, 4192 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#ref-CR50\" id=\"ref-link-section-d32773356e2608\" rel=\"nofollow noopener\" target=\"_blank\">50<\/a>. Heritability contributed by each variant was estimated under the BLD-LDAK model as recommended. For weight estimation, we used the \u2018mega\u2019 option, which allows the software to determine the most appropriate model based on the data. Since we studied mortality, apart from the nine selected diseases, we also computed PGS weights for general longevity using the largest GWAS on lifespan<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 29\" title=\"Timmers, P. R. et al. Genomics of 1 million parent lifespans implicates novel pathways and common diseases and distinguishes survival chances. eLife 8, e39856 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#ref-CR29\" id=\"ref-link-section-d32773356e2612\" rel=\"nofollow noopener\" target=\"_blank\">29<\/a>. Due to the heterogeneous and polygenic nature of lifespan, we used the LDAK-Thin model for SNP-level heritability estimation for this trait instead. Unlike the BLD-LDAK model used in variant weighting for other diseases, LDAK-Thin model does not take functional annotations into account but estimates SNP heritability only as function of SNP allele frequencies and local linkage structures. Variant weights were derived for 1,330,820 common SNPs (minor allele frequency\u2009&gt;\u20090.1) lying in the intersection of HapMap3 (ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 51\" title=\"International HapMap 3 Consortium et al. Integrating common and rare genetic variation in diverse human populations. Nature 467, 52&#x2013;58 (2010).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#ref-CR51\" id=\"ref-link-section-d32773356e2616\" rel=\"nofollow noopener\" target=\"_blank\">51<\/a>) and 1000 Genomes<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 52\" title=\"1000 Genomes Project Consortium A global reference for human genetic variation. Nature 526, 68&#x2013;74 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#ref-CR52\" id=\"ref-link-section-d32773356e2620\" rel=\"nofollow noopener\" target=\"_blank\">52<\/a> that are available for each GWAS summary statistic.<\/p>\n<p>Once the SNP weights were derived, individual-level PGSs for each disease and general longevity were subsequently computed as a weighted sum of effect allele counts using PLINK<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 48\" title=\"Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559&#x2013;575 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#ref-CR48\" id=\"ref-link-section-d32773356e2627\" rel=\"nofollow noopener\" target=\"_blank\">48<\/a>. Scores were standardized to have 0 mean and 1 as variance within each ancestry group.<\/p>\n<p>For the composite mortality PGS, we used sex-stratified SNP weights developed by ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 30\" title=\"Meisner, A. et al. Combined utility of 25 disease and risk factor polygenic risk scores for stratifying risk of all-cause mortality. Am. J. Hum. Genet. 107, 418&#x2013;431 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#ref-CR30\" id=\"ref-link-section-d32773356e2634\" rel=\"nofollow noopener\" target=\"_blank\">30<\/a>. Scores for males and females were computed separately and subsequently combined during the association step to obtain a population effect estimate.<\/p>\n<p>Association between PGS and disease of interest<\/p>\n<p>As a baseline, we first examined whether the disease PGSs were associated with their diagnoses. For each selected disease, the association was first tested using a general linear model on case\u2013control status as below:<\/p>\n<p>logit(Pr(Individual is diagnosed))\u2009~\u2009disease PGS\u2009+\u2009birth year\u2009+\u2009sex\u2009+\u2009PC1-10.<\/p>\n<p>To achieve a fairer comparison with the other experiments, we also evaluated such relationship using a survival model on the age of diagnosis as below:<\/p>\n<p>surv(follow-up from birth until diagnosis\u2009|\u2009disease diagnosis)\u2009~\u2009disease PGS\u2009+\u2009birth year\u2009+\u2009sex\u2009+\u2009PC1-10.<\/p>\n<p>The two analyses above were conducted using all eligible individuals from the biobanks. Then, for each selected disease, we extracted only the patient group for further analysis. To reduce noise in measurements, we limited these within-patient analyses to individuals having a follow-up time of at least three months (0.25 years) after the diagnosis. We tested the association of disease PGSs with our defined prognosis, namely patient survival, using the model below:<\/p>\n<p>surv(duration of follow-up after diagnosis\u2009|\u2009mortality)\u2009~\u2009disease PGS\u2009+\u2009birth year\u2009+\u2009sex\u2009+\u2009PC1-10\u2009+\u2009age of diagnosis,<\/p>\n<p>as well as the association of general longevity PGS with patient survival as below:<\/p>\n<p>surv(duration of follow-up after diagnosis\u2009|\u2009mortality)\u2009~\u2009general longevity PGS\u2009+\u2009birth year\u2009+\u2009sex\u2009+\u2009PC1-10\u2009+\u2009age of diagnosis.<\/p>\n<p>For both associations, we examined both all-cause mortality and cause-specific mortality within the patient group. All analyses were corrected for sex, except in analyses for breast cancer and prostate cancer, where only female\/male individuals were used.<\/p>\n<p>These analyses were carried out independently for each ancestry group within each participating biobank. We only included biobanks where the count of events of interest in the analyzed ancestry group was 50 or more. We subsequently meta-analyzed effect sizes for the same ancestry group across biobanks using the inverse-variance weighted approach.<\/p>\n<p>Mortality PGSs and their performance in FinnGen<\/p>\n<p>For diseases with sufficient power, we derived mortality PGS weights using meta-analyzed mortality GWAS results of European populations from all available biobanks, except for FinnGen or Generation Scotland. Apart from FinnGen, which was used as a test cohort, we also left out results from Generation Scotland for this analysis because their summary statistics did not have effect size or s.e. and therefore cannot be used for inverse-variance weighted meta-analysis, which returns necessary statistics for weight derivation. After deriving PGS weights using MegaPRS<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 50\" title=\"Zhang, Q., Priv&#xE9;, F., Vilhj&#xE1;lmsson, B. &amp; Speed, D. Improved genetic prediction of complex traits from individual-level data or summary statistics. Nat. Commun. 12, 4192 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#ref-CR50\" id=\"ref-link-section-d32773356e2683\" rel=\"nofollow noopener\" target=\"_blank\">50<\/a>, we subsequently computed individual-level disease-mortality PGS for patients of each corresponding disease within FinnGen cohort. The weights and scores are computed in the same manner as mentioned in the \u2018Computation of individual-level PGS\u2019. We evaluated the effects of these scores on predicting patients\u2019 disease mortality in FinnGen using the model below:<\/p>\n<p>surv(duration of follow-up after diagnosis\u2009|\u2009mortality)\u2009~\u2009disease-mortality PGS\u2009+\u2009birth year\u2009+\u2009sex\u2009+\u2009PC1-10\u2009+\u2009age of diagnosis<\/p>\n<p>Sensitivity analyses for PGS experiments<\/p>\n<p>We ran a series of sensitivity analyses in eligible biobanks to ensure our observations on the PGSs association were robust, under considerations listed below. Similarly, analyses were conducted for each eligible ancestry within each biobank and then meta-analyzed.<\/p>\n<p>First, to demonstrate the impact of relevance between disease progression and susceptibility as shown in our theories, we examined the association between susceptibility PGS and all-cause mortality and compared the results with disease-specific mortality in FinnGen (see Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">10<\/a> for these results). We then considered other factors that may bias the results.<\/p>\n<p>Survivor bias<\/p>\n<p>Depending on each biobank\u2019s recruitment scheme, some patients were diagnosed before the start of their follow-up, which may lead to biased results due to the survivor effect. Therefore, we also ran these analyses for each disease using only samples from individuals enrolled before their first onset of the disease of interest (see Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">11a<\/a> for these results).<\/p>\n<p>Relevance between cause of mortality in death certificate and disease diagnosis<\/p>\n<p>In this study, we aimed to define disease progression as accurately as possible by focusing our analysis on disease-caused mortality. However, some national death registries may not precisely capture the immediate cause of death, and some mortalities, while documented with the disease as one of the causes, may not be truly relevant to the diagnosed disease. To address this concern, we ran the same analysis using only patients with a restricted maximum follow-up length, since death taking place reasonably soon after being diagnosed might have more to do with the diagnosis, compared to death taking place decades after. Under this consideration, we varied the maximum duration of follow-up after diagnosis by 2, 5 or 10 years. The minimum is still 0.25 years for this analysis (see Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">11b<\/a> and Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">8<\/a> for these results; see also Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a> for sample size breakdown by duration of follow-up in each biobank). To facilitate comparability between results, we reported the regression coefficients for PGS effect sizes on nine diseases for each sensitivity analysis and the main results.<\/p>\n<p>The effect of diagnosed age<\/p>\n<p>As shown above, we included the age of diagnosis as one of the covariates in all within-patient main analysis models to specifically investigate PGSs\u2019 unique genetic effect on disease progression by correcting for the diagnosis. As part of our sensitivity analysis, we also examined the role of these diagnosed ages in more detail. We repeated all the within-patient analyses for each disease by stratifying patients into early onset and late onset groups using 50% age of diagnosis quantile as a cutoff and compared the PGS effects across the two groups (see Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">12<\/a> and Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">9<\/a> for these results).<\/p>\n<p>Sample relatedness<\/p>\n<p>We included all eligible individuals of each biobank in our main analysis, and one may argue that this could impact our effect size estimates. Therefore, we ran the same analysis in FinnGen with up to second-degree relatives removed (see Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">13<\/a> and Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">7<\/a> for these results).<\/p>\n<p>Results from non-European ancestry populations<\/p>\n<p>Since only patients were considered for most of our analyses, although some of the biobanks (for example, UK Biobank and BioMe) were known to be rather diverse, we ended up with enough power for the main results only for the European super-population. Nevertheless, comparison of results with other less powered but available populations can be found in Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">15<\/a> for reference.<\/p>\n<p>Forest plot for effects from each biobank is presented in Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">14<\/a>.<\/p>\n<p>Alternative progression definitions for type 2 diabetes<\/p>\n<p>For type 2 diabetes, we explored the genetics of two additional widely considered progressions\u2014macrovascular and microvascular complications. For macrovascular complications, we only consider patients who did not have any coronary artery disease, stroke or peripheral arterial disease incidents before the onset of type 2 diabetes. Among those, we define the ones having at least one of the aforementioned diagnoses after type 2 diabetes as cases for macrovascular complications. Event time is defined as the duration from a patient\u2019s diagnosis of type 2 diabetes to the earliest diagnosis of a macrovascular complication. Similarly, for microvascular complications, we consider onset of diabetic retinopathy, nephropathy and neuropathy after the patients\u2019 diagnosis of type 2 diabetes. For both definitions of progression, our analysis only included individuals with &gt;0.25 year of follow-up, meaning the patients\u2019 death\/onset of progression\/biobank censoring take place &gt;0.25 year after their diagnosis of type 2 diabetes.<\/p>\n<p>For macrovascular complications, for which we identified genome-wide significant signals among diabetic patients, we further carried out a down-sampled time-to-event GWAS on population-comparable phenotypes, matching the case\u2013control count in the progression GWAS. For this down-sampled GWAS, we considered onset of coronary artery disease, stroke, or peripheral arterial disease in nondiabetic population.<\/p>\n<p>Simulation to explore the impact of index event bias<\/p>\n<p>Please see section \u2018Simulation to explore the impact of index event bias\u2019 from <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">Supplementary Note<\/a> for details.<\/p>\n<p>Reporting summary<\/p>\n<p>Further information on research design is available in the <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02342-8#MOESM2\" rel=\"nofollow noopener\" target=\"_blank\">Nature Portfolio Reporting Summary<\/a> linked to this article.<\/p>\n","protected":false},"excerpt":{"rendered":"Ethics statement This study was conducted in compliance with the relevant ethical guidelines and approved by the appropriate&hellip;\n","protected":false},"author":2,"featured_media":192763,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[50],"tags":[2342,13114,258,8869,38738,13113,257,23710,200,3870,79],"class_list":{"0":"post-192762","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-genetics","8":"tag-agriculture","9":"tag-animal-genetics-and-genomics","10":"tag-biomedicine","11":"tag-cancer-research","12":"tag-clinical-genetics","13":"tag-gene-function","14":"tag-general","15":"tag-genetic-association-study","16":"tag-genetics","17":"tag-human-genetics","18":"tag-science"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts\/192762","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/comments?post=192762"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts\/192762\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/media\/192763"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/media?parent=192762"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/categories?post=192762"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/tags?post=192762"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}