{"id":508109,"date":"2026-04-02T01:50:19","date_gmt":"2026-04-02T01:50:19","guid":{"rendered":"https:\/\/www.newsbeep.com\/uk\/508109\/"},"modified":"2026-04-02T01:50:19","modified_gmt":"2026-04-02T01:50:19","slug":"large-scale-exome-analyses-reveal-new-rare-variant-contributions-in-amyotrophic-lateral-sclerosis","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/uk\/508109\/","title":{"rendered":"Large-scale exome analyses reveal new rare variant contributions in amyotrophic lateral sclerosis"},"content":{"rendered":"<p>Building a harmonized ALS exome dataset for rare variant analysis<\/p>\n<p>To identify rare coding variants involved in ALS, we harmonized 18 whole-exome (WXS) and whole-genome (WGS) sequencing datasets into a discovery cohort totaling 94,545 people. All data were realigned uniformly to GRCh38 and called jointly using the functional equivalence pipeline<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 10\" title=\"Van der Auwera, G. D. &amp; O&#x2019;Connor, B. D. Genomics in the Cloud: Using Docker, GATK, and WDL in Terra (O&#x2019;Reilly Media, Inc., 2020).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#ref-CR10\" id=\"ref-link-section-d66612860e3171\" rel=\"nofollow noopener\" target=\"_blank\">10<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 11\" title=\"Regier, A. A. et al. Functional equivalence of genome sequencing analysis pipelines enables harmonized variant calling across human genetics projects. Nat. Commun. 9, 4038 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#ref-CR11\" id=\"ref-link-section-d66612860e3174\" rel=\"nofollow noopener\" target=\"_blank\">11<\/a>, substantially reducing technical variation (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">1a<\/a>). Moreover, the distributions of exome-wide URV counts were aligned between ancestry-matched WGS (Project MinE) and WXS (UK Biobank) samples, indicating that sequencing technologies were comparable after joint processing and quality control (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">1b<\/a>). Following strict filtering, the final dataset comprised 13,138 unrelated cases and 69,775 controls of predominantly European ancestry, with 5,207,138 variants (2,367,861 predicted moderate or high impact; Supplementary Figs. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>\u2013<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">4<\/a>).<\/p>\n<p>Rare single-variant analyses identify five new risk variants and largely recapitulate known rare variant architecture of ALS<\/p>\n<p>We conducted single-variant analyses of 272,925 rare variants that fell within our testable minor allele frequency (MAF) range (5\u2009\u00d7\u200910\u22125\u2009&lt;\u2009MAF\u2009&lt;\u20090.05) while also satisfying variant effect prediction criteria of either moderate- (missense mutations, in-frame deletions and untranslated region (UTR) truncations) or high-impact (nonsense, splice acceptor\/donor and frameshift mutations) annotations. For each variant, we used Firth\u2019s logistic regression to test for an association between ALS status and minor allele count (MAC), adjusting for sex, ten principal components (PCs) and the total number of rare synonymous variants in each person<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 12\" title=\"Firth, D. Bias reduction of maximum likelihood estimates. Biometrika 80, 27&#x2013;38 (1993).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#ref-CR12\" id=\"ref-link-section-d66612860e3200\" rel=\"nofollow noopener\" target=\"_blank\">12<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 13\" title=\"Hop, P. J. &amp; Kenna, K. P. RVAT: Rare Variant Association Toolkit. GitHub &#010;                https:\/\/github.com\/kennalab\/rvat&#010;                &#010;               (2026).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#ref-CR13\" id=\"ref-link-section-d66612860e3203\" rel=\"nofollow noopener\" target=\"_blank\">13<\/a>. The resulting test statistics showed no systematic inflation, indicating no residual confounding (\u03bb1,000\u2009=\u20091.01), and significant variants passed subsequent validation and sensitivity analyses (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig3\" rel=\"nofollow noopener\" target=\"_blank\">1a,b<\/a>).<\/p>\n<p>We identified 15 exome-wide significant variants across 11 distinct genes (P\u2009&lt;\u20091.83\u2009\u00d7\u200910\u22127; Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig1\" rel=\"nofollow noopener\" target=\"_blank\">1a<\/a>, Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Tab1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>, Extended Data Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Tab3\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>, Extended Data Figs. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig3\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig4\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a> and Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>), for all of which the minor allele was associated with increased ALS risk (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig1\" rel=\"nofollow noopener\" target=\"_blank\">1c<\/a>). Among the 15 associated variants, 10 were located in genes previously shown to be related to ALS: SOD1, CFAP410, NEK1, KIF5A, FUS and TBK1 (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig1\" rel=\"nofollow noopener\" target=\"_blank\">1a<\/a> and Extended Data Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Tab3\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>). The remaining five have not been reported previously in ALS (Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Tab1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>). These encompass intermediate frequency variants with modest effect size, including HTR3C p.T186A (odds ratio (OR)\u2009=\u20093.41, P\u2009=\u20091.87\u2009\u00d7\u200910\u22128) and YKT6 p.Y64C (OR\u2009=\u20092.84, P\u2009=\u20099.08\u2009\u00d7\u200910\u22128) as well as rare variants with high effect size, including GBGT1 p.R152L (OR\u2009=\u200926.9, P\u2009=\u20091.68\u2009\u00d7\u200910\u221210), CAPN2 p.I530V (OR\u2009=\u200925.3, P\u2009=\u20093.66\u2009\u00d7\u200910\u22129), and KNTC1 p.W287R (OR\u2009=\u200927.7, P\u2009=\u20091.07\u2009\u00d7\u200910\u22127).<\/p>\n<p>Fig. 1: Rare single-variant analyses.<img decoding=\"async\" aria-describedby=\"figure-1-desc\" src=\"https:\/\/www.newsbeep.com\/uk\/wp-content\/uploads\/2026\/04\/41588_2026_2535_Fig1_HTML.png\" alt=\"Fig. 1: Rare single-variant analyses.\" loading=\"lazy\" width=\"685\" height=\"429\"\/><\/p>\n<p>a, y\u2009axis: exome-wide single variant associations estimated using Firth\u2019s logistic regression with profile penalized likelihood CIs (\u2212log10(P)); x\u2009axis: genomic coordinates (GRCh38). Dashed line: exome-wide significance threshold (P\u2009&lt;\u20091.83\u2009\u00d7\u200910\u22127). New variants are highlighted in orange. b, Rare single-variant analyses among ALS-linked genes curated by the ALS GCEP. y\u2009axis: single-variant associations estimated using Firth\u2019s logistic regression with profile penalized likelihood CIs (\u2212log10(P)); x\u2009axis: genomic coordinates (GRCh38). Variants are colored by the clinical validity classification as curated by the ALS GCEP. Lower dashed line: significance threshold across variants in ALS-linked genes (P\u2009&lt;\u20093.20\u2009\u00d7\u200910\u22125); upper dashed line: exome-wide significance threshold as presented in a. c, ORs (y axis) and 95% CIs (gray shaded area) plotted against the risk allele frequency in controls (x\u2009axis) for significant variants identified in either the exome-wide or GCEP analysis. For variants where the control risk allele frequency was 0, it was set to half the lowest nonzero risk allele frequency observed in the control group. P\u2009values are two-tailed and are presented uncorrected for multiple testing.<\/p>\n<p>Table 1 New rare single variants achieving significance<\/p>\n<p>We also performed a targeted analysis of variants within 51 ALS-linked genes curated by the ALS Gene Curation Expert Panel (GCEP)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 14\" title=\"Dilliott, A. A. et al. Clinical testing panels for ALS: global distribution, consistency, and challenges. Amyotroph. Lateral Scler. Front. Degener. 24, 420&#x2013;435 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#ref-CR14\" id=\"ref-link-section-d66612860e3987\" rel=\"nofollow noopener\" target=\"_blank\">14<\/a>. To ensure the inclusion of the full set of GCEP-curated genes, we did not apply the per-supercohort call-rate filter for this analysis, allowing for the assessment of genes exhibiting subpar call-rates in certain subcohorts. This identified eight additional variants across six genes (P\u2009&lt;\u20093.20\u2009\u00d7\u200910\u22125; Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig1\" rel=\"nofollow noopener\" target=\"_blank\">1b,c<\/a>, Extended Data Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Tab3\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a> and Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM5\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>), including variants in genes that were not detected in the exome-wide analysis (ARPP21, ANXA11, UBQLN2 and TARDBP). For all identified variants, the minor allele was associated with increased ALS risk (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig1\" rel=\"nofollow noopener\" target=\"_blank\">1c<\/a> and Extended Data Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Tab3\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>). We provide independent evidence for two rare variants in ARPP21 (p.P563L and p.P747L)\u2014a gene that is currently considered as having limited evidence according to GCEP (p.P563L: OR\u2009=\u200944.8, P\u2009=\u20092.55\u2009\u00d7\u200910\u221210; p.P747L: OR\u2009=\u200975.8, P\u2009=\u20091.45\u2009\u00d7\u200910\u22126) (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig1\" rel=\"nofollow noopener\" target=\"_blank\">1b,c<\/a>). Of note, the ARPP21 p.P563L variant had subpar call-rates in some exome cohorts. However, even when restricting the analysis to cohorts meeting stringent call-rate thresholds, the association remained exome-wide significant with a similar odds ratio (P\u2009=\u20091.09\u2009\u00d7\u200910\u22128, OR\u2009=\u200938.1; Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig3\" rel=\"nofollow noopener\" target=\"_blank\">1c,d<\/a>).<\/p>\n<p>Principal component analysis (PCA) suggested a mixed pattern of geographical distribution for carriers of the identified variants (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">5<\/a>). For some variants, we observed that carriers exhibited relatively tight clustering in PCA space. This was observed for both well-established ALS variants such as UBQLN2 p.P509S (Sweden) and SOD1 p.A5V (USA), as well for the new CAPN2 p.I540V variant (the Netherlands). Conversely, other variants were distributed more uniformly across patient populations (for example, YKT6 p.Y64C and ARPP21 p.P563L). In silico pathogenicity prediction tools also yielded varying annotations for both previously established and new ALS-associated variants (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>). Nonetheless, we observed that YKT6 p.Y64C was consistently predicted as damaging by all predictors, and KNTC1 p.W287R was predicted as damaging by all but SIFT.<\/p>\n<p>Ultrarare burden analyses identify new ALS-associated genes<\/p>\n<p>To detect associations among URVs (five or fewer carriers), we performed burden tests using Firth\u2019s logistic regression to evaluate their cumulative effects. URVs were aggregated across several functional units, including genes and protein domains. To enrich for potentially pathogenic variants, we used four filtering strategies based on two criteria: (1) variant frequency\u2014either all URVs or singleton-only variants; (2) variant impact\u2014either only high-impact variants or both high- and moderate-impact variants. Tests across these filtering strategies were combined using the ACAT omnibus test<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 15\" title=\"Liu, Y. et al. ACAT: a fast and powerful p value combination method for rare-variant analysis in sequencing studies. Am. J. Hum. Genet. 104, 410&#x2013;421 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#ref-CR15\" id=\"ref-link-section-d66612860e4092\" rel=\"nofollow noopener\" target=\"_blank\">15<\/a>. We observed no evidence of genomic inflation in any of the analyses performed (gene \u03bb1,000\u2009=\u20091.011, domain \u03bb1,000\u2009=\u20091.006; Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">3a<\/a>), and all presented genes passed subsequent sensitivity analyses (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">3e\u2013g<\/a>).<\/p>\n<p>URV gene burden analyses across 17,324 protein-coding genes identified eight genes that reached exome-wide significance (P\u2009&lt;\u20092.89\u2009\u00d7\u200910\u22126) (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig2\" rel=\"nofollow noopener\" target=\"_blank\">2a<\/a>, Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Tab2\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>, Extended Data Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Tab4\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>, Extended Data Figs. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig4\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">3<\/a>, Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM5\" rel=\"nofollow noopener\" target=\"_blank\">3<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM5\" rel=\"nofollow noopener\" target=\"_blank\">4<\/a> and Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">6<\/a>). Among these were four established ALS genes: SOD1 (P\u2009&lt;\u20091\u2009\u00d7\u200910\u221216), TBK1 (P\u2009&lt;\u20091\u2009\u00d7\u200910\u221216), NEK1 (P\u2009=\u20096.49\u2009\u00d7\u200910\u221213) and TARDBP (P\u2009=\u20095.02\u2009\u00d7\u200910\u22128) (Extended Data Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Tab4\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>). Furthermore, we identified DNAJC7 (P\u2009=\u20098.77\u2009\u00d7\u200910\u22128), which is currently classified as having limited evidence (ClinGen gene curation panel<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 14\" title=\"Dilliott, A. A. et al. Clinical testing panels for ALS: global distribution, consistency, and challenges. Amyotroph. Lateral Scler. Front. Degener. 24, 420&#x2013;435 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#ref-CR14\" id=\"ref-link-section-d66612860e4190\" rel=\"nofollow noopener\" target=\"_blank\">14<\/a>), and here reaches exome-wide significance for the first time in an exome-wide discovery analysis. New candidate genes included TTC3 (P\u2009=\u20094.16\u2009\u00d7\u200910\u22127), UNC13C (P\u2009=\u20092.80\u2009\u00d7\u200910\u22127) and KIF4A (P\u2009=\u20091.62\u2009\u00d7\u200910\u22126), in all of which higher URV burden increased risk of ALS (Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Tab2\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a> and Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">3b<\/a>). A targeted analysis among the 51 ALS-linked genes classified by GCEP also revealed a significant association for OPTN (P\u2009=\u20091.56\u2009\u00d7\u200910\u22125), which is classified by GCEP as a definitive ALS gene (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig2\" rel=\"nofollow noopener\" target=\"_blank\">2b<\/a> and Extended Data Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Tab4\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>).<\/p>\n<p>Fig. 2: URV burden analyses.<img decoding=\"async\" aria-describedby=\"figure-2-desc\" src=\"https:\/\/www.newsbeep.com\/uk\/wp-content\/uploads\/2026\/04\/41588_2026_2535_Fig2_HTML.png\" alt=\"Fig. 2: URV burden analyses.\" loading=\"lazy\" width=\"685\" height=\"393\"\/><\/p>\n<p>a, y\u2009axis: exome-wide gene-based URV associations (\u2212log10(P)); x\u2009axis: genomic coordinates (GRCh38). Dashed line: exome-wide significance threshold (P\u2009&lt;\u20092.9\u2009\u00d7\u200910\u22126). New risk genes are highlighted in orange. b, URV burden analyses among ALS-linked genes curated by the ALS GCEP. y\u2009axis: gene-based URV associations (\u2212log10(P)); x\u2009axis: genomic coordinates (GRCh38). Lower dashed line: significance threshold across ALS-linked genes (P\u2009&lt;\u20091\u2009\u00d7\u200910\u22123); upper dashed line: exome-wide significance threshold as presented in a. c, Domain-based URV analyses. y\u2009axis: domain associations (\u2212log10(P)); x\u2009axis: genomic coordinates (GRCh38). Dashed line: exome-wide significance threshold (P\u2009&lt;\u20097.68\u2009\u00d7\u200910\u22127). d, Association P\u2009values for URV geneset burden analyses excluding exome-wide significant genes (y\u2009axis) versus including exome-wide significant genes (x\u2009axis). The dashed lines indicate the multiple testing threshold (P &lt; 4.25 \u00d7 10\u22126). Association statistics were estimated using Firth\u2019s logistic regression with profile penalized likelihood CIs. P\u2009values are from the ACAT omnibus test combining the four variant filtering strategies (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"section anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Sec11\" rel=\"nofollow noopener\" target=\"_blank\">Methods<\/a>) and are two-tailed and uncorrected for multiple testing.<\/p>\n<p>Table 2 New genes achieving exome-wide significance in URV burden analyses<\/p>\n<p>The URV domain analyses across 65,071 domains identified three partially overlapping domains in TBK1 (protein kinase, kinase-like and CCD1 domains), one domain in SOD1 (SOD_Cu\/Zn_BS domain) and one domain in VCP (CDC48 domain\u20092-like domain) at exome-wide significance (P\u2009&lt;\u20097.68\u2009\u00d7\u200910\u22127; Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig2\" rel=\"nofollow noopener\" target=\"_blank\">2c<\/a>, Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">3c<\/a> and Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM5\" rel=\"nofollow noopener\" target=\"_blank\">5<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM5\" rel=\"nofollow noopener\" target=\"_blank\">6<\/a>). Unlike SOD1 and TBK1, VCP did not reach significance in the whole-gene analysis (Pgene\u2009=\u20098.09\u2009\u00d7\u200910\u22123), suggesting that the CDC48 domain\u20092-like region harbors the primary association signal with a markedly stronger effect (Pdomain\u2009=\u20092.16\u2009\u00d7\u200910\u22127). This domain constitutes the second subdomain of the N-terminal domain, in which most known pathogenic mutations are concentrated<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 16\" title=\"Scarian, E. et al. The role of VCP mutations in the spectrum of amyotrophic lateral sclerosis&#x2014;frontotemporal dementia. Front. Neurol. 13, 841394 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#ref-CR16\" id=\"ref-link-section-d66612860e4803\" rel=\"nofollow noopener\" target=\"_blank\">16<\/a>.<\/p>\n<p>Across burden analyses, ORs were generally similar when including all URVs compared to including singletons only, with the notable exceptions of NEK1 and KIF4A, which showed markedly higher ORs in the singleton-only analyses (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">3b\u2013d<\/a>). The observed associations were driven primarily by moderate-impact variants: NEK1 and TBK1 were the only genes showing a significant signal when analyses were restricted to high-impact variants (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">3k\u2013m<\/a>), although some signal among high-impact variants was observed for DNAJC7 and OPTN. Single nucleotide variants (SNVs) were the primary drivers of the associations, with insertions\/deletions (INDELs) contributing substantially to the association P\u2009values only for NEK1 and DNAJC7 (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">3h\u2013j<\/a>). For UNC13C, TTC3 and OPTN, we identified a small subset of people carrying two URVs, whereas for TBK1 and NEK1, we found people with both a URV and a more common (0.01\u2009&lt;\u2009MAF\u2009&lt;\u20090.05) risk variant (p.V464A and p.R261H, respectively). No increased risk was observed in these cases, although this may be due to the low number of co-occurrences (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig6\" rel=\"nofollow noopener\" target=\"_blank\">4a<\/a>).<\/p>\n<p>Assessing geneset burden and variant co-occurrence<\/p>\n<p>We performed URV geneset burden analyses across 11,777 Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) and Reactome genesets from the Molecular Signatures Database (MSigDB v.7.5)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 17\" title=\"Liberzon, A. et al. Molecular signatures database (MSigDB) 3.0. Bioinformatics 27, 1739&#x2013;1740 (2011).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#ref-CR17\" id=\"ref-link-section-d66612860e4871\" rel=\"nofollow noopener\" target=\"_blank\">17<\/a>, using the same procedure as the single gene analyses (\u03bb1,000\u2009=\u20091.006, Supplementary Data\u2009<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM5\" rel=\"nofollow noopener\" target=\"_blank\">7<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM5\" rel=\"nofollow noopener\" target=\"_blank\">8<\/a>). After excluding genesets driven solely by one highly significant gene, two genesets remained significant: \u2018GOBP: regulation of mRNA splicing via spliceosome\u2019 (<a href=\"http:\/\/amigo.geneontology.org\/amigo\/term\/GO:0048024\" rel=\"nofollow noopener\" target=\"_blank\">GO:0048024<\/a>, 96 genes, P\u2009=\u20092.97\u2009\u00d7\u200910\u22127) and its parent term \u2018GOBP: regulation of RNA splicing\u2019 (<a href=\"http:\/\/amigo.geneontology.org\/amigo\/term\/GO:0043484\" rel=\"nofollow noopener\" target=\"_blank\">GO:0043484<\/a>, 142 genes, P\u2009=\u20093.50\u2009\u00d7\u200910\u22126) (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig2\" rel=\"nofollow noopener\" target=\"_blank\">2d<\/a> and Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">3d<\/a>). As \u2018regulation of mRNA splicing via spliceosome\u2019 is a subset of \u2018regulation of RNA splicing,\u2019 we performed a conditional analysis to assess its independent contribution. This revealed that residual signal remains in \u2018regulation of mRNA splicing via spliceosome\u2019 (P\u2009=\u20090.0084), suggesting it captures a more specific association within this pathway. Among the 153 unique genes across these two genesets, 30 reached nominal significance (P\u2009&lt;\u20090.05), with top genes including HSPA8, HABP4, NOVA2, HNRNPL and SNW1 (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">7a<\/a>). We also performed a geneset analysis among the 51 ALS-linked genes curated by GCEP<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 14\" title=\"Dilliott, A. A. et al. Clinical testing panels for ALS: global distribution, consistency, and challenges. Amyotroph. Lateral Scler. Front. Degener. 24, 420&#x2013;435 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#ref-CR14\" id=\"ref-link-section-d66612860e4942\" rel=\"nofollow noopener\" target=\"_blank\">14<\/a>. As expected, this showed that the \u2018Definitive\u2019 category was highly significant (P\u2009&lt;\u20091\u2009\u00d7\u200910\u221216) across allele frequency thresholds, whereas the \u2018Limited\u2019 category showed only modest enrichment (P\u2009=\u20090.0015), and no enrichment was seen among the other categories (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">7b,c<\/a>).<\/p>\n<p>We next examined whether carrying several variants among \u2018Definitive\u2019 ALS genes as classified by GCEP confers cumulative risk. We observed a clear dose\u2013response relationship across low-frequency variants (MAF\u2009&lt;\u20090.05): the OR increased progressively as people carried one (OR\u2009=\u20091.19, P\u2009=\u20092.11\u2009\u00d7\u200910\u221215), two (OR\u2009=\u20091.35, P\u2009=\u20098.43\u2009\u00d7\u200910\u221213), three (OR\u2009=\u20091.84, P\u2009=\u20092.78\u2009\u00d7\u200910\u22128) or four (OR\u2009=\u20094.26, P\u2009=\u20095.35\u2009\u00d7\u200910\u22125) qualifying variants (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig6\" rel=\"nofollow noopener\" target=\"_blank\">4c<\/a> and Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM5\" rel=\"nofollow noopener\" target=\"_blank\">9<\/a>). This relationship persisted when burden was assessed at the gene level, where several variants within the same gene were counted as a single event (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig6\" rel=\"nofollow noopener\" target=\"_blank\">4d<\/a> and Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM5\" rel=\"nofollow noopener\" target=\"_blank\">9<\/a>). Analyses restricted to rarer variants were underpowered due to the low number of people carrying several variants (Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM5\" rel=\"nofollow noopener\" target=\"_blank\">9<\/a>). We did not observe a similar dose\u2013response relationship when we tested for an association with age at onset and survival (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">8<\/a>).<\/p>\n<p>We next focused on co-occurrence among the specific risk variants identified in this study. Focusing on single variants in \u2018Definitive\u2019 GCEP genes, we found that 11.1% of cases carried one variant and 0.54% carried two, whereas the co-occurrence of three or more variants was not observed (Extended Data Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Tab5\" rel=\"nofollow noopener\" target=\"_blank\">3<\/a>). When including variants in genes with \u2018Limited\u2019 evidence and new single variants identified in this study, the proportions increased to 14.5% for one, 1.1% for two and 0.0076% for three variants. The proportions increased further to 18.2%, 1.7% and 0.099%, respectively, when also including qualifying variants from the URV burden analyses. Finally, when C9orf72 repeat expansion status was also considered (available for 66% of cases), these totals rose to 23.5%, 3.12% and 0.22%, respectively, totaling 26.9% of cases. The observed co-occurrence rates did not deviate from those expected under an additive model using permutation analyses (P\u2009=\u20090.39). When examining specific variant pairs, we observed numerous instances of cases carrying several variants (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig7\" rel=\"nofollow noopener\" target=\"_blank\">5<\/a> and Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">9<\/a>). For example, 20% of C9orf72 repeat expansion carriers harbored additional risk variants. Furthermore, some pairs, including CFAP410 p.V58L\u2009\u00d7\u2009NEK1 p.R261H, showed trends suggestive of a synergistic effect (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig6\" rel=\"nofollow noopener\" target=\"_blank\">4e<\/a>). To formally test whether any of these pairs showed nonadditive effects, we performed pairwise co-occurrence and interaction analyses. No pairs reached significance after correction for multiple testing (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig7\" rel=\"nofollow noopener\" target=\"_blank\">5<\/a> and Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">9<\/a>). This was consistent with our power calculations (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">10<\/a>), which showed that the study was underpowered to detect all but the largest deviations from additivity for specific variant pairs, and then only for pairs including at least one low-frequency variant (0.01\u2009&lt;\u2009MAF\u2009&lt;\u20090.05).<\/p>\n<p>                        ARPP21 p.P563L is associated with earlier disease onset and shorter disease duration<\/p>\n<p>To assess the impact of genetic variants on disease progression, we analyzed survival and age at onset across candidate genes and variants (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig8\" rel=\"nofollow noopener\" target=\"_blank\">6<\/a> and Supplementary Data <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM5\" rel=\"nofollow noopener\" target=\"_blank\">10<\/a>). Consistent with previous reports, SOD1 p.A5V and p.D91A were associated significantly with a lower age at onset (p.A5V: B\u2009=\u2009\u22129.44, P\u2009=\u20095.61\u2009\u00d7\u200910\u22124; p.D91A: B\u2009=\u2009\u22124.82, P\u2009=\u20091.11\u2009\u00d7\u200910\u22125), with p.A5V linked to shorter survival and p.D91A to longer survival (p.A5V: hazard ratio (HR)\u2009=\u200913.0, P\u2009=\u20091.19\u2009\u00d7\u200910\u22128; p.D91A: HR\u2009=\u20090.453, P\u2009=\u20091.48\u2009\u00d7\u200910\u22127). Similarly, FUS p.R521C and p.P525L were associated with earlier onset (p.R521C: B\u2009=\u2009\u221216.2, P\u2009=\u20091.90\u2009\u00d7\u200910\u22124; p.525L: B\u2009=\u2009\u221239.1, P\u2009=\u20091.53\u2009\u00d7\u200910\u221210), with p.P525L specifically associated with shorter survival (HR\u2009=\u200941.75, P\u2009=\u20091.41\u2009\u00d7\u200910\u221210). Notably, ARPP21 p.P563L was associated with a significantly lower age at onset (B\u2009=\u2009\u221212.7, P\u2009=\u20095.44\u2009\u00d7\u200910\u22124) and shorter survival (HR\u2009=\u20095.96, \u2206survival time\u2009=\u2009\u221219.5 months, P\u2009=\u20092.54\u2009\u00d7\u200910\u22126), showing effect sizes comparable to SOD1 p.A5V (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig8\" rel=\"nofollow noopener\" target=\"_blank\">6a<\/a>). Among URVs, SOD1 was associated with longer survival (HR\u2009=\u20090.45, P\u2009=\u20090.0022), whereas no significant associations were observed for other genes (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig8\" rel=\"nofollow noopener\" target=\"_blank\">6b<\/a>).<\/p>\n<p>Replication confirms YKT6 and supports HTR3C, GBGT1 and KNTC1 as ALS risk genes<\/p>\n<p>For replication, we generated a cohort comprising 4,781 individuals with ALS and 130,928 controls after applying stringent quality control criteria identical to those used in the discovery set (Supplementary Figs. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">11<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">12<\/a>). Power analyses based on the (winner\u2019s curse adjusted) effect sizes observed in the discovery dataset indicated that this provides between 32% and 91% statistical power for replication across candidate variants and genes (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">12<\/a>).<\/p>\n<p>Of the five new single variants identified in the discovery phase, all showed a consistent direction of effect in the replication cohort (Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Tab1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>; \u03bb1,000\u2009=\u20090.965). Moreover, all five reached exome-wide significance in a meta-analysis of the combined discovery and replication data, with all but CAPN2 p.I530V showing greater significance compared to the discovery phase alone (Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Tab1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>). Furthermore, YKT6 p.Y64C achieved replication-wide significance (P\u2009&lt;\u20090.0063), correcting for the eight new associations from the discovery phase (five single variants and three URV genes). Among the three candidate URV genes, a consistent direction of effect was seen only for KIF4A (OR\u2009=\u20092.46, P\u2009=\u20090.26), and none reached replication-wide significance (Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Tab2\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>; \u03bb1,000\u2009=\u20091.046).<\/p>\n<p>Establishing independent evidence for ARPP21, DNAJC7 and CFAP410<\/p>\n<p>Next, for the genes that were significant in our discovery analysis that are currently classified by GCEP with \u2018Limited\u2019 evidence (ARPP21, CFAP410 and DNAJC7), we aimed to confirm the independence of our findings.<\/p>\n<p>For ARPP21, we identified two rare variants: p.P747L, which has not previously been reported in the scientific literature, and p.P563L, previously reported in UK and Spanish families as candidate variants<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 18\" title=\"Cooper-Knock, J. et al. Mutations in the glycosyltransferase domain of GLT8D1 are associated with familial amyotrophic lateral sclerosis. Cell Rep. 26, 2298&#x2013;2306 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#ref-CR18\" id=\"ref-link-section-d66612860e5233\" rel=\"nofollow noopener\" target=\"_blank\">18<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 19\" title=\"Dols-Icardo, O. et al. Identification of a pathogenic mutation in ARPP21 in patients with amyotrophic lateral sclerosis. J. Neurol. Neurosurg. Psychiatry 96, 132&#x2013;139 (2025).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#ref-CR19\" id=\"ref-link-section-d66612860e5236\" rel=\"nofollow noopener\" target=\"_blank\">19<\/a>. To confirm independence for p.P563L, we excluded four potentially overlapping UK carriers (no Spanish carriers were identified). The association remained (OR\u2009=\u200928.3, P\u2009=\u20093.47\u2009\u00d7\u200910\u22127; Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig9\" rel=\"nofollow noopener\" target=\"_blank\">7a<\/a>) and was further supported by our replication dataset, which had no potential overlap with previous studies (OR\u2009=\u200916.5, P\u2009=\u20093.29\u2009\u00d7\u200910\u22123). A meta-analysis of these two independent datasets yielded a highly significant association (P\u2009=\u20094.31\u2009\u00d7\u200910\u22129), confirming a strong, independent signal. We also validated the reported effects of age of onset and progression<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 19\" title=\"Dols-Icardo, O. et al. Identification of a pathogenic mutation in ARPP21 in patients with amyotrophic lateral sclerosis. J. Neurol. Neurosurg. Psychiatry 96, 132&#x2013;139 (2025).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#ref-CR19\" id=\"ref-link-section-d66612860e5259\" rel=\"nofollow noopener\" target=\"_blank\">19<\/a> in our nonoverlapping cohort (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig8\" rel=\"nofollow noopener\" target=\"_blank\">6<\/a>). Finally, ARPP21 carriers were observed across several cohorts beyond those from the UK and Spain, significantly expanding its known population distribution (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">5<\/a>).<\/p>\n<p>CFAP410 p.V58L was previously identified in two common variant genome-wide association studies (GWAS) (MAF\u2009=\u20090.013)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\" title=\"Van Rheenen, W. et al. Genome-wide association analyses identify new risk variants and the genetic architecture of amyotrophic lateral sclerosis. Nat. Genet. 48, 1043&#x2013;1048 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#ref-CR4\" id=\"ref-link-section-d66612860e5278\" rel=\"nofollow noopener\" target=\"_blank\">4<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 20\" title=\"van Rheenen, W. et al. Common and rare variant association analyses in amyotrophic lateral sclerosis identify 15 risk loci with distinct genetic architectures and neuron-specific biology. Nat. Genet. 53, 1636&#x2013;1648 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#ref-CR20\" id=\"ref-link-section-d66612860e5281\" rel=\"nofollow noopener\" target=\"_blank\">20<\/a>. To confirm independence, we excluded 8,372 cases and 4,159 controls that were duplicated or had second-degree or closer genetic relatedness to the original GWAS cohorts. The association remained highly significant after this exclusion (Pmeta\u2009=\u20091.34\u2009\u00d7\u200910\u221214), with consistent ORs in both discovery (OR\u2009=\u20091.81, P\u2009=\u20091.32\u2009\u00d7\u200910\u221210) and replication (OR\u2009=\u20091.61, P\u2009=\u20091.09\u2009\u00d7\u200910\u22125) cohorts (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig9\" rel=\"nofollow noopener\" target=\"_blank\">7b<\/a>).<\/p>\n<p>For DNAJC7, implicated previously in a case\u2013control study of ALS<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 21\" title=\"Farhan, S. M. K. et al. Exome sequencing in amyotrophic lateral sclerosis implicates a novel gene, DNAJC7, encoding a heat-shock protein. Nat. Neurosci. 22, 1966&#x2013;1974 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#ref-CR21\" id=\"ref-link-section-d66612860e5311\" rel=\"nofollow noopener\" target=\"_blank\">21<\/a>, we re-evaluated the association after excluding overlapping cohorts (excluding 5,722 cases and 9,849 controls). In this reduced discovery dataset, there remained a robust association with a consistent odds ratio (ncases\u2009=\u20097,606, ncontrols\u2009=\u200959,926; OR\u2009=\u20092.56, P\u2009=\u20091.36\u2009\u00d7\u200910\u22124; Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig9\" rel=\"nofollow noopener\" target=\"_blank\">7c<\/a>). This was further supported by our replication cohort, which had minimal overlap (190 cases) with the previous study (OR\u2009=\u20092.41, P\u2009=\u20092.82\u2009\u00d7\u200910\u22123; Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02535-9#Fig9\" rel=\"nofollow noopener\" target=\"_blank\">7c<\/a>). Meta-analysis across these two datasets confirmed a strong, independent signal (P\u2009=\u20092.96\u2009\u00d7\u200910\u22126).<\/p>\n","protected":false},"excerpt":{"rendered":"Building a harmonized ALS exome dataset for rare variant analysis To identify rare coding variants involved in ALS,&hellip;\n","protected":false},"author":2,"featured_media":508110,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[10],"tags":[5083,5085,3251,5082,8815,59,5084,3250,18005,102,5081,48562,56,54,55],"class_list":{"0":"post-508109","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-health","8":"tag-agriculture","9":"tag-animal-genetics-and-genomics","10":"tag-biomedicine","11":"tag-cancer-research","12":"tag-dna-sequencing","13":"tag-gb","14":"tag-gene-function","15":"tag-general","16":"tag-genome-wide-association-studies","17":"tag-health","18":"tag-human-genetics","19":"tag-motor-neuron-disease","20":"tag-uk","21":"tag-united-kingdom","22":"tag-unitedkingdom"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/508109","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/comments?post=508109"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/508109\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media\/508110"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media?parent=508109"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/categories?post=508109"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/tags?post=508109"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}