{"id":210959,"date":"2025-10-09T05:33:17","date_gmt":"2025-10-09T05:33:17","guid":{"rendered":"https:\/\/www.newsbeep.com\/us\/210959\/"},"modified":"2025-10-09T05:33:17","modified_gmt":"2025-10-09T05:33:17","slug":"sex-and-smoking-bias-in-the-selection-of-somatic-mutations-in-human-bladder","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/us\/210959\/","title":{"rendered":"Sex and smoking bias in the selection of somatic mutations in human bladder"},"content":{"rendered":"<p>Sample collection<\/p>\n<p>Two epithelial brushes (2\u20133\u2009cm2) from the bladder top (dome) and the bladder floor (trigone) were obtained from 53 people without known bladder pathology and no history of bladder cancer upon autopsy (average 4\u2009days post-mortem) at the University of Washington. Next of kin consented to autopsies and research on leftover specimens. The study of de-identified collected specimens and linked clinical history from the deceased donors was reviewed and deemed not human subjects research by the University of Washington Institutional Review Board (STUDY00016707; IRB Federal Wide Assurance number, FWA 00006878). We obtained the following relevant clinical information for all donors: age, sex, BMI, tobacco smoking history, alcohol use, previous cancer and chemotherapy exposure (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>). Three donors with active or chronic inflammation and four donors with insufficient DNA were discarded. The two samples from another donor were also discarded from the study upon visual inspection of their mutational profile, resulting in a total of 45 deceased donors in this study (Supplementary Notes\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">3<\/a>).<\/p>\n<p>Duplex DNA sequencingCapture panel design<\/p>\n<p>We designed a panel including ten genes identified in a previous study as being under positive selection in the normal urothelium and with more than ten mutations across the 1,647 microbiopsies analysed<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 10\" title=\"Lawson, A. R. J. et al. Extensive heterogeneity in somatic mutation and selection in the human bladder. Science 370, 75&#x2013;82 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR10\" id=\"ref-link-section-d255693234e2002\" rel=\"nofollow noopener\" target=\"_blank\">10<\/a>. We added five genes that are mutated frequently in bladder tumours<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 30\" title=\"Tate, J. G. et al. COSMIC: the catalogue of somatic mutations in cancer. Nucleic Acids Res. 47, D941&#x2013;D947 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR30\" id=\"ref-link-section-d255693234e2006\" rel=\"nofollow noopener\" target=\"_blank\">30<\/a> and the TERT promoter\u2014also known to be under positive selection in bladder carcinomas<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 22\" title=\"Rheinbay, E. et al. Analyses of non-coding somatic drivers in 2,658 cancer whole genomes. Nature 578, 102&#x2013;111 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR22\" id=\"ref-link-section-d255693234e2013\" rel=\"nofollow noopener\" target=\"_blank\">22<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 23\" title=\"Sabarinathan, R. et al. The whole-genome panorama of cancer drivers. Preprint at bioRxiv &#010;                https:\/\/doi.org\/10.1101\/190330&#010;                &#010;               (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR23\" id=\"ref-link-section-d255693234e2016\" rel=\"nofollow noopener\" target=\"_blank\">23<\/a>. This resulted in a panel containing the entire (or almost entire) coding region of 12 genes (ARID1A, NOTCH2, FOXQ1, CDKN1A, KMT2D, RB1, CREBBP, TP53, EP300, KDM6A, RBM10 and STAG2), three genes for which only selected regions were targeted because of clustering of cancer mutations in those regions and\/or difficulties for capturing the full gene (PIK3CA, FGFR3 and KMT2C) and the TERT promoter<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 10\" title=\"Lawson, A. R. J. et al. Extensive heterogeneity in somatic mutation and selection in the human bladder. Science 370, 75&#x2013;82 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR10\" id=\"ref-link-section-d255693234e2071\" rel=\"nofollow noopener\" target=\"_blank\">10<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 61\" title=\"Ju, Y. S. The mutational signatures and molecular alterations of bladder cancer. Transl. Cancer Res. 6, S689&#x2013;S701 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR61\" id=\"ref-link-section-d255693234e2074\" rel=\"nofollow noopener\" target=\"_blank\">61<\/a>. The panel was constructed by TwinStrand Biosciences and covered 111,876\u2009base pairs, including 65,086\u2009base pairs in coding regions and 46,790\u2009base pairs in non-coding regions (Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">3<\/a> and Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>).<\/p>\n<p>DNA extraction, duplex library preparation and sequencing<\/p>\n<p>After centrifugation of epithelial brushes, DNA was extracted using the DNeasy Blood and Tissue (Qiagen) kit, following manufacturer\u2019s instructions with some variations (Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>). The DNA integrity number (DIN) was measured using Agilent 4200 TapeStation Genomic tapes. Duplex sequencing libraries were prepared using commercially available kits (TwinStrand Biosciences)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 13\" title=\"Kennedy, S. R. et al. Detecting ultralow-frequency mutations by duplex sequencing. Nat. Protoc. 9, 2586&#x2013;2606 (2014).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR13\" id=\"ref-link-section-d255693234e2099\" rel=\"nofollow noopener\" target=\"_blank\">13<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 14\" title=\"Schmitt, M. W. et al. Detection of ultra-rare mutations by next-generation sequencing. Proc. Natl Acad. Sci. USA 109, 14508&#x2013;14513 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR14\" id=\"ref-link-section-d255693234e2102\" rel=\"nofollow noopener\" target=\"_blank\">14<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Dillon, L. W. et al. Quantification of measurable residual disease using duplex sequencing in adults with acute myeloid leukemia. Haematologica 109, 401&#x2013;410 (2024).\" href=\"#ref-CR62\" id=\"ref-link-section-d255693234e2105\">62<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Woolston, D. W. et al. Ultra-deep mutational landscape in chronic lymphocytic leukemia uncovers dynamics of resistance to targeted therapies. Haematologica 109, 835&#x2013;845 (2024).\" href=\"#ref-CR63\" id=\"ref-link-section-d255693234e2105_1\">63<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 64\" title=\"Oshima, M. U. et al. Characterization of clonal dynamics using duplex sequencing in donor-recipient pairs decades after hematopoietic cell transplantation. Sci. Transl. Med. 16, eado5108 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR64\" id=\"ref-link-section-d255693234e2108\" rel=\"nofollow noopener\" target=\"_blank\">64<\/a> and 250\u2009ng of genomic DNA. The DNA fragmentation step was carried out taking into account the starting DIN of samples and monitored using TapeStation. Fragmented DNA was subject to end-repair, A-tailing, ligation to duplex sequencing adaptors, library conditioning and PCR amplification, according to protocol. The PCR product was captured at 65\u2009\u00b0C for 16\u201320\u2009h. Upon PCR amplification, libraries were pooled for sequencing. Sequencing was performed with a NovaSeq 6000 at the Department of Laboratory Medicine and Pathology at University of Washington or a NovaSeq X Plus at Novogene or the Fred Hutchinson Cancer Center using 2\u2009\u00d7\u2009150\u2009base-pair paired-end reads (around 115\u2009million reads per sample; Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>).<\/p>\n<p>Duplex DNA sequencing was successful for 34 donors on both samples. For 11 other donors, we produced duplex DNA sequencing data only for dome or trigone because of insufficient DNA or too fragmented DNA (DIN\u2009&lt;\u20091.4) in the paired sample. Thus, the final number of donors with available data for at least one sample was 45 (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>) and the final number of samples processed was 79 (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">4<\/a>).<\/p>\n<p>Somatic mutation calling<\/p>\n<p>We constructed a computational pipeline (deepUMIcaller) in Nextflow<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 65\" title=\"Di Tommaso, P. et al. Nextflow enables reproducible computational workflows. Nat. Biotechnol. 35, 316&#x2013;319 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR65\" id=\"ref-link-section-d255693234e2133\" rel=\"nofollow noopener\" target=\"_blank\">65<\/a> to call mutations from duplex sequencing data on the basis of an early version of nf-core\/fastquorum pipeline<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 66\" title=\"Ewels, P. A. et al. The nf-core framework for community-curated bioinformatics pipelines. Nat. Biotechnol. 38, 276&#x2013;278 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR66\" id=\"ref-link-section-d255693234e2137\" rel=\"nofollow noopener\" target=\"_blank\">66<\/a>, which implements the fgbio Best Practices FASTQ to Consensus Pipeline (<a href=\"https:\/\/github.com\/fulcrumgenomics\/fgbio\/blob\/main\/docs\/best-practice-consensus-pipeline.md\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/github.com\/fulcrumgenomics\/fgbio\/blob\/main\/docs\/best-practice-consensus-pipeline.md<\/a>) and downstream variant calling by VarDictJava (<a href=\"https:\/\/github.com\/AstraZeneca-NGS\/VarDictJava\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/github.com\/AstraZeneca-NGS\/VarDictJava<\/a>). A series of filters to discard potential artefacts are included in the pipeline. Code implementing deepUMIcaller is available at (<a href=\"http:\/\/github.com\/bbglab\/deepUMIcaller\" rel=\"nofollow noopener\" target=\"_blank\">github.com\/bbglab\/deepumicaller<\/a>). For a detailed description of the pipeline, see Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">3<\/a>.<\/p>\n<p>To estimate the error rate of TwinStrand DNA duplex sequencing, we compared the density of mutations and mutational profile identified across three cord blood samples (purchased from StemCell) with that expected on the basis of colonies obtained from human haematopoietic stem cells<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 67\" title=\"Osorio, F. G. et al. Somatic mutations reveal lineage relationships and age-related mutagenesis in human hematopoiesis. Cell Rep. 25, 2308&#x2013;2316 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR67\" id=\"ref-link-section-d255693234e2169\" rel=\"nofollow noopener\" target=\"_blank\">67<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 68\" title=\"Machado, H. E. et al. Diverse mutational landscapes in human lymphocytes. Nature 608, 724&#x2013;732 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR68\" id=\"ref-link-section-d255693234e2172\" rel=\"nofollow noopener\" target=\"_blank\">68<\/a>. The estimated error rate resulting from this analysis was around 4\u2009\u00d7\u200910\u22128, which is two orders of magnitude lower than the mutation density across samples in this study (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#Fig7\" rel=\"nofollow noopener\" target=\"_blank\">2a<\/a> and Supplementary Notes\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">4<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">8<\/a>).<\/p>\n<p>Mutational signaturesDe novo signature extraction<\/p>\n<p>We extracted mutational signatures de novo with a Bayesian hierarchical Dirichlet process using HDP_sigExtraction pipeline (<a href=\"https:\/\/github.com\/McGranahanLab\/HDP_sigExtraction\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/github.com\/McGranahanLab\/HDP_sigExtraction<\/a>) and the R-package hdp developed by N. Roberts (<a href=\"https:\/\/github.com\/nicolaroberts\/hdp\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/github.com\/nicolaroberts\/hdp<\/a>)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 9\" title=\"Lee-Six, H. et al. The landscape of somatic mutation in normal colorectal epithelial cells. Nature 574, 532&#x2013;537 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR9\" id=\"ref-link-section-d255693234e2213\" rel=\"nofollow noopener\" target=\"_blank\">9<\/a> and SigProfilerExtractor (<a href=\"https:\/\/github.com\/AlexandrovLab\/SigProfilerExtractor\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/github.com\/AlexandrovLab\/SigProfilerExtractor<\/a>)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 26\" title=\"Islam, S. M. A. et al. Uncovering novel mutational signatures by de novo extraction with SigProfilerExtractor. Cell Genomics 2, 100179 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR26\" id=\"ref-link-section-d255693234e2224\" rel=\"nofollow noopener\" target=\"_blank\">26<\/a>. The HDP_sigExtraction pipeline was run with the default parameters with no previous signatures assigned. Five signatures in addition to the null signature were extracted. De novo signatures were extracted using SigProfiler using the nonnegative matrix factorization approach. The same input data were used. The upper bound for the number of signatures was set to ten; however, the most robust solution was three signatures that were similar to the three most active signatures extracted by HDP. These two sets of mutational signatures were decomposed into known COSMIC signatures when possible (Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">5<\/a>).<\/p>\n<p>Assessing biases in trinucleotide composition of the panel<\/p>\n<p>To account for biases due to the trinucleotide composition of the panel, the number of substitutions of each class was re-calculated. The rate of each substitution was calculated as the number of observed substitutions with the consequence in question divided by the number of corresponding sequenced trinucleotide sites (number of trinucleotides with this consequence in the panel weighted by sequencing depth). This mutational probability was multiplied by the number of the corresponding trinucleotide in the genome to get the expected number of substitutions of this type in the whole genome. The mutational probability of each substitution for the whole genome was calculated by dividing the expected number of substitutions with this consequence per genome by the sum of all expected substitutions per genome. Finally, the expected number of substitutions with each consequence was obtained by multiplying the probability by the total number of substitutions observed in the sample to keep the absolute number of observed mutations.<\/p>\n<p>Statistical association between mutational signatures and clinical variables<\/p>\n<p>We tested for possible associations of de novo extracted signatures with age, sex and bladder location. To do this we used linear mixed-effect regression models (Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">5<\/a>). We tested both the number of mutations attributed to the signature (counts) and the relative contribution (proportion) of each signature as a response variable. Age, sex and bladder location were used as fixed effects (separated regression was prepared for each fixed effect variable) and donors were used as random effects to control for non-independence between dome and trigone samples from the same person. P\u2009values for the association were obtained using likelihood-ratio tests (ANOVA function) comparing models including and excluding the variable of interest. Associations with all other clinical variables (smoking history, drinking history, chemotherapy history, and so on) were tested in the same way but always including age, sex and bladder location in the regression together with the variable of interest.<\/p>\n<p>Positive selection<\/p>\n<p>We used four methods to compute positive selection on the mutations observed across genes. One method (Omega)\u2014a dN\/dS approach to assess the strength of selection on the mutational pattern of genes\u2014was developed de novo for this study and is described at length in Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">6<\/a> (<a href=\"https:\/\/github.com\/bbglab\/omega\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/github.com\/bbglab\/omega<\/a>). Two others, OncodriveFML and Oncodrive3D, which compute the deviation in the average functional impact and clustering in the three-dimensional structure of proteins, respectively, from those expected under neutrality, had been developed previously, and were adapted here to work on duplex sequencing data. These are also described in Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">6<\/a>. A fourth method, assessing the relative enrichment for frameshift indels observed across genes was also developed de novo for this study and is described thoroughly in Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">6<\/a>. For the TERT promoter, all mutations that were observed at least twice across 8,136 whole-genome sequencing tumour samples sequenced by the Hartwig Medical Foundation and the Pan-Cancer Analysis of Whole Genomes consortium (Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">6<\/a>) were considered activating. The remaining mutations were used to calculate the expected number of activating mutations under neutrality.<\/p>\n<p>Fraction of the urothelium covered by clones with driver mutations<\/p>\n<p>The number of missense and truncating driver mutations of each gene in a sample was obtained from the dN\/dS missense and dN\/dS truncating values. From these values the number of missense and truncating mutations in excess in each sample were calculated. Specifically, the 95% confidence intervals of both dN\/dS values were used to compute two extreme values of driver mutations in each sample, whereas the mean dN\/dS value was used to compute an expected number of driver mutations.<\/p>\n<p>This mean number was thus used to select the most likely driver mutations in the sample. To this end, mutations were ranked in descending order according to their VAF, and the top number of mutations corresponding to the expected number of driver mutations were selected. Then, for a gene G, we computed the fraction of genomes bearing driver mutations out of those sequenced at each genomic position covered by the DNA duplex sequencing as:<\/p>\n<p>$$P(G)=\\mathop{\\sum }\\limits_{i=1}^{n}{(-1)}^{i-1}\\sum _{| S| =i}\\prod _{x\\in S}{p}_{x}=\\sum _{x}{p}_{x}-\\sum _{x\\ne y}{p}_{x}{p}_{y}+\\cdots +{(-1)}^{n-1}{p}_{{x}_{1}}{p}_{{x}_{2}}\\cdots {p}_{{x}_{n}}$$<\/p>\n<p>where px is the all-molecules VAF of a driver mutation x (considering both duplex and non-duplex reads; Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">3<\/a>) and n is the number of driver mutations from the previously selected set in the gene. The formula is the realization of the probabilistic principle of inclusion-exclusion assuming independence across mutations<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 69\" title=\"Grinstead, C. M &amp; Snell, J. L. Introduction to Probability: Second Revised Edition (American Mathematical Society, 1997).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR69\" id=\"ref-link-section-d255693234e2644\" rel=\"nofollow noopener\" target=\"_blank\">69<\/a>.<\/p>\n<p>The fraction of genomes with driver mutations is equal to the fraction of cells with driver mutations under the assumption that two mutations, one in each homologous copy of the gene, are required to drive the clonal expansion. This extreme would be true in the case that all studied genes behave as classical tumour suppressors and bear deleterious mutations in both alleles. An exception to this rule are the mutations in genes in the X chromosome in men, whereby one mutation would suffice to drive the clonal expansion. In general, it is possible for some genes to be capable of driving the clonal expansion (or mutations at specific positions in a gene) with only one mutated allele. It is also possible that the other allele is affected by a large deletion or methylation event not seen by DNA duplex sequencing. In the extreme that only one mutation per cell per gene is required, the fraction of cells with driver mutations will be double the fraction of genomes. In the plot in Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#Fig10\" rel=\"nofollow noopener\" target=\"_blank\">5b<\/a>, we use the two-hit assumption.<\/p>\n<p>Association of the urothelium clonal structure with bladder cancer risk factors<\/p>\n<p>We designed a two-step strategy to assess the influence of known bladder cancer risk factors on features of the urothelium clonal structure across samples from donors in the cohort. As dependent variables representing the urothelium clonal structure, we selected the (protein-affecting and non-protein-affecting) mutation density, and the magnitude of positive selection on genic mutations (dN\/dS missense and dN\/dS truncating). As the dependent variables are continuous, we chose linear regressions, specifically mixed-effects linear models, to account for two samples from the same donor and use all available samples. As independent variables, we included a list of available clinical features: age (in decades), sex, smoking history (binarized as ever or never smoker), alcohol consumption history (also binarized), BMI (re-scaled within the 0\u20131 interval), and exposure to chemo\/radiotherapy (binarized). For the case of TERT promoter mutation density association, we used the interaction of age and smoking history instead of age and smoking history separately. In the first step, we applied univariate mixed-effects linear models to identify any clinical feature with a significant association with any of the dependent variables for any gene or for all genes. In the second step, we applied multivariate mixed-effects linear models to rule out confounding effects for the associations that seemed significant in the first step. Associations with FDR below 0.2 were deemed significant. We also carried out a binomial test to rule out a spurious dependence of the mutation density on group differences in terms of sequencing depth. For details, see Supplementary Notes\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">9<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">10<\/a>.<\/p>\n<p>Tumour data<\/p>\n<p>Mutations identified across 622 muscle-invasive and 105 non-muscle-invasive bladder cancer (MIBC and NMBIC, respectively) cohorts were downloaded from cBioPortal<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 49\" title=\"de Bruijn, I. et al. Analysis and visualization of longitudinal genomic and clinical data from the AACR project GENIE Biopharma Collaborative in cBioPortal. Cancer Res. 83, 3861&#x2013;3867 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR49\" id=\"ref-link-section-d255693234e2679\" rel=\"nofollow noopener\" target=\"_blank\">49<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Cerami, E. et al. The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer Discov. 2, 401&#x2013;404 (2012).\" href=\"#ref-CR70\" id=\"ref-link-section-d255693234e2682\">70<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Gao, J. et al. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci. Signal. 6, pl1 (2013).\" href=\"#ref-CR71\" id=\"ref-link-section-d255693234e2682_1\">71<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Van Allen, E. M. et al. Somatic ERCC2 mutations correlate with cisplatin sensitivity in muscle-invasive urothelial carcinoma. Cancer Discov. 4, 1140&#x2013;1153 (2014).\" href=\"#ref-CR72\" id=\"ref-link-section-d255693234e2682_2\">72<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 73\" title=\"Pietzak, E. J. et al. Genomic differences between &#x2018;primary&#x2019; and &#x2018;secondary&#x2019; muscle-invasive bladder cancer as a basis for disparate outcomes to cisplatin-based neoadjuvant chemotherapy. Eur. Urol. 75, 231&#x2013;239 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR73\" id=\"ref-link-section-d255693234e2685\" rel=\"nofollow noopener\" target=\"_blank\">73<\/a> together with the clinical data of the combined study. From the MIBC Beijing Genomics Institute cohort<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 74\" title=\"Guo, G. et al. Whole-genome and whole-exome sequencing of bladder cancer identifies frequent alterations in genes involved in sister chromatid cohesion and segregation. Nat. Genet. 45, 1459&#x2013;1463 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR74\" id=\"ref-link-section-d255693234e2689\" rel=\"nofollow noopener\" target=\"_blank\">74<\/a> only samples labelled as invasive were considered MIBC and added to the MIBC dataset. Mutations identified in a further cohort of 79 NMIBCs were obtained from the literature<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 40\" title=\"Hurst, C. D. et al. Genomic subtypes of non-invasive bladder cancer with distinct metabolic profile and female gender bias in KDM6A mutation frequency. Cancer Cell 32, 701&#x2013;715 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR40\" id=\"ref-link-section-d255693234e2693\" rel=\"nofollow noopener\" target=\"_blank\">40<\/a> and included as part of the NMIBC dataset. Only non-silent protein-affecting mutations in 14 of the genes included in the panel were analysed. In case of duplicated samples across MIBC and NMBIC cohorts, the mutations from the most recent published study were kept. In total, mutations across 806 bladder tumours were obtained (622 MIBC and 184 NMIBC). The mutation density per gene across the two datasets was calculated by dividing the number of observed mutations per gene by the length of its coding region. To compute a mutation density metric comparable with that used in the normal urothelium that accounts for the number of megabases sequenced, we multiplied the coding length by two times the number of samples in which that gene was sequenced, to take into account the two copies in a diploid genome. Under the assumption that, in tumours each mutation belongs to a single clonal expansion and in normal tissues we have a mixture of clones, we reasoned that the number of different tumour genomes sequenced would be equivalent to the number of genomes sequenced in normal urothelium. The mutation density per megabase was calculated multiplying the mutation density by 106.<\/p>\n<p>Mutations identified across 33,218 tumours (892 bladder tumours, mostly MIBC) in intOGen<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 24\" title=\"Mart&#xED;nez-Jim&#xE9;nez, F. et al. A compendium of mutational cancer driver genes. Nat. Rev. Cancer 20, 555&#x2013;572 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR24\" id=\"ref-link-section-d255693234e2702\" rel=\"nofollow noopener\" target=\"_blank\">24<\/a> were downloaded from <a href=\"https:\/\/www.intogen.org\/\" rel=\"nofollow noopener\" target=\"_blank\">intogen.org<\/a>. These mutations were used to obtain the total number of mutations observed in each gene, their distribution along the sequence of the genes in the study and the percentage of sites affected by different numbers of mutations. The same data of 109,017 tumours (3,909 bladder tumours) were obtained from the GENIE project<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 50\" title=\"AACR Project GENIE Consortium AACR project GENIE: powering precision medicine through an International Consortium. Cancer Discov. 7, 818&#x2013;831 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR50\" id=\"ref-link-section-d255693234e2713\" rel=\"nofollow noopener\" target=\"_blank\">50<\/a> to calculate the frequency of mutations in each of the genes and the TERT promoter. Mutations in the TERT promoter were also obtained from two cohorts of tumours (included in intOGen) sequenced at the whole-genome level (Hartwig Medical Foundation<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 21\" title=\"Mart&#xED;nez-Jim&#xE9;nez, F. et al. Pan-cancer whole-genome comparison of primary and metastatic solid tumours. Nature 618, 333&#x2013;341 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR21\" id=\"ref-link-section-d255693234e2724\" rel=\"nofollow noopener\" target=\"_blank\">21<\/a>: N\u2009=\u20095,582 and Pan-Cancer Analysis of Whole Genomes<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 19\" title=\"Campbell, P. J. et al. Pan-cancer analysis of whole genomes. Nature 578, 82&#x2013;93 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR19\" id=\"ref-link-section-d255693234e2731\" rel=\"nofollow noopener\" target=\"_blank\">19<\/a>: N\u2009=\u20092,554). The classification (and score) of all possible mutations in TP53 into drivers and passengers through in silico saturation mutagenesis was obtained from boostDM (<a href=\"http:\/\/intogen.org\/boostDM\" rel=\"nofollow noopener\" target=\"_blank\">intogen.org\/boostDM<\/a>). Details of analyses involving tumour mutations appear in Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">11<\/a>.<\/p>\n<p>Natural saturation mutagenesis<\/p>\n<p>To compare the distribution of somatic mutations along the sequence of each of the genes in normal urothelium and bladder tumours, we first downloaded somatic mutations identified across 892 bladder tumours from the intOGen (<a href=\"https:\/\/www.intogen.org\/\" rel=\"nofollow noopener\" target=\"_blank\">intogen.org<\/a>) platform. Although this cohort is larger than the 806 muscle-invasive and non-muscle-invasive tumours used to explore differences in mutation density, it is composed mostly of muscle-invasive carcinomas.<\/p>\n<p>For each gene, we started the analysis with all genomic sites in the coding sequences and splicing sites that passed the pipeline\u2019s filtering criteria of sufficient duplex coverage across samples (Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">3<\/a>). We then mapped these DNA sites to protein positions using the Ensembl REST API<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 75\" title=\"Harrison, P. W. et al. Ensembl 2024. Nucleic Acids Res. 52, D891&#x2013;D899 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR75\" id=\"ref-link-section-d255693234e2773\" rel=\"nofollow noopener\" target=\"_blank\">75<\/a>, allowing us to determine the total number of protein positions (including amino acid residues and stop codon) covered by the panel. Then, we computed the number of mutations affecting each protein residue of every gene studied here across the 79 normal urothelium samples and the 892 bladder tumours. Next, we obtained the consequence type (missense, synonymous, nonsense or splice-affecting) of these mutations on the MANE transcript<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 76\" title=\"Morales, J. et al. A joint NCBI and EMBL&#x2013;EBI transcript set for clinical genomics and research. Nature 604, 310&#x2013;315 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR76\" id=\"ref-link-section-d255693234e2777\" rel=\"nofollow noopener\" target=\"_blank\">76<\/a> of each gene from the output of the Variant Effect Predictor v.111 (ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 77\" title=\"McLaren, W. et al. The Ensembl variant effect predictor. Genome Biol. 17, 122 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR77\" id=\"ref-link-section-d255693234e2781\" rel=\"nofollow noopener\" target=\"_blank\">77<\/a>), run within the intOGen pipeline<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 24\" title=\"Mart&#xED;nez-Jim&#xE9;nez, F. et al. A compendium of mutational cancer driver genes. Nat. Rev. Cancer 20, 555&#x2013;572 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR24\" id=\"ref-link-section-d255693234e2785\" rel=\"nofollow noopener\" target=\"_blank\">24<\/a> (<a href=\"https:\/\/github.com\/bbglab\/intogen-plus\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/github.com\/bbglab\/intogen-plus<\/a>). The method to compute the natural saturation mutagenesis kinetics in Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">5b<\/a> is described in Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">12<\/a>. The residue-level sequencing depth shown in Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">5c<\/a>, Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#Fig14\" rel=\"nofollow noopener\" target=\"_blank\">9d<\/a> and Supplementary Figs. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">3<\/a> was calculated as the average depth across all sites corresponding to each codon.<\/p>\n<p>Calculation of site selection<\/p>\n<p>We developed a metric to measure selection per site by comparing the observed with the expected number of mutations at each site or residue. The expected number of mutations per site was obtained by distributing the expected number of mutations in a given gene under neutrality along all the possible changes in that same genomic region, with the assumption that only the mutation probability of each trinucleotide and the sequencing depth are responsible for the within-gene differences of mutation probability. We computed this value for each possible mutation in the positively selected genes including the TERT promoter, and for the protein-coding genes we also obtained a value per residue. A more complete explanation on this metric (and the detection of positive selection in sub-genic structures such as exons and domains) can be found in Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">6<\/a>.<\/p>\n<p>Structural representation and features<\/p>\n<p>Structural models for all proteins used to run Oncodrive3D were obtained from the AlphaFold database (AlphaFold\u20092\u2009v.4)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 78\" title=\"Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583&#x2013;589 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR78\" id=\"ref-link-section-d255693234e2839\" rel=\"nofollow noopener\" target=\"_blank\">78<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 79\" title=\"Varadi, M. et al. AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res. 50, D439&#x2013;D444 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR79\" id=\"ref-link-section-d255693234e2842\" rel=\"nofollow noopener\" target=\"_blank\">79<\/a>. Three-dimensional protein structures and protein structural features represented across figures were also obtained from the AlphaFold database. Solvent accessibility and secondary structure information were extracted from the AlphaFold-predicted PDB structures using PDB_Tool (<a href=\"https:\/\/github.com\/realbigws\/PDB_Tool\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/github.com\/realbigws\/PDB_Tool<\/a>). Structural visualizations of proteins were produced using UCSF ChimeraX<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 80\" title=\"Pettersen, E. F. et al. UCSF ChimeraX: structure visualization for researchers, educators, and developers. Protein Sci. 30, 70&#x2013;82 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR80\" id=\"ref-link-section-d255693234e2853\" rel=\"nofollow noopener\" target=\"_blank\">80<\/a>.<\/p>\n<p>Comparison with experimental saturation mutagenesis<\/p>\n<p>The results of two saturation mutagenesis experiments estimating the functional impact of mutations in the TP53 DNA-binding domain<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 51\" title=\"Funk, J. S. et al. Deep CRISPR mutagenesis characterizes the functional diversity of TP53 mutations. Nat. Genet. 57, 140&#x2013;153 (2025).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR51\" id=\"ref-link-section-d255693234e2868\" rel=\"nofollow noopener\" target=\"_blank\">51<\/a> and along the sequence of the TERT promoter<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 55\" title=\"Kircher, M. et al. Saturation mutagenesis of twenty disease-associated regulatory elements at single base-pair resolution. Nat. Commun. 10, 3583 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR55\" id=\"ref-link-section-d255693234e2875\" rel=\"nofollow noopener\" target=\"_blank\">55<\/a> were obtained directly from supplementary tables from both papers. In the case of TP53, we used the transformed score. In the case of the TERT promoter, values calculated for the glioblastoma SF7996 cells system were used. In these comparisons, the site selection of individual mutations was used.<\/p>\n<p>Comparison of orthogonal studies of normal bladder<\/p>\n<p>We obtained the mutations identified through whole-genome sequencing in clonal or quasi-clonal samples obtained from the normal bladder of donors by laser capture microdissection in a previous study<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 10\" title=\"Lawson, A. R. J. et al. Extensive heterogeneity in somatic mutation and selection in the human bladder. Science 370, 75&#x2013;82 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#ref-CR10\" id=\"ref-link-section-d255693234e2894\" rel=\"nofollow noopener\" target=\"_blank\">10<\/a>. We used these samples to construct the mutational profile of normal urothelium, which we compared with that reconstructed in the present study through ultradeep sequencing (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#Fig7\" rel=\"nofollow noopener\" target=\"_blank\">2b<\/a>). From the same previous study, we obtained the mutations identified through whole-exome sequencing. Mutations obtained from samples sequenced at depth lower than 80\u00d7 were discarded. From these data, we calculated the number of mutations per megabase observed in each sample, and averaged this value across donors. A linear regression was then constructed between these values and the age of donors, and a trend line calculated. The number of mutations per megabase calculated across the samples in our cohort was overlaid upon the obtained regression. The results of this comparison are presented in Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#Fig7\" rel=\"nofollow noopener\" target=\"_blank\">2c<\/a>.<\/p>\n<p>Reporting summary<\/p>\n<p>Further information on research design is available in the\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09521-x#MOESM2\" rel=\"nofollow noopener\" target=\"_blank\">Nature Portfolio Reporting Summary<\/a> linked to this article.<\/p>\n","protected":false},"excerpt":{"rendered":"Sample collection Two epithelial brushes (2\u20133\u2009cm2) from the bladder top (dome) and the bladder floor (trigone) were obtained&hellip;\n","protected":false},"author":2,"featured_media":210960,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[34],"tags":[55992,58797,27379,97,1159,1160,79,120459],"class_list":{"0":"post-210959","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-health","8":"tag-cancer-epidemiology","9":"tag-cancer-genomics","10":"tag-genome-informatics","11":"tag-health","12":"tag-humanities-and-social-sciences","13":"tag-multidisciplinary","14":"tag-science","15":"tag-urological-cancer"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts\/210959","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/comments?post=210959"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts\/210959\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/media\/210960"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/media?parent=210959"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/categories?post=210959"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/tags?post=210959"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}