{"id":4301,"date":"2025-07-18T02:04:29","date_gmt":"2025-07-18T02:04:29","guid":{"rendered":"https:\/\/www.newsbeep.com\/ca\/4301\/"},"modified":"2025-07-18T02:04:29","modified_gmt":"2025-07-18T02:04:29","slug":"concurrent-loss-of-the-y-chromosome-in-cancer-and-t-cells-impacts-outcome","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/ca\/4301\/","title":{"rendered":"Concurrent loss of the Y chromosome in cancer and T cells impacts outcome"},"content":{"rendered":"<p>TCGA data acquisition and processing<\/p>\n<p>In this study, we used bulk RNA-seq, WES somatic mutation data, and clinical data (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>) sourced from TCGA project (<a href=\"https:\/\/portal.gdc.cancer.gov\/\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/portal.gdc.cancer.gov\/<\/a>). The data matrices obtained from UCSC Xena (<a href=\"https:\/\/xena.ucsc.edu\/\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/xena.ucsc.edu\/<\/a>) have been standardized, normalized and corrected for batch effects and platform differences. Additionally, mutation data generated by the PanCancer Atlas consortium (<a href=\"https:\/\/gdc.cancer.gov\/about-data\/publications\/pancanatlas\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/gdc.cancer.gov\/about-data\/publications\/pancanatlas<\/a>) were incorporated. A total of 29 tumour types and 4,127 male participants were included in our analysis. Survival outcome metrics, including OS, OS time, DSS and DSS time were calculated as in Liu et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 46\" title=\"Liu, J. et al. An integrated TCGA pan-cancer clinical data resource to drive high-quality survival outcome analytics. Cell 173, 400&#x2013;416.e411 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR46\" id=\"ref-link-section-d51900554e2641\" rel=\"nofollow noopener\" target=\"_blank\">46<\/a> (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>).<\/p>\n<p>Classification of LOY based on transcriptome data<\/p>\n<p>We used the DESeq2 R package (v.1.42.1) to uncover Y chromosome gene expression differences between LOYDNA and WTYDNA<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 47\" title=\"Diboun, I., Wernisch, L., Orengo, C. A. &amp; Koltzenburg, M. Microarray analysis after RNA amplification can detect pronounced differences in gene expression using limma. BMC Genomics 7, 252 (2006).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR47\" id=\"ref-link-section-d51900554e2660\" rel=\"nofollow noopener\" target=\"_blank\">47<\/a>. Differentially expressed genes were called on the basis of a log2FC cut-off of \u22121 and\u00a0a \u2212log10-adjusted P\u2009value cut-off of 200. This gene set resulted in the LOY prediction signature. Subsequently, we conducted a gene set intersection analysis with a gene set in the male-specific region of the Y chromosome seen expressed across 24 human tissues. This analysis required that genes exceeded 0.1 reads per kilobase of transcript per million reads mapped per tissue or that they had presence in IHC data from the Human Protein Atlas RNA-seq<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 11\" title=\"Prokop, J. W. &amp; Deschepper, C. F. Chromosome Y genetic variants: impact in animal models and on human disease. Physiol. Genomics 47, 525&#x2013;537 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR11\" id=\"ref-link-section-d51900554e2672\" rel=\"nofollow noopener\" target=\"_blank\">11<\/a>. This approach identified nine signature genes making up our Y chromosome signature (YchrS): DDX3Y, UTY, KDM5D, USP9Y, ZFY, RPS4Y1, TMSB4Y, EIF1AY and NLGN4Y. Based on single-sample GSEA (ssGSEA)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 48\" title=\"Barbie, D. A. et al. Systematic RNA interference reveals that oncogenic-driven cancers require TBK1. Nature 462, 108&#x2013;112 (2009).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR48\" id=\"ref-link-section-d51900554e2704\" rel=\"nofollow noopener\" target=\"_blank\">48<\/a> conducted with the GSVA<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 49\" title=\"H&#xE4;nzelmann, S., Castelo, R. &amp; Guinney, J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinf. 14, 7 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR49\" id=\"ref-link-section-d51900554e2708\" rel=\"nofollow noopener\" target=\"_blank\">49<\/a> R package (v.1.44.5), we observed that patients with low levels of YchrS exhibited characteristics similar to those of people with LOYDNA, whereas those with high YchrS levels resembled people with an intact Y chromosome (WTYDNA). We partitioned all patients into low YchrS group (LOYBR) and high YchrS group (WTYBR) with a mean value cut-off. Additionally, a similar approach was applied to analyse the Ywhole signature (YwholeS), using a signature comprising all Y chromosome genes.<\/p>\n<p>YchrS validation by using CCLE data<\/p>\n<p>To validate the nine-gene Ychr Signature (YchrS), we downloaded the batch-corrected transcriptomic data and corresponding CNA data for 778 male cancer cell lines from the\u00a0CCLE project (<a href=\"https:\/\/depmap.org\/portal\/ccle\/\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/depmap.org\/portal\/ccle\/<\/a>). YchrS was calculated by the same method as for\u00a0TCGA data. Cell lines with a YchrS lower than the mean value were categorized as LOYBR, otherwise they were called WTYBR. Average CNA for chromosome i in each cell line was calculated using the following formula to evaluate its integrity:<\/p>\n<p>$${\\rm{Average}}\\;{{\\rm{CNA}}}_{i}=\\,\\frac{{\\sum }_{j=1}^{n}({\\rm{estimated}}\\;{\\rm{segment}}\\;{{\\rm{CNA}}}_{j}\\times {\\rm{segment}}\\;{{\\rm{length}}}_{j})}{{\\sum }_{j=1}^{n}{\\rm{segment}}\\;{{\\rm{length}}}_{j}}$$<\/p>\n<p>n is the total number of segments in chromosome i.<\/p>\n<p>Genetic ancestry<\/p>\n<p>Consensus ancestry for TCGA cases was obtained from ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 14\" title=\"Carrot-Zhang, J. et al. Comprehensive analysis of genetic ancestry and its molecular correlates in cancer. Cancer Cell 37, 639&#x2013;654 e636 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR14\" id=\"ref-link-section-d51900554e2927\" rel=\"nofollow noopener\" target=\"_blank\">14<\/a>, determined by combining ancestry inference from five independent classification methods using SNP array and\/or WES data. Only ancestries with more than 50 patients were included in the survival analysis, which spanned 3,893 patients: European (n\u2009=\u20093,319), East Asian (n\u2009=\u2009286), African (n\u2009=\u2009190)\u00a0and African-admixed (n\u2009=\u200998).<\/p>\n<p>Genomic instability and stemness features<\/p>\n<p>Aneuploidy scores for TCGA cases were obtained from ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 50\" title=\"Taylor, A. M. et al. Genomic and functional approaches to understanding cancer aneuploidy. Cancer Cell 33, 676&#x2013;689.e3 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR50\" id=\"ref-link-section-d51900554e2952\" rel=\"nofollow noopener\" target=\"_blank\">50<\/a>. Arm-level statistics were calculated using the GISTIC (v.2.0)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 51\" title=\"Mermel, C. H. et al. GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers. Genome Biol. 12, R41 (2011).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR51\" id=\"ref-link-section-d51900554e2956\" rel=\"nofollow noopener\" target=\"_blank\">51<\/a> copy number significance software. These scores were derived by tallying the total number of amplified or deleted arms, collectively termed \u2018altered\u2019. Samples were initially categorized by tumour type, alteration type (amplification or deletion) and chromosome arm. Subsequently, samples were clustered on the basis of specific arm attributes, and arms were classified as altered if part of a cluster had a mean fraction altered of at least 80%. Intratumour heterogeneity used to generate DNA damage scores was determined by ABSOLUTE<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 52\" title=\"Carter, S. L. et al. Absolute quantification of somatic DNA alterations in human cancer. Nat. Biotechnol. 30, 413&#x2013;421 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR52\" id=\"ref-link-section-d51900554e2960\" rel=\"nofollow noopener\" target=\"_blank\">52<\/a>. ABSOLUTE analysed segmentation data from Affymetrix SNP6.0 arrays and variant calls from the MC3 variant file.<\/p>\n<p>We used two measures to assess HRD. The first was derived by Knijnenburg et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 53\" title=\"Knijnenburg, T. A. et al. Genomic and molecular landscape of DNA damage repair deficiency across The Cancer Genome Atlas. Cell Rep. 23, 239&#x2013;254.e6 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR53\" id=\"ref-link-section-d51900554e2967\" rel=\"nofollow noopener\" target=\"_blank\">53<\/a> and quantifies HRD by aggregating separate metrics of genomic scarring: large (more than 15\u2009Mb) non-arm-level regions with LOH<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 20\" title=\"Abkevich, V. et al. Patterns of genomic loss of heterozygosity predict homologous recombination repair defects in epithelial ovarian cancer. Br. J. Cancer 107, 1776&#x2013;1782 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR20\" id=\"ref-link-section-d51900554e2971\" rel=\"nofollow noopener\" target=\"_blank\">20<\/a>, large-scale state transitions (breaks between adjacent segments of greater than 10\u2009Mb, LST<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 21\" title=\"Popova, T. et al. Ploidy and large-scale genomic instability consistently identify basal-like breast carcinomas with BRCA1\/2 inactivation. Cancer Res. 72, 5454&#x2013;5462 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR21\" id=\"ref-link-section-d51900554e2975\" rel=\"nofollow noopener\" target=\"_blank\">21<\/a> and subtelomeric regions with allelic imbalance. The second measure, introduced by Telli et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 19\" title=\"Telli, M. L. et al. Homologous recombination deficiency (HRD) score predicts response to platinum-containing neoadjuvant chemotherapy in patients with triple-negative breast cancer. Clin. Cancer Res. 22, 3764&#x2013;3773 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR19\" id=\"ref-link-section-d51900554e2979\" rel=\"nofollow noopener\" target=\"_blank\">19<\/a>, incorporates LOH and TAI<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 22\" title=\"Birkbak, N. J. et al. Telomeric allelic imbalance indicates defective DNA repair and sensitivity to DNA-damaging agents. Cancer Discov. 2, 366&#x2013;375 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR22\" id=\"ref-link-section-d51900554e2983\" rel=\"nofollow noopener\" target=\"_blank\">22<\/a>. The HRD score for samples analysed via custom hybridization sequencing assay were computed using reads covering SNP positions<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 54\" title=\"Timms, K. M. et al. Association of defects with genomic scores predictive of DNA damage repair deficiency among breast cancer subtypes. Breast Cancer Res. 16, 475 (2014).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR54\" id=\"ref-link-section-d51900554e2988\" rel=\"nofollow noopener\" target=\"_blank\">54<\/a>. The HRD score was determined as the unweighted sum of LOH, TAI and LST, represented as: HRD score\u2009=\u2009LOH\u2009+\u2009TAI\u2009+\u2009LST.<\/p>\n<p>We screened samples using mRNA and DNA methylation profiles to compute four stemness scores: DNA methylation-based stemness score, epigenetically regulated DNA methylation-based stemness score, differentially methylated probe-based stemness score and enhancer elements\/DNA methylation-based stemness score as outlined in previous studies<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 23\" title=\"Malta, T. M. et al. Machine learning identifies stemness features associated with oncogenic dedifferentiation. Cell 173, 338&#x2013;354.e15 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR23\" id=\"ref-link-section-d51900554e2995\" rel=\"nofollow noopener\" target=\"_blank\">23<\/a>.<\/p>\n<p>Quantification of TNB and mutation data<\/p>\n<p>Two methodologies were used to identify potential neoantigens arising from SNVs and Indels. For SNVs, somatic nonsynonymous coding SNVs were identified and minimal peptides encompassing mutation sites were extracted, followed by prediction of binding to autologous MHC using NetMHCpan v.3.0 (ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 55\" title=\"Nielsen, M. &amp; Andreatta, M. NetMHCpan-3.0; improved prediction of binding to MHC class I molecules integrating information from multiple receptor and peptide length datasets. Genome Med. 8, 33 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR55\" id=\"ref-link-section-d51900554e3007\" rel=\"nofollow noopener\" target=\"_blank\">55<\/a>). On the other hand, Indel variants meeting specific criteria were extracted, and downstream protein sequences obtained to generate nine-mer peptides. These peptides were then evaluated for their ability to bind MHC molecules using the pVAC-Seq v.4.0.8 pipeline<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 56\" title=\"Hundal, J. et al. pVAC-Seq: a genome-guided in silico approach to identify tumor neoantigens. Genome Med. 8, 11.\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR56\" id=\"ref-link-section-d51900554e3011\" rel=\"nofollow noopener\" target=\"_blank\">56<\/a>, also using NetMHCpan v.3.0. The mutation data, specifically encompassing missense mutations and nonsense mutations, were obtained from the PanCancer Atlas consortium and used in this analysis. The dataset (<a href=\"https:\/\/gdc.cancer.gov\/about-data\/publications\/pancanatlas\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/gdc.cancer.gov\/about-data\/publications\/pancanatlas<\/a>) underwent filtering, requiring mutation calls to be generated by two or more mutation callers (NCALLERS\u2009&gt;\u20091).<\/p>\n<p>Signature calculation for bulk-seq data<\/p>\n<p>To study the consequences of LOY, a literature review was performed, and a variety of tumour-associated signatures gathered (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">3<\/a>)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Cristescu, R. et al. Molecular analysis of gastric cancer identifies subtypes associated with distinct clinical outcomes. Nat. Med. 21, 449&#x2013;456 (2015).\" href=\"#ref-CR57\" id=\"ref-link-section-d51900554e3033\">57<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Li, Y. et al. Pan-cancer characterization of immune-related lncRNAs identifies potential oncogenic biomarkers. Nat. Commun. 11, 1000 (2020).\" href=\"#ref-CR58\" id=\"ref-link-section-d51900554e3033_1\">58<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Zeng, D. Q. et al. IOBR: multi-omics immuno-oncology biological research to decode tumor microenvironment and signatures. Front. Immunol. 12, 687975 (2021).\" href=\"#ref-CR59\" id=\"ref-link-section-d51900554e3033_2\">59<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Chen, X. Y. et al. Turning up the heat on non-immunoreactive tumours: pyroptosis influences the tumor immune microenvironment in bladder cancer. Oncogene 40, 6381&#x2013;6393 (2021).\" href=\"#ref-CR60\" id=\"ref-link-section-d51900554e3033_3\">60<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Chen, X. Y. et al. CD8 T effector and immune checkpoint signatures predict prognosis and responsiveness to immunotherapy in bladder cancer. Oncogene 40, 6223&#x2013;6234 (2021).\" href=\"#ref-CR61\" id=\"ref-link-section-d51900554e3033_4\">61<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 62\" title=\"Mariathasan, S. et al. TGF&#x3B2; attenuates tumour response to PD-L1 blockade by contributing to exclusion of T cells. Nature 554, 544&#x2013;548 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR62\" id=\"ref-link-section-d51900554e3036\" rel=\"nofollow noopener\" target=\"_blank\">62<\/a>. Each signature was assessed using ssGSEA implemented through the GSVA R package<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 49\" title=\"H&#xE4;nzelmann, S., Castelo, R. &amp; Guinney, J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinf. 14, 7 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR49\" id=\"ref-link-section-d51900554e3040\" rel=\"nofollow noopener\" target=\"_blank\">49<\/a> (v.1.44.5). Scoring methods are in the figure legends. Where information is not provided in the figure legends, the methodologies were documented in the respective citations.<\/p>\n<p>To validate the CRISPR\u2013Cas9-mediated Y-KO MB49 (Y-KO)-derived gene expression signature in human BLCA data, we first performed a differential expression analysis between the Y-KO and Y-Scr groups, which identified a robust set of upregulated and downregulated genes. We then calculated a Y phenotype score by dividing the signature scores of upregulated genes by those of downregulated genes, with all values scaled to the [0,1] interval.<\/p>\n<p>Pan-cancer scRNA-seq data collection<\/p>\n<p>For pan-cancer scRNA-seq data, transcriptome data of 346 samples from 251 people across 20 scRNA-seq datasets were obtained from public studies<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Li, R. Y. et al. Mapping single-cell transcriptomes in the intra-tumoral and associated territories of kidney cancer. Cancer Cell 40, 1583&#x2013;1599.e10 (2022).\" href=\"#ref-CR63\" id=\"ref-link-section-d51900554e3055\">63<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Zhang, Y. P. et al. Single-cell analyses of renal cell cancers reveal insights into tumor microenvironment, cell of origin, and therapy response. Proc. Natl Acad. Sci. USA 118, e2103240118 (2021).\" href=\"#ref-CR64\" id=\"ref-link-section-d51900554e3055_1\">64<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Werba, G. et al. Single-cell RNA sequencing reveals the effects of chemotherapy on human pancreatic adenocarcinoma and its tumor microenvironment. Nat. Commun. 14, 797 (2023).\" href=\"#ref-CR65\" id=\"ref-link-section-d51900554e3055_2\">65<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Ma, L. C. et al. Tumor cell biodiversity drives microenvironmental reprogramming in liver cancer. Cancer Cell 36, 418&#x2013;430.e6 (2019).\" href=\"#ref-CR66\" id=\"ref-link-section-d51900554e3055_3\">66<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Regner, M. J. et al. A multi-omic single-cell landscape of human gynecologic malignancies. Mol. Cell 81, 4924&#x2013;4941.e10 (2021).\" href=\"#ref-CR67\" id=\"ref-link-section-d51900554e3055_4\">67<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Chen, Y. P. et al. Single-cell transcriptomics reveals regulators underlying immune cell diversity and immune subtypes associated with prognosis in nasopharyngeal carcinoma. Cell Res. 30, 1024&#x2013;1042 (2020).\" href=\"#ref-CR68\" id=\"ref-link-section-d51900554e3055_5\">68<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Xu, J. F. et al. Single-cell RNA sequencing reveals the tissue architecture in human high-grade serous ovarian cancer. Clin. Cancer Res. 28, 3590&#x2013;3602 (2022).\" href=\"#ref-CR69\" id=\"ref-link-section-d51900554e3055_6\">69<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Li, C. B. et al. Single-cell transcriptomics reveals cellular heterogeneity and molecular stratification of cervical cancer. Commun. Biol. 5, 1208 (2022).\" href=\"#ref-CR70\" id=\"ref-link-section-d51900554e3055_7\">70<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Luo, H. et al. Pan-cancer single-cell analysis reveals the heterogeneity and plasticity of cancer-associated fibroblasts in the tumor microenvironment. Nat. Commun. 13, 6619 (2022).\" href=\"#ref-CR71\" id=\"ref-link-section-d51900554e3055_8\">71<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Sathe, A. et al. Single-cell genomic characterization reveals the cellular reprogramming of the gastric tumor microenvironment. Clin. Cancer Res. 26, 2640&#x2013;2653 (2020).\" href=\"#ref-CR72\" id=\"ref-link-section-d51900554e3055_9\">72<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Kim, N. et al. Single-cell RNA sequencing demonstrates the molecular and cellular reprogramming of metastatic lung adenocarcinoma. Nat. Commun. 11, 2285 (2020).\" href=\"#ref-CR73\" id=\"ref-link-section-d51900554e3055_10\">73<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Lambrechts, D. et al. Phenotype molding of stromal cells in the lung tumor microenvironment. Nat. Med. 24, 1277&#x2013;1289 (2018).\" href=\"#ref-CR74\" id=\"ref-link-section-d51900554e3055_11\">74<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Chen, S. J. et al. Single-cell analysis reveals transcriptomic remodellings in distinct cell types that contribute to human prostate cancer progression. Nat. Cell Biol. 23, 87&#x2013;98 (2021).\" href=\"#ref-CR75\" id=\"ref-link-section-d51900554e3055_12\">75<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Ma, X. S. et al. Identification of a distinct luminal subgroup diagnosing and stratifying early stage prostate cancer by tissue-based single-cell RNA sequencing. Mol. Cancer 19, 147 (2020).\" href=\"#ref-CR76\" id=\"ref-link-section-d51900554e3055_13\">76<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Luo, H. et al. Characterizing dedifferentiation of thyroid cancer by integrated analysis. Sci. Adv. 7, eabf3657 (2021).\" href=\"#ref-CR77\" id=\"ref-link-section-d51900554e3055_14\">77<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Chen, Z. H. et al. Single-cell RNA sequencing highlights the role of inflammatory cancer-associated fibroblasts in bladder urothelial carcinoma. Nat. Commun. 11, 5077 (2020).\" href=\"#ref-CR78\" id=\"ref-link-section-d51900554e3055_15\">78<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Qian, J. B. et al. A pan-cancer blueprint of the heterogeneous tumor microenvironment revealed by single-cell profiling. Cell Res. 30, 745&#x2013;762 (2020).\" href=\"#ref-CR79\" id=\"ref-link-section-d51900554e3055_16\">79<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Zhang, M. et al. Single-cell transcriptomic architecture and intercellular crosstalk of human intrahepatic cholangiocarcinoma. J. Hepatol. 73, 1118&#x2013;1130 (2020).\" href=\"#ref-CR80\" id=\"ref-link-section-d51900554e3055_17\">80<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Peng, J. Y. et al. Single-cell RNA-seq highlights intra-tumoral heterogeneity and malignant progression in pancreatic ductal adenocarcinoma. Cell Res. 29, 725&#x2013;738 (2019).\" href=\"#ref-CR81\" id=\"ref-link-section-d51900554e3055_18\">81<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 82\" title=\"Lin, W. et al. Single-cell transcriptome analysis of tumor and stromal compartments of pancreatic ductal adenocarcinoma primary tumors and metastatic lesions. Genome Med. 12, 80 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR82\" id=\"ref-link-section-d51900554e3058\" rel=\"nofollow noopener\" target=\"_blank\">82<\/a>, from which tumour samples were selected for later analysis. Accession numbers for each scRNA-seq dataset and detailed clinical information for patients and samples are summarized in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">4<\/a>. To avoid issues related to platform heterogeneity, only datasets generated from 10x Genomics droplet based scRNA-seq datasets were included.<\/p>\n<p>Quality control and preprocessing of pan-cancer scRNA-seq data<\/p>\n<p>We performed quality control filtering and integration using the Scanpy package (v.1.9.5). Filtering was performed based on (1) confirmation that information was available for all nine Y signature genes, (2) cells had greater than 200 detected genes and (3) the mitochondrial gene counts were below 20%. Additional quality filters were applied to the data to remove barcodes that fell into any of the following categories: possible debris with too few genes expressed (less than 400) and too few unique molecular identifiers (UMIs) (less than 500), possibility of duplicate cells based on genes expressed (more than 5,500) or UMIs (more than 30,000). Count matrices and AnnData objects were then combined using a concatenate function, normalized to log transcripts per million units using the \u2018sc.pp.normalize_total\u2019 function, and log-transformed using the \u2018sc.pp.log1p\u2019 function. The normalized HNSC dataset from <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/geo\/query\/acc.cgi?acc=GSE150430\" rel=\"nofollow noopener\" target=\"_blank\">GSE150430<\/a> was then combined. Subsequently, non-tumour samples were removed, and we retained 1,030,968 high-quality cells and 14,689 genes for downstream analysis.<\/p>\n<p>Combining and batch effect correction of pan-cancer scRNA-seq data<\/p>\n<p>We used the scVI Python package (scvi-tools; v.1.0.4)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 83\" title=\"Lopez, R., Regier, J., Cole, M. B., Jordan, M. I. &amp; Yosef, N. Deep generative modeling for single-cell transcriptomics. Nat. Methods 15, 1053&#x2013;1058 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR83\" id=\"ref-link-section-d51900554e3088\" rel=\"nofollow noopener\" target=\"_blank\">83<\/a> to integrate and batch correct scRNA-seq data. The scVI model was trained on the scRNA-seq data, considering samples as covariates. Following batch correction, the corrected data were integrated if multiple batches were present. The effectiveness of batch correction was evaluated by assessing the reduction in batch-specific variation while ensuring preservation of signal. Downstream analyses such as clustering, differential expression analysis or pathway enrichment analysis\u00a0were performed on the batch-corrected data. Visualization of the results was achieved through two-dimensional UMAP plots, illustrating cell types, batches, datasets, gender, organs and cancer types.<\/p>\n<p>Cell-type annotation of pan-cancer scRNA-seq data<\/p>\n<p>To annotate cells, we used the scANVI algorithm from the scVI Python packages (scvi-tools; v.1.0.4) and the Luo et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 71\" title=\"Luo, H. et al. Pan-cancer single-cell analysis reveals the heterogeneity and plasticity of cancer-associated fibroblasts in the tumor microenvironment. Nat. Commun. 13, 6619 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR71\" id=\"ref-link-section-d51900554e3101\" rel=\"nofollow noopener\" target=\"_blank\">71<\/a> dataset, where cells were pre-labelled as epithelium, endothelium, fibroblast, lymphocyte, myeloid or plasma cell. Subsequently, we performed unsupervised clustering of the scANVI latent space, and then used Leiden clustering, followed by cluster assignment to specific cell types. The scANVI model was configured with max_epochs\u2009=\u200920 and cluster labels were transferred and predicted, guided by a sample size of n_samples_per_label\u2009=\u2009100. The integrated latent embedding provided by scANVI served as the basis for downstream analysis, with the dataset segregated by cell type for further investigation. To delineate cell subtypes within myeloid cell, lymphocyte and plasma cell populations, we merged corresponding AnnData and mitigated batch effects and other sources of variation using scVI. Subsequently, we predicted subtypes using Celltypist<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 84\" title=\"Conde, C. D. et al. Cross-tissue immune cell analysis reveals tissue-specific features in humans. Science 376, eabl5197 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR84\" id=\"ref-link-section-d51900554e3105\" rel=\"nofollow noopener\" target=\"_blank\">84<\/a> (v.1.6.2), using \u2018majority voting\u2019 with default parameters and the pre-trained \u2018Immune_All_Low.pkl\u2019 model.<\/p>\n<p>Annotating LOY cells in pan-cancer scRNA-seq data via Random Forest<\/p>\n<p>We collected scRNA-seq pan-cancer data from paired tumour and adjacent normal samples and, following preprocessing, categorized cells from adjacent normal samples as: male cells as having wild-type Y (WTY) chromosomes and female cells as LOY cells. Employing the train_test_split function from sklearn.model_selection, we divided data from the normal samples into training and test sets, with a split ratio of 70% for training and 30% for testing. Next, we trained a Random Forest classifier model (RandomForestClassifier from the sklearn.ensemble Python package (v.1.3.2)) to differentiate LOY and WTY cells based on the expression levels of the nine Y chromosome genes used for the bulk RNA-seq classification of LOY samples. The performance of the model was assessed using the test set, achieving an accuracy score of 0.83.<\/p>\n<p>To further validate LOY prediction by a Random Forest model, we obtained 23 samples from Liu et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 24\" title=\"Liu, X. et al. Th17 cells secrete TWEAK to trigger epithelial-mesenchymal transition and promote colorectal cancer liver metastasis. Cancer Res. 84, 1352&#x2013;1371 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR24\" id=\"ref-link-section-d51900554e3120\" rel=\"nofollow noopener\" target=\"_blank\">24<\/a>, including sequencing data for single-cell RNA (<a href=\"https:\/\/www.ncbi.nlm.nih.gov\/geo\/query\/acc.cgi?acc=GSE245552\" rel=\"nofollow noopener\" target=\"_blank\">GSE245552<\/a>), bulk RNA (<a href=\"https:\/\/www.ncbi.nlm.nih.gov\/geo\/query\/acc.cgi?acc=GSE255163\" rel=\"nofollow noopener\" target=\"_blank\">GSE255163<\/a>) and WES (<a href=\"https:\/\/www.ncbi.nlm.nih.gov\/geo\/query\/acc.cgi?acc=GSE255165\" rel=\"nofollow noopener\" target=\"_blank\">GSE255165<\/a>). YchrS and average CNA of Y chromosome were calculated using the same method as CCLE data. LOY at the single-cell level were predicted by the same Random Forest model used for the pan-cancer single-cell datasets. Owing to absence of RPS4Y1 expression, it was set as 0 for all the cells when applying the Random Forest model.<\/p>\n<p>Genomic DNA isolation and WES<\/p>\n<p>Genomic DNA was isolated from CRISPR Y-KO cells and CRISPR\u2013Cas9-mediated Scr MB49 Y+ control (CRISPR Y-Scr) cell lines using the Invitrogen kit (catalogue no. K1820) following the manufacturer\u2019s instructions. DNA samples for WES were submitted to Novogene, where library preparation, sequencing and bioinformatics analysis were conducted. The genomic DNA was fragmented randomly into short pieces, end-repaired, A-tailed and ligated with Illumina adaptors. Following PCR amplification, size selection and purification, hybridization capture of libraries was performed. Captured libraries were further enriched by PCR amplification and assessed for quality using Qubit and bioanalyzer systems. The libraries were pooled and sequenced using Illumina platforms with the PE150 strategy. Sequencing data were processed using the GATK best practices workflow. Paired-end clean reads were aligned to the mouse reference genome (GRCm39\/mm39) using the Burrows\u2013Wheeler aligner. The resulting alignments were sorted with Sambamba and duplicate reads were marked using Picard. The coverage and sequencing depth were computed, and SNP and INDEL variants were identified.<\/p>\n<p>scRNA-seq of mouse tumour tissues<\/p>\n<p>A total of 1\u2009\u00d7\u2009105 LOY MB49 cells (a LOY clonal line, MB49 clone 5 (C5)) were injected subcutaneously into C57Bl\/6N mice (n\u2009=\u20094). Once the tumours reached 500\u2009mm3,\u00a0they were removed and processed for scRNA-seq. The tumours were cut and transferred immediately to MACS C-tubes along with chilled DMEM and tumour dissociation enzymes for mouse (Miltenyi Biotech, catalogue no. 130-096-730). The dissociated tumours were then processed using ACK Lysis buffer (Gibco, catalogue no. 2537772), dead cell removal kit (Stem Cell Technologies, catalogue no. 17899) and EasySep Mouse CD45 Positive Selection Kit (Stem Cell Technologies, catalogue no. 18945). The CD45-enriched cells were next stained with Hashtag antibodies (TotalSeq-B0301 anti-mouse Hashtag 1 Antibody; TotalSeq-B0302 anti-mouse Hashtag 2 Antibody) and stained sequentially for CD45, CD3, CD4, CD8, CD11b, F4\/80 and B220 along with 4\u2032,6-diamidino-2-phenylindole (DAPI) and sorted for CD4+, CD8+ and CD11b+ and mixed into equal ratios. This mixture of highly enriched CD45+ cells was combined in a 1:1 ratio with live CD45\u2212 cells to make a final mixture that was sent for scRNA-seq.<\/p>\n<p>The Cedars-Sinai Applied Genomics, Computation and Translational Core used 10x genomics 3\u2032 scRNA-seq to sequence all samples to around 60% saturation. Samples were processed using Cell Ranger (10x genomics) based on a pre-mRNA GRCh38 reference. Since the samples were not hashed, potential doublet cells were identified using Scrublet applied to the filtered feature barcode matrices from Cell Ranger. Scrublet analysed the 10% most variable genes, identified by Scanpy package (v.1.9.5, scanpy.pp.highly_variable_genes function), predicting a 10% doublet rate, and then discarded doublet cells. Finally, nuclei with over 10% of their UMIs linked to mitochondrial genes or those in the top and bottom 5% based on the number of unique genes and UMI count were also removed.<\/p>\n<p>One female and one male normal healthy C57BL\/6N bladder sample from our previous study<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 31\" title=\"Abdel-Hafiz, H. A. et al. Single-cell profiling of murine bladder cancer identifies sex-specific transcriptional signatures with prognostic relevance. iScience 26, 107703 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR31\" id=\"ref-link-section-d51900554e3190\" rel=\"nofollow noopener\" target=\"_blank\">31<\/a> were also analysed. Filtering was performed based on: (1) cells had more than 200 detected genes and (2) the mitochondrial gene counts were below 20%. Additional quality filters were applied to the data to remove barcodes that fell into any of the following categories: possible debris with too few genes expressed (less than 400) and too few UMIs (less than 500), possibility of duplicate cells based on genes expressed (more than 30,000) or UMIs (more than 5,500). We normalized the data to 1\u2009\u00d7\u2009104 counts per cell and calculated the base-10 logarithm. We used sc.pp.combat to remove the batch effect and applied subsequent downstream analyses on the batch-corrected data. To annotate cells, we used the scVI and scANVI algorithm from the scVI Python packages (scvi-tools).<\/p>\n<p>Analysis for xenograft scRNA-seq datasets<\/p>\n<p>To further investigate the ability of LOY malignant cells to induce LOY in benign cells in the TME, we downloaded public scRNA-seq datasets of human xenografts in immunocompromised mice from <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/geo\/query\/acc.cgi?acc=GSE254890\" rel=\"nofollow noopener\" target=\"_blank\">GSE254890<\/a> (ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 35\" title=\"Hosohama, L. et al. Colorectal cancer stem cell subtypes orchestrate distinct tumor microenvironments. Preprint at bioRxiv &#010;                  https:\/\/doi.org\/10.1101\/2024.04.25.591144&#010;                  &#010;                 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR35\" id=\"ref-link-section-d51900554e3211\" rel=\"nofollow noopener\" target=\"_blank\">35<\/a>) (SW480 cells from a male patient\u00a0with CRC injected into male mice, 14 samples were incorporated as SW480 group), <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/geo\/query\/acc.cgi?acc=GSE110501\" rel=\"nofollow noopener\" target=\"_blank\">GSE110501<\/a> (ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 85\" title=\"Zhao, Q. et al. Single-cell transcriptome analyses reveal endothelial cell heterogeneity in tumors and changes following antiangiogenic treatment. Cancer Res. 78, 2370&#x2013;2382 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR85\" id=\"ref-link-section-d51900554e3222\" rel=\"nofollow noopener\" target=\"_blank\">85<\/a>; only eight samples from normal tissues were incorporated as male control) and <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/geo\/query\/acc.cgi?acc=GSE144236\" rel=\"nofollow noopener\" target=\"_blank\">GSE144236<\/a> (ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 86\" title=\"Ji, A. L. et al. Multimodal analysis of composition and spatial architecture in human squamous cell carcinoma. Cell 182, 1661&#x2013;1662 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR86\" id=\"ref-link-section-d51900554e3234\" rel=\"nofollow noopener\" target=\"_blank\">86<\/a>; A431, SCC and CAL27 injected into female mice, three samples incorporated as female control). Based on scRNA-seq and bulk RNA-seq data provided, SW480 cells used were LOY cells, which matched with the Y chromosome information obtained by our CCLE analysis on RNA-seq data. Mouse cells were selected either by tumour cell depletion using FITC conjugated antibodies, or by expression level of mouse genes compared with human genes. Potential debris (cells with fewer than 200 expressed genes or 400 UMIs) and possible doublets (cells with more than 8,500 expressed genes or 30,000 UMIs) were filtered out. After normalization, batch correction and cell type annotation were performed by scVI and scANVI. YchrS was calculated \u2018scanpy.tl.score_genes\u2019 using all Ychr gene expression.<\/p>\n<p>Mouse HCC studiesMice<\/p>\n<p>MUP-uPA+ on C57BL\/6 (ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 87\" title=\"Weglarz, T. C., Degen, J. L. &amp; Sandgren, E. P. Hepatocyte transplantation into diseased mouse liver. Kinetics of parenchymal repopulation and identification of the proliferative capacity of tetraploid and octaploid hepatocytes. Am. J. Pathol. 157, 1963&#x2013;1974 (2000).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR87\" id=\"ref-link-section-d51900554e3252\" rel=\"nofollow noopener\" target=\"_blank\">87<\/a>) background\u20091 were bred and housed under specific pathogen-free conditions in an American Association for Accreditation of Laboratory Animal Care-approved barrier facility at Cedars-Sinai Medical Center. MUP-uPA+ mice were fed a Western diet (Teklad, catalogue no. TD.88137) for 8\u2009months beginning at 8\u2009weeks after birth. HCC development was analysed at 10 months of age.<\/p>\n<p>Tissue preparation for mouse scRNA-seq and single-nuclei RNA-seq<\/p>\n<p>Mice were killed by CO2 inhalation and livers were perfused with PBS containing 2% of heparin (20 USP units\u2009ml\u22121) to remove traces of blood. For single-nucleus preparation, livers were isolated, tumour tissues were dissected and cut into 50\u2009mg tumour tissue pieces for single-nucleus isolation and sequencing. Tissue was frozen in dry ice (solid CO2) and kept in liquid nitrogen for long-term storage. For single-cell preparation, livers were isolated,\u00a0and tumour tissues were dissected and digested using a cocktail of digestion enzymes containing collagenase\u2009I (450\u2009U\u2009ml\u22121) (Sigma-Aldrich, catalogue no. C0130) and DNase\u2009I (120\u2009U\u2009m\u22121) (Sigma-Aldrich, catalogue no. D4263) in PBS (with Ca2+\/Mg2+) for 30\u2009min at 37\u00b0C with gentle shaking at 150\u2009rpm for liver immune cell isolation. After incubation, cell suspensions were filtered through a 70\u2009\u00b5m cell strainer. Immune cells were enriched by density-gradient centrifugation over Percoll (GE Healthcare, catalogue no. 17-0891-01) at 1,000g for 25\u2009min without brake (40% Percoll in RPMI-1640 and 80% Percoll in 5% FBS\/PBS). Leukocyte rings on a border of gradient were collected, washed and stained. Immune cell suspensions were stained with Zombie Aqua (BioLegend, catalogue no. 423101) on ice for 15\u2009min to exclude dead cells, incubated with Fc Block TruStain FcX (Clone 93, BioLegend, catalogue no. 101320, RRID: AB_1574975) for 20\u2009min in 2% FBS-PBS and then stained with fluorochrome labelled antibody for 30\u2009min on ice (CD45-PerCP\/Cyanine5.5 (QA17A26, BioLegend, catalogue no. 157612; RRID, catalogue no. AB_2832558, 1:100)). All the flow cytometry antibodies were validated by the manufacturer (BioLegend) and were validated in the laboratory in single channel controls. Live, CD45+ cells were sorted by BD sorter Aria\u2009III using a 100-\u00b5m nozzle.<\/p>\n<p>Mouse scRNA-seq<\/p>\n<p>The single-cell droplets were generated with a Chromium X controller using Chromium Next GEM Single Cell 3\u2032 Reagent Kits v3.1 (Dual Index) (10X Genomics, catalogue no. 1000268). Approximately 8,000 to 10,000 cells were collected to make cDNA at the single-cell level. cDNA amplification and library construction were performed according to the manufacturer\u2019s instructions. All cDNA and libraries were quantified via Agilent Technologies 2100 Bioanalyzer. Gene expression libraries were sequenced at a targeted depth of 50,000 reads per cell on the Illumina Novaseq X plus (Illumina) at Novogene. Fastq files were obtained and then processed with Cell Ranger v.8.0.1 aligning to the mouse (mm10) 2020-A reference genome on 10x Genomics Cloud Analysis.<\/p>\n<p>Mouse single-nucleus RNA sequencing<\/p>\n<p>Single nuclei were isolated from frozen tumour tissues using the Chromium Nuclei Isolation Kit (10x Genomics, catalogue no. 1000493) according to the manufacturer\u2019s instructions. cDNA amplification, library construction, sequencing and genome mapping were performed\u00a0in\u00a0the same way as for mouse scRNA-seq.<\/p>\n<p>Validation of LOY effect\u00a0using independent scRNA-seq data<\/p>\n<p>Processed scRNA-seq data and corresponding cell type information for 116 liver cancer samples from 94 male patients<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 40\" title=\"Xue, R. et al. Liver tumour immune microenvironment subtypes and neutrophil heterogeneity. Nature 612, 141&#x2013;147 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR40\" id=\"ref-link-section-d51900554e3312\" rel=\"nofollow noopener\" target=\"_blank\">40<\/a> were analysed using our Random Forest model to predict LOY at the single-cell level. Only primary tumour or metastasis samples were included in the survival analysis.<\/p>\n<p>InferCNV analysis<\/p>\n<p>For the results presented in Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#Fig13\" rel=\"nofollow noopener\" target=\"_blank\">8e,f<\/a>, CNVs in the scRNA-seq data were predicted using the InferCNV tool (<a href=\"https:\/\/github.com\/broadinstitute\/inferCNV\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/github.com\/broadinstitute\/inferCNV<\/a>; v.1.13.0), so that differences in the frequencies between the LOYSCR and WTYSCR epithelial cells of gains or deletions of entire chromosomes or large chromosomal segments could be identified. The algorithm was run with default parameters, using all WTYSCR stromal cells and immune cells as reference cells. For the results presented in Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#Fig11\" rel=\"nofollow noopener\" target=\"_blank\">6c,d<\/a>, the analysis and figure were generated using Infercnvpy (<a href=\"https:\/\/github.com\/icbi-lab\/infercnvpy\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/github.com\/icbi-lab\/infercnvpy<\/a>; v.0.4.5).<\/p>\n<p>Functional signature calculation for scRNA-seq data<\/p>\n<p>We used the \u2018scanpy.tl.score_genes\u2019 function from the Scanpy Python package (v.1.9.5) to compute gene set scores across cells. This function calculates gene scores for each gene listed<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 62\" title=\"Mariathasan, S. et al. TGF&#x3B2; attenuates tumour response to PD-L1 blockade by contributing to exclusion of T cells. Nature 554, 544&#x2013;548 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR62\" id=\"ref-link-section-d51900554e3359\" rel=\"nofollow noopener\" target=\"_blank\">62<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 88\" title=\"Thorsson, V. et al. The immune landscape of cancer. Immunity 48, 812&#x2013;830.e814 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR88\" id=\"ref-link-section-d51900554e3362\" rel=\"nofollow noopener\" target=\"_blank\">88<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 89\" title=\"Chu, Y. S. et al. Pan-cancer T cell atlas links a cellular stress response state to immunotherapy resistance. Nat. Med. 29, 1550&#x2013;1562 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR89\" id=\"ref-link-section-d51900554e3365\" rel=\"nofollow noopener\" target=\"_blank\">89<\/a> in \u2018gene_list\u2019 across all cells stored within the dataset.<\/p>\n<p>Sorting immune and epithelial cells from tumours and PBMCs<\/p>\n<p>We used three distinct models to examine LOY in vivo: (1) naturally occurring Y+ and LOY (Y\u2212) cells<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\" title=\"Abdel-Hafiz, H. A. et al. Y chromosome loss in cancer drives growth by evasion of adaptive immunity. Nature 619, 624&#x2013;631 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR7\" id=\"ref-link-section-d51900554e3381\" rel=\"nofollow noopener\" target=\"_blank\">7<\/a>, (2) an established LOY clonal line (C5) and (3) CRISPR-engineered Y-KO and Y-Scr cells<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\" title=\"Abdel-Hafiz, H. A. et al. Y chromosome loss in cancer drives growth by evasion of adaptive immunity. Nature 619, 624&#x2013;631 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR7\" id=\"ref-link-section-d51900554e3385\" rel=\"nofollow noopener\" target=\"_blank\">7<\/a>. For the tumour challenge, 1\u2009\u00d7\u2009105 cells from each line\u2014MB49 clone 5 (C5), Y\u2212 (LOY), Y+ (WTY), CRISPR Y-KO and CRISPR Y-Scr\u2014were injected subcutaneously into the flanks of C57Bl\/6N mice (n\u2009=\u20097 per group) obtained from Taconic Biosciences<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\" title=\"Abdel-Hafiz, H. A. et al. Y chromosome loss in cancer drives growth by evasion of adaptive immunity. Nature 619, 624&#x2013;631 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR7\" id=\"ref-link-section-d51900554e3399\" rel=\"nofollow noopener\" target=\"_blank\">7<\/a>.The DNA levels of Y chromosome genes in each engineered cell type were checked for abundance before injection. Subcutaneous tumours from each group were disrupted mechanically, filtered through a 70\u2009\u00b5m cell strainer (Corning, catalogue no. 352350) with RPMI-1640 cell culture grade media (Gibco, catalogue no. 11875093). ACK lysis buffer (Gibco, catalogue no. A1049201) was used to disrupt the infiltrating red blood cells. The single cells were then centrifuged and resuspended in 1\u00d7 HBSS buffer (Gibco, catalogue no. 14025092) and counted in a cell counter machine. Tumours with viability of greater than 60% were used for subsequent procedures. EasySep Dead Cell Removal (Annexin V) Kit (Stem Cell Technologies, catalogue no. 17899) was used to increase the viability of each tumour and remove the dead cells. Viable cells were next processed with EasySep Mouse CD45 Positive Selection Kit (Stem Cell Technologies, catalogue no. 18945). The CD45 cells with the positive magnetic beads were purified and resuspended in EasySep Buffer (Stem Cell Technologies, catalogue no. 20144). These cells were next stained with CD45-Alexa Fluor 488 (BioLegend, catalogue no. 103122, 1:40) and Viability ghost dye Red 710 (Cytek, catalogue no. SKU 13-0871-T100) and sorted for only CD45+ cells in BD AriaIII machine. These highly purified CD45+ immune cells were next used for isolating high-quality genomic DNA with Monarch Nucleic Acid Purification Kits (NEB) and quantified. The flow through obtained after the CD45 positive selection, containing stromal, endothelial or other blood cells, was next processed with the\u00a0EasySep Mouse Epithelial Cell Enrichment Kit II (Stem Cell Technologies, catalogue no. 19868), to isolate only the epithelial cells. The purity of pre- and post-isolation populations was assessed by antibody staining of random samples from each group, using the CD45 AF488 (BioLegend, catalogue no. 103122, 1:40) antibody and the BD Symphony A5, with results being analysed using Flow Jo software v.10.9.0.<\/p>\n<p>For isolating T\u2009cells from these tumours (n\u2009=\u20096 each group) and their respective PBMCs, the following panel was used: CD45 (BioLegend, catalogue no. 103116; 1:40); CD3 (BD, catalogue no. 749276, 1:40); TCR \u03b2 chain (TCRb, BioLegend, catalogue no. 109205, 1:40); CD11b (BD, catalogue no. 563553, 1:40); CD45R (BD, catalogue no. 612950, 1:40); Ghost dye Red 710 viability dye (Tonbo Biosciences, catalogue no. 13-0871-T100). Three compartments were sorted with BD FACSymphony S6 machine: epithelial (Ghost Red 710 dye\u2212CD45\u2212 or CD45\u2212TdTomato+); non-T immune cells (Ghost Red 710 dye\u2212CD45+CD11b+B220+) and T\u2009cells (Ghost Red 710 dye\u2212CD45+CD11b\u2212B220\u2212CD3+TCRb+). The sorting gates and FACS data are shown in Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>. Subsequently, the sorted cell compartments from tumours from the both groups were used for high-quality DNA extraction and subsequent qPCR. FC was calculated by comparing individual dCT values to respective average WTY(Y+) dCt or average Y SCR dCt values.<\/p>\n<p>To get substantial amount of viable T\u2009cells for DNA isolation from subcutaneous tumours, the tumours of one group were mashed and pooled together and then proceeded for sorting. However, after sorting, the T\u2009cells were aliquoted randomly to four tubes and then processed for DNA isolation, to increase the efficacy of the result and decrease the chances of human error. Therefore, even if each group had six tumours pooled together, the graph in Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#Fig15\" rel=\"nofollow noopener\" target=\"_blank\">10c<\/a> (right panel) shows four dots of one colour.<\/p>\n<p>RNA-seq of CD45\u2212 cells from CRISPR Y-KO and Y-Scr tumours<\/p>\n<p>Tumours derived from subcutaneous injection of CRISPR Y_KO and CRISPR Y-Scr cells into the flanks of C57BL\/6N mice were removed and processed. Tumour-derived single-cell suspensions were subjected to FACS to isolate CD45\u2212 cells as described above. Total RNA was extracted from the isolated cells using the RNeasy Plus Mini Kit with gDNA Eliminator columns (Qiagen), following the manufacturer\u2019s protocol. RNA sequencing library preparation and sequencing were performed by Novogene. Quality assessment of RNA-seq data, including sequence, alignment and quantification metrics, was conducted using FastQC v.0.12.1 and summarized with MultiQC v.1.13. Illumina Truseq adaptor, polyA and polyT sequences were trimmed using Trimmomatic v.0.39. The trimmed reads were aligned to the mouse genome (GRCm39\/mm39) using STAR aligner v.2.5.2b, with parameters aligned to the ENCODE long RNA-seq pipeline recommendations (<a href=\"https:\/\/github.com\/ENCODE-DCC\/long-rna-seq-pipeline\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/github.com\/ENCODE-DCC\/long-rna-seq-pipeline<\/a>). Gene-level expression was quantified using featureCounts v.1.5.3, using Ensembl gene annotations (release v.113) for both alignment and quantification.<\/p>\n<p>Genes with low expression were filtered out by applying a threshold of sum of estimated counts (from featureCounts) of at least ten. Differential gene expression analysis was performed on filtered estimated read counts using the R Bioconductor package DESeq2 v.1.42.1, using a generalized linear model with a negative binomial distribution. Differentially expressed genes were identified based on a Benjamini\u2013Hochberg adjusted P value\u2009&lt;\u20090.05 and FC cut-off (\u22652 or \u2264\u22122). To validate the Y-KO-derived gene phenotype signature in human cancer data, we first excluded all Y-linked genes to prevent bias in the differential expression results. We then calculated a LOY gene phenotype score by dividing the signature scores of up-regulated genes by those of down-regulated genes, with scores scaled to a [0,1] interval.<\/p>\n<p>BBN treatment and PBMC isolation<\/p>\n<p>For the BBN experiment, 8-week-old mice (n\u2009=\u20093\u20134 mice per timepoint) were administered with 0.5% BBN water for 12\u2009weeks. After 12\u2009weeks, BBN was replaced with standard tap water. Mice were killed at 2, 4, 12, 20 and 25\u2009weeks using isoflurane. Subsequently, blood was drawn directly from the heart using an ethylenediaminetetraacetic acid (EDTA)-prewet insulin syringe and collected in an EDTA Microvette. From each mouse, a range of 700 to 1,000\u2009\u00b5l of blood was collected. To enhance yield and DNA quality, mice were pooled in each group. Pooled samples were diluted 1:1 (v:v) with PBS-EDTA 0.1\u2009M and then stratified on Histopaque-1077 Ficoll (Sigma, catalogue no. GE17-5446-02) at a ratio of 3:1 (v:v). Samples were centrifuged for 30\u2009min at 400\u2009rcf, with acceleration and break ramps set to 0 to allow gradual phase separation. The resulting PBMC ring was collected and washed once with PBS (1X). A subsequent centrifugation of 5\u2009min at 400\u2009rcf (with acceleration and break ramps set to maximum speed) resulted in a pellet that was processed for DNA extraction.<\/p>\n<p>Tissue microarray<\/p>\n<p>A human BLCA tissue microarray (TMA) with 33 unique cases comprised of triplicate cores from each patient tumour with an individual core size of 1\u2009mm was used. The TMA was comprised of both male (n\u2009=\u200918) and female (n\u2009=\u200915) patients. Cores from female patients were used as controls for FISH.<\/p>\n<p>XY FISH staining<\/p>\n<p>The unstained TMA formalin-fixed paraffin-embedded sections (4\u2009\u03bcm) were baked at 55\u201360\u2009\u00b0C overnight before subjecting the slide to the following steps on the Abbott VP2000 FISH Instrument: (1) deparaffination of the slide using xylene, (2) pre-treatment of the slide using 0.2\u00a0N\u2009HCl and 1\u2009M NaSCN, (3) protease treatment with pepsin, (4) fixation in 10% buffered formalin and, finally, (5) dehydration in series of increasing concentration (70%, 85% and 100%) ethanol. The slide was then subjected to a co-denaturation step using ThermoBrite (melting temperature 73\u2009\u00b0C, 5\u2009min; hybridization temperature 37\u2009\u00b0C overnight). Post hybridization the slide was washed twice with SSC\/0.3% NP-4 72\u2009\u00b1\u20091\u2009\u00b0C for 2\u2009min and twice with 2\u00d7 SSC\/0.3% NP-4 15\u201330\u2009\u00b0C for 1\u2009min. Finally, the slide was counterstained with nuclear DAPI before sealing with coverslip for visualization. The fluorescence tags were as follows: CEPX Xp11.1-q11.1\u2014Spectrum Green (excitation, 497\u2009nm; emission, 524\u2009nm) CEPY Yp11.1-q11.3\u2014Spectrum Orange (excitation, 559\u2009nm; emission, 588\u2009nm) and 18S RNA probe\u2014Spectrum Aqua (excitation, 433\u2009nm; emission, 480\u2009nm).<\/p>\n<p>IHC staining<\/p>\n<p>Formalin-fixed paraffin-embedded samples were sectioned at 4-\u03bcm thickness onto Superfrost Plus slides (Fisher Scientific, catalogue no. 12-550-15). IHC staining was performed on the Ventana Discovery Ultra Instrument (Roche) as described<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 90\" title=\"Ho, A. S. et al. Comparative proteomic analysis of HPV(+) oropharyngeal squamous cell carcinoma recurrence. J. Proteome Res. 21, 200&#x2013;208 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR90\" id=\"ref-link-section-d51900554e3522\" rel=\"nofollow noopener\" target=\"_blank\">90<\/a>. After applying antigen retrieval buffer (CC1 (Tris, pH\u20098.0) (Roche Ventana, catalogue no. 950-124), CD45 primary antibody (Cell Signaling, catalogue no. 13917S, rabbit monoclonal) was applied. Primary antibody was diluted antibody dilution buffer (Roche Ventana, catalogue no. ADB250) for 1\u2009h at room temperature: anti-CD45 (1:500). DISC anti-Rabbit HQ (Roche Ventana, catalogue no. 760-4815) was then applied for chromagen staining. After DAPI nuclear counterstain, the tissue area was covered coverslipped and mounted with ProLong antifade medium (Invitrogen, catalogue no. P36984).<\/p>\n<p>Whole-slide imaging<\/p>\n<p>XY FISH immunofluorescence slides were scanned using the ZEISS Axio Scan.Z1 whole slide scanner at \u00d720 magnification (Plan-Apochromat lens (numerical aperture, 0.8; M27)). TMA tissue cores were outlined with permanent marker pen on the coverslip for tissue detection. Region of scan was generated by a polygon tool and the raw focus map was generated using the \u2018every-2-tiles\u2019 strategy (z-range 150\u2009\u00b5m, 21.04-\u00b5m step size) under \u00d75 lens (Fluor \u00d75\/0.25 M27), while a fine focus map was generated using onion skin (z-range 100\u2009\u00b5m, 2.06-\u00b5m step size, 0.1 density, 24 maximum number of points). Both focus maps were generated in the DAPI channel at 2% LED intensity, 50-ms exposure time. Spectrum Green (X probe) was excited at 495-nm wavelength (5% LED intensity, 150-ms exposure time) and detected at 500\u2013550-nm bandwidth. Spectrum Orange (Y probe) was excited at 548-nm wavelength (25% LED intensity, 150\u2009ms exposure time), detected at 570\u2013640-nm bandwidth. Nuclear DAPI fluorescence was excited at 420-nm wavelength (2.5% LED intensity, 50-ms exposure time), detected at 430\u2013470-nm bandwidth. Spectrum Aqua (18S RNA probe) was excited at 434-nm wavelength (5% LED intensity, 150-ms exposure time) and detected at 460\u2013500-nm bandwidth. All signals were detected by Hamamatsu Orca Flash camera and 16-bit depth format image setting was applied. Standard IHC slide was imaged using the Leica Aperio AT2 whole slide scanner at \u00d720 magnification.<\/p>\n<p>Image quantitation and analysis: HALO AI module<\/p>\n<p>The whole slide image obtained from Zeiss Axio scan system was imported into HALO AI Module (Indica Labs), v.4.0.5107.318, for analysis. Upon import, the TMA image underwent segmentation to identify individual tissue cores. Missing cores were identified and removed from the analysis. The remaining cores were processed using the Nuclei Segmentation AI module, following the manufacturer\u2019s guidelines. For AI training, seven distinct regions of interest were selected, comprising a total of 43 nuclei, to refine the Nuclei Segmentation plugin. Following segmentation, FISH analysis was conducted using the HALO FISH module, v.3.2.3, to detect nuclear signals. The resulting data were exported as.csv files, containing object-level (cell) data for subsequent analysis.<\/p>\n<p>Validation of LOY correlation\u00a0via FACS-sorted scRNA-seq data<\/p>\n<p>To further validate the results and minimize the impact of mis-annotating LOY epithelial cells as LOY immune cells, we analysed 21 CD45-based FACS-sorted samples from three independent public scRNA-seq datasets (HNSC, <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/geo\/query\/acc.cgi?acc=GSE182227\" rel=\"nofollow noopener\" target=\"_blank\">GSE182227<\/a> (ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 32\" title=\"Puram, S. V. et al. Cellular states are coupled to genomic and viral heterogeneity in HPV-related oropharyngeal carcinoma. Nat. Genet. 55, 640&#x2013;650 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR32\" id=\"ref-link-section-d51900554e3563\" rel=\"nofollow noopener\" target=\"_blank\">32<\/a>); CHOL, <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/geo\/query\/acc.cgi?acc=GSE171899\" rel=\"nofollow noopener\" target=\"_blank\">GSE171899<\/a> (ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 33\" title=\"Alvisi, G. et al. Multimodal single-cell profiling of intrahepatic cholangiocarcinoma defines hyperactivated Tregs as a potential therapeutic target. J. Hepatol. 77, 1359&#x2013;1372 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR33\" id=\"ref-link-section-d51900554e3574\" rel=\"nofollow noopener\" target=\"_blank\">33<\/a>) and BLCA, <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/geo\/query\/acc.cgi?acc=GSE211388\" rel=\"nofollow noopener\" target=\"_blank\">GSE211388<\/a> (ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 34\" title=\"Yu, H. et al. Tumor-infiltrating myeloid cells confer de novo resistance to PD-L1 blockade through EMT-stromal and Tgf&#x3B2;-dependent mechanisms. Mol. Cancer Ther. 21, 1729&#x2013;1741 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR34\" id=\"ref-link-section-d51900554e3586\" rel=\"nofollow noopener\" target=\"_blank\">34<\/a>)). This collection included 12 matched CD45+ and CD45\u2212 samples from six tumours. Detailed dataset and sample information are provided in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">7<\/a>. The same quality control, normalization and batch correction procedures described for the pan-cancer human scRNA-seq datasets were applied. CD45 expression was validated to ensure the purity of the FACS-selected samples. LOY cells were predicted using a Random Forest model.<\/p>\n<p>To further validate the accumulation of LOY immune cells in tumours, we analysed two additional datasets: RCC dataset<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 36\" title=\"Li, R. et al. Mapping single-cell transcriptomes in the intra-tumoral and associated territories of kidney cancer. Cancer Cell 40, 1583&#x2013;1599 e1510 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR36\" id=\"ref-link-section-d51900554e3600\" rel=\"nofollow noopener\" target=\"_blank\">36<\/a> (accession number, <a href=\"https:\/\/ega-archive.org\/datasets\/EGAD00001008030\" rel=\"nofollow noopener\" target=\"_blank\">EGAD0001008030<\/a>), comprising 14 matched tumour (also included in the pan-cancer datasets) and blood samples. HNSC dataset (accession number, <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/geo\/query\/acc.cgi?acc=GSE139324\" rel=\"nofollow noopener\" target=\"_blank\">GSE139324<\/a> (ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 37\" title=\"Cillo, A. R. et al. Immune landscape of viral- and carcinogen-driven head and neck cancer. Immunity 52, 183&#x2013;199 e189 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR37\" id=\"ref-link-section-d51900554e3618\" rel=\"nofollow noopener\" target=\"_blank\">37<\/a>)), containing 38 matched TIL and PBMC samples. Detailed dataset and sample information are presented in Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">4<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">7<\/a>. The same quality control, normalization, batch correction and LOY cell prediction methods were applied. Additionally, cell types were identified using the scANVI algorithm and marker gene expression, with annotated pan-cancer scRNA-seq datasets serving as references.<\/p>\n<p>Long-term in vitro T cell stimulation assay<\/p>\n<p>Mouse CD8+ T\u2009cells were isolated from spleens of C57BL\/6N mice using the Mouse CD8+ T\u2009Cell Isolation kit (Miltenyi Biotec, catalogue no. 130-104-075). CD8+ T\u2009cells were then activated by seeding onto six-well plates coated with 1\u2009\u00b5g\u2009ml\u22121 anti-CD3 (clone 2C11, BioLegend, catalogue no. 100302) and anti-CD8 (clone 37.51, BioLegend, catalogue no. 102102). T\u2009cells were cultured in RPMI supplemented with 10% fetal calf serum, 1% penicillin\u2013streptomycin, 50\u2009\u00b5M 2-mercaptoethanol, 1% insulin transferrin sodium selenite as well as 0.1\u2009\u00b5g\u2009ml\u22121 IL-7 (Peprotech, catalogue no. 217-17) and IL-2 (Peprotech, catalogue no. 212-12). T\u2009cells were kept at no higher than 1\u2009\u00d7\u2009106\u2009ml\u22121 and transferred to new coated plates every 2\u20133\u2009days to maintain activation. DNA and RNA was isolated at the timepoints indicated using the Monarch Genomic DNA Purification Kit (New England Biolabs, catalogue no. T3010L) or RNeasy Plus Mini Kit with gDNA Eliminator (Qiagen, catalogue no. 74134), respectively. CDNA was generated with Maxima H Minus cDNA Synthesis Master Mix (Thermo Fisher, catalogue no. M1662) followed by qPCR. Data were normalized to day\u20091.<\/p>\n<p>Quantitative PCR<\/p>\n<p>Genomic DNA (10\u2009ng per reaction) was used to detect and quantify Y\u2010chromosome-specific genes associated with LOY signatures\u2014Kdm5d, Uty, Eif2s3y, Ddx3y, Ssty1, Ssty2\u00a0and Zfy1\/2\u2014as well as to detect the presence of Cas9 in immune and epithelial compartments. Housekeeping genes (B2m, Gapdh and Actb) served as endogenous controls. All qPCR reactions were carried out using SYBR Green Universal Master Mix (Applied Biosystems, catalogue no. 4309155) on a Quant Studio 6 Flex Real-Time PCR system (Applied Biosystems). To assess Y chromosome copy number in various TIL immune cell populations (T\u2009cells and non-T\u2009cells), we compared cycle threshold (Ct) values from each sorted population against Ct values from wild-type male tumour cells, using the \u0394\u0394Ct method to calculate FC. For Cas9 detection, Ct values in sorted immune populations were compared with DNA from wild-type C57Bl\/6N mice, which have no Cas9 integration, using Gapdh and Actb. Absence (or near-absence) of Cas9 amplification in immune compartments confirmed that these cells were not contaminated by, or had phagocytosed, genome-edited epithelial cells (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>).<\/p>\n<p>Details of primer sequences used in this manuscript for DNA qPCR are as follows:<\/p>\n<p>Uty forward, TCACCCTCTTCAGCCATTTC; reverse, GTTCTCATGCCCTTCTCCATTA<\/p>\n<p>Kdm5d forward, CTGCAAGATGGCTGCATTTC; reverse, TCGCTCCTCCTGTACCATAA<\/p>\n<p>Ddx3y forward, AGCAGATTCAGTGGAGGATTT; reverse, CCACTACTTCGGCTGCTATT<\/p>\n<p>Eif2s3y forward, CGTTATGCCGAGCAGATAGAA; reverse, CCGTCTCAGTAGGAAGTAGGA<\/p>\n<p>Sssty1 forward, TGAAGAAGAGGAGGAGGAAGT; reverse, TTGGGTGACAGGCTCATTAC<\/p>\n<p>Ssty2 forward, GGTGCCATTCTTACAGGACTAT; reverse, GTGGAGGTTACCTTCCTTGTAG<\/p>\n<p>Zfy1\/2 forward, CACCAAGAAAGCAGAACACATC; reverse, GCCTTTGTGTGAACGGAAATTA<\/p>\n<p>Gapdh forward, AACAGCAACTCCCACTCTTC; reverse, CCTGTTGCTGTAGCCGTATT<\/p>\n<p>Actb forward, ACCCAGGCATTGCTGACAGG; reverse, GGACAGTGAGGCCAGGATGG<\/p>\n<p>B2m forward, ACAGTTCCACCCGCCTCACATT; reverse, TAGAAAGACCAGTCCTTGCTGAAG<\/p>\n<p>Cas9 forward, CCCAAGAGGAACAGCGATAAG; reverse, CCACCACCAGCACAGAATAG<\/p>\n<p>RNA primers:<\/p>\n<p>Pdcd1 forward, CGGTTTCAAGGCATGGTCATTGG; reverse, TCAGAGTGTCGTCCTTGCTTCC<\/p>\n<p>Havcr2 (Tim3) forward, GTATCCTGCAGCAGTAGGTC; reverse, CCCTGCAGTTACACTCTACC<\/p>\n<p>Ctla4 forward, GTACCTCTGCAAGGTGGAACTC; reverse, CCAAAGGAGGAAGTCAGAATCCG<\/p>\n<p>Tcf7 forward, CCTGCGGATATAGACAGCACTTC; reverse, TGTCCAGGTACACCAGATCCCA<\/p>\n<p>Gapdh forward, ATGCCTCCTGCACCACCAACT; reverse, ATGGCATGGACTGTGGTCATGAGT<\/p>\n<p>Actb forward, ACCCAGGCATTGCTGACAGG; reverse, GGACAGTGAGGCCAGGATGG.<\/p>\n<p>FC values were calculated using the \u0394\u0394Ct method relative to the appropriate wild-type controls. Data were used to quantify the relative copy number of Y chromosome genes in tumour\u2010derived immune subsets or to confirm the absence of Cas9 in cells not genetically engineered.<\/p>\n<p>Validation of T cell exhaustion impact on LOY via scRNA-seq<\/p>\n<p>The publicly available processed scRNA-seq dataset from Giles et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 39\" title=\"Giles, J. R. et al. Shared and distinct biological circuits in effector, memory and exhausted CD8(+) T cells revealed by temporal single-cell transcriptomics and epigenetics. Nat. Immunol. 23, 1600&#x2013;1613 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR39\" id=\"ref-link-section-d51900554e3822\" rel=\"nofollow noopener\" target=\"_blank\">39<\/a> was analysed to investigate the impact of chronic stimulation on the stability of the Y chromosome in T\u2009cells. This dataset included gp33-specific CD8+ T\u2009cells from TCR-transgenic mice subjected to acute (LCMV Armstrong) and chronic (LCMV Clone 13) LCMV infections. Chronic stimulation of T\u2009cells was validated through the upregulation of canonical exhaustion markers, including Tox, Pdcd1 and Ctla4, and the downregulation of Tcf7. To evaluate LOY, the expression levels of Y-linked genes (Uty, Kdm5d, Ddx3y, Usp9y) were analysed.<\/p>\n<p>Cell-type-specific gene signatures for deconvolution analysis<\/p>\n<p>To generate cell-type-specific gene signatures for LOYSCR\/WTYSCR epithelial cells, CD4+ T\u2009cells, and CD8+ T\u2009cells from scRNA-seq data, we conducted differential analysis using the \u2018sc.tl.rank_genes_groups\u2019 function of the Scanpy (v.1.9.5) package. This analysis used Wilcoxon rank sum (Mann\u2013Whitney U) tests to identify significant differences across each LOYSCR and WTYSCR cell type. We first identified genes significantly up-regulated (log2FC\u2009&gt;\u20091, adjusted P\u2009value\u2009&lt;\u20090.05) in the LOYSCR versus WTYSCR epithelial cells, CD4+ T\u2009cells and CD8+ T\u2009cells separately. To establish unique signatures for each cell type, we then excluded genes expressed in more than 15% of any other LOYSCR or WTYSCR cell type. We then performed deconvolution on normalized bulk expression data from TCGA cancer types using the ssGSEA algorithm, evaluating the relationship of these signatures with patient outcome.<\/p>\n<p>Survival analysis<\/p>\n<p>Time-to-event outcomes were presented by using Kaplan\u2013Meier curves and compared by using log-rank test or univariate Cox proportional hazards model (survival R package; v.3.5.8) as noted in each figure. Two multivariable Cox proportional hazards models were fitted, each as a function of (1) YchrS with ancestry and race as known risk factors and confounders; (2) LOY signatures scRNA-seq signatures, including LOYSCR CD4+ T\u2009cell, LOYSCR CD8+ T\u2009cell, and LOYSCR epithelial cell signatures, with age as known risk factors and confounders. Hazard ratio along with 95% CI based on multivariable Cox proportional hazards models were reported. The function surv_cutpoint from the survminer R package (v.0.4.9) was used to determine the optimal cut-off value for the LOYSCR signatures in relation to the time-to-event outcome. This method uses maximally selected rank statistics from the maxstat<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 91\" title=\"Lausen, B. &amp; Schumacher, M. Maximally selected rank statistics. Biometrics 48, 73&#x2013;85 (1992).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR91\" id=\"ref-link-section-d51900554e3916\" rel=\"nofollow noopener\" target=\"_blank\">91<\/a> R package (v.0.7-25) to classify two groups (low- versus high-risk) based on the optimal cut-point. Moreover, continuous variables included as covariates in the Cox proportional hazards model were evaluated<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 92\" title=\"Grambsch, P. M. &amp; Therneau, T. M. Proportional hazards tests and diagnostics based on weighted residuals. Biometrika 81, 515&#x2013;526 (1994).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#ref-CR92\" id=\"ref-link-section-d51900554e3920\" rel=\"nofollow noopener\" target=\"_blank\">92<\/a>. Linearity was assessed to ensure model adequacy.<\/p>\n<p>Development and validation of prognostic nomogram<\/p>\n<p>According to clinical risk factors and risk scores of multivariate Cox regression coefficients for Extended Data Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#Tab2\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>, a prognostic nomogram was established using the \u2018rms\u2019 R software package (v.6.8-0), and the prediction accuracy of the nomogram was assessed using the calibration curve to evaluate the match between expected and observed events at 2, 5 and 8\u2009years.<\/p>\n<p>Ethics statementHuman samples<\/p>\n<p>All human specimens and associated data were collected following protocols approved by the Institutional Review Board (IRB protocol 43021) at Cedars-Sinai Medical Center, adhering strictly to the Declaration of Helsinki guidelines. Written informed consent was obtained from each participant or their legal guardian. Detailed information regarding patient recruitment, sample collection (including TMA preparation), and data management can be found within IRB protocol 43021.<\/p>\n<p>Animal studies<\/p>\n<p>All animal procedures were reviewed and approved by the Institutional Animal Care and Use Committee (IACUC protocol 8253) at Cedars-Sinai Medical Center. Experiments were conducted in strict accordance with the guidelines specified in the National Institutes of Health (NIH) Guide for the Care and Use of Laboratory Animals. Protocol 8253 comprehensively describes animal housing conditions, care standards and experimental methodologies. All animal experiments were performed in accordance with institutional IACUC protocols. Mouse were housed under standard conditions with a 12-h light\/12-h dark cycle, temperatures maintained between 68\u2009\u00b0F and 79\u2009\u00b0F (20\u201326\u2009\u00b0C), and relative humidity between 30% and 70%.<\/p>\n<p>Statistical analysis<\/p>\n<p>All analyses were conducted using R (v.4.3.1) and Python (v.3.10.9). Before commencing tests, data were assessed for normality using the Kolmogorov\u2013Smirnov test, followed by Bartlett or Levene tests to evaluate homogeneity of variances. For normally distributed variables, the unpaired Student\u2019s t-test (Stats R package; v.4.3.1) was applied, whereas non-normally distributed variables were analysed using Wilcoxon rank sum tests. The correlation between paired variables was assessed using Spearman\u2019s correlation coefficients. Data presentation and multiple comparison corrections are as stated in figure legends. Statistical significance was considered when P\u2009values were less than 0.05, including adjusted P\u2009values. Discovery analyses involving more than 20 comparisons underwent multiple testing correction using the p.adjust function in R or multipletests function in Python, applying the Benjamini\u2013Hochberg method to control the false discovery rate at 0.05. To compare ROC curves, we used the roc.test function from the pROC R package (v.1.18.5). This allowed us to assess differences between AUC of YchrS and the AUC of YwholeS or LOYDNA. Python packages such as Scanpy (v.1.9.5), Pandas (v.2.0.0), Statsmodels (v.0.14.0), NumPy (v.1.24.2), Scipy (v.1.10.1), Matplotlib (v.3.8.0), Seaborn (v.0.11.2) and Sklearn (v.1.3.2), were used for data analysis. The R package ComplexHeatmap (v.2.11.1) was used to generate heat maps, and visualization was facilitated using ggplot2 (v.3.3.5), ggpubr (v.0.6.0), ggrepel (v.0.9.2), Statannot (v.0.6.0), Circlize (v.0.4.16), GseaVis (v.0.0.5), Enrichplot (v.1.22.0), GridExtra (v.2.3.0), Pheatmap(v.1.0.12) and DEGreport (v.1.38.5) R packages. For data manipulation, Readr (v.2.1.5), Readxl (v.1.4.3), Dplyr (v.1.1.4), Plyr (v.8.9), Apeglm (v.1.24.0), Tidyr (v.1.3.1), Tidyverse (v.2.0.0), Tibble (v. 3.2.1), Iranges (v.2.36.0), Biobase (v.2.62.0), BiocGenerics (v.0.48.1), Lubridate (v.1.9.3), Stringr (v.1.5.1) and AnnotationDbi (v.1.64.1) were used for analysis.<\/p>\n<p>Reporting summary<\/p>\n<p>Further information on research design is available in the\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09071-2#MOESM2\" rel=\"nofollow noopener\" target=\"_blank\">Nature Portfolio Reporting Summary<\/a> linked to this article.<\/p>\n","protected":false},"excerpt":{"rendered":"TCGA data acquisition and processing In this study, we used bulk RNA-seq, WES somatic mutation data, and clinical&hellip;\n","protected":false},"author":2,"featured_media":4302,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[25],"tags":[49,48,5123,5124,316,1099,1100,5125,66],"class_list":{"0":"post-4301","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-genetics","8":"tag-ca","9":"tag-canada","10":"tag-cancer-microenvironment","11":"tag-chromosomes","12":"tag-genetics","13":"tag-humanities-and-social-sciences","14":"tag-multidisciplinary","15":"tag-prognostic-markers","16":"tag-science"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/posts\/4301","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/comments?post=4301"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/posts\/4301\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/media\/4302"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/media?parent=4301"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/categories?post=4301"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/tags?post=4301"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}