{"id":16517,"date":"2025-07-23T17:47:12","date_gmt":"2025-07-23T17:47:12","guid":{"rendered":"https:\/\/www.newsbeep.com\/au\/16517\/"},"modified":"2025-07-23T17:47:12","modified_gmt":"2025-07-23T17:47:12","slug":"precisely-defining-disease-variant-effects-in-crispr-edited-single-cells","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/au\/16517\/","title":{"rendered":"Precisely defining disease variant effects in CRISPR-edited single cells"},"content":{"rendered":"<p>Cell-line cultures and genomic editing<\/p>\n<p>HH cutaneous T cell lines (ATCC, CRL-2105), Jurkat E6-1 (ATCC, TIB-152) and Daudi cells (ATCC, CCL-213) were cultured in complete RPMI (cRPMI), RPMI 1640 supplemented with 10% heat-inactivated FBS and 1% non-essential amino acids, sodium pyruvate, HEPES, l-glutamine, penicillin-streptomycin and 0.1% \u03b2-mercaptoethanol. HEK293T cells (ATCC, CRL-3216) were cultured in complete DMEM, DMEM supplemented with 10% heat-inactivated FBS and penicillin-streptomycin antibiotics.<\/p>\n<p>To investigate the regulatory regions around HLA-DQB1, HH and Daudi cells were genomically modified as previously described with some modifications<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 21\" title=\"Gutierrez-Arcelus, M. et al. Allele-specific expression changes dynamically during T cell activation in HLA and other autoimmune loci. Nat. Genet. 52, 247&#x2013;253 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#ref-CR21\" id=\"ref-link-section-d21541635e2310\" rel=\"nofollow noopener\" target=\"_blank\">21<\/a>. In brief, 20\u2009\u00b5M Cas9 protein (Synthego) was mixed with equal volumes of 40\u2009\u00b5M modified sgRNA (Synthego) and incubated at 37\u2009\u00b0C for 15\u2009min to form RNP complexes. The sgRNA was designed using SpliceR<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 40\" title=\"Kluesner, M. G. et al. CRISPR-Cas9 cytidine and adenosine base editing of splice-sites mediates highly-efficient disruption of proteins in primary and immortalized cells. Nat. Commun. 12, 2437 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#ref-CR40\" id=\"ref-link-section-d21541635e2314\" rel=\"nofollow noopener\" target=\"_blank\">40<\/a>. HH and Daudi cells were nucleofected with 2\u2009\u00b5l of RNPs in an Amaxa 4D nucleofector and 1\u2009\u00b5l of ssODN repair template (SE protocol: CL-120 for HH cells; SF protocol: CA-137 for Daudi cells). Cells were immediately transferred to 24-well plates with pre-warmed media and cultured until sorting.<\/p>\n<p>For Jurkat and Daudi cells targeting FBXO11, 1\u2009\u00b5l of mRNA (2\u2009\u00b5g\u2009\u00b5l\u22121) encoding the base editor ABE8e-NG was mixed with 1\u2009\u00b5l of 40\u2009\u00b5M modified sgRNA (Synthego). Cells were nucleofected with 2\u2009\u00b5l of an mRNA\/sgRNA mixture in an Amaxa 4D nucleofector (SE protocol: CL-120 for Jurkat cells; Daudi as above). Cells were incubated as described above in 24-well plates and cultured until sorting.<\/p>\n<p>For multiplexed PAX5 editing in Daudi cells, three sgRNAs were designed for each of the two DNA binding domains (DBD), designated region A and region B. Editing was performed in region A and B separately, as well as combined. Daudi cells were nucleofected with 1\u2009\u00b5l of 2\u2009\u00b5g\u2009\u00b5l\u22121 ABE8e mRNA and 1\u2009\u00b5l of each sgRNA (40\u2009\u00b5M) using the reagents and protocol described above. Cells were sorted into single-cell plates seven days after nucleofection followed by CRAFTseq.<\/p>\n<p>Healthy individuals, PBMC isolation and CD4+ T-cell magnetic selection<\/p>\n<p>We recruited healthy individuals and processed 40\u201350\u2009ml of peripheral blood under a Mass General Brigham (MGB) IRB-approved protocol (IRB# 2008P000427). PBMCs were isolated by layering Ficoll Paque (Sigma-Aldrich) underneath 1:1 PBS-diluted blood followed by centrifugation. Buffy coat layers were extracted, washed in PBS and then resuspended in cRPMI. Cells were stored by adding an equal volume of freezing medium (10% DMSO and 50% FBS in cRPMI) and frozen in liquid N2 until use. To isolate naive or total CD4+ T cells, frozen PBMCs were quickly thawed and put immediately in warm cRPMI media. Cells were washed twice and naive or total CD4+ T cells were isolated using a magnetic negative selection kit (Miltenyi, Naive CD4+ T cell isolation kit human, CD4+ T cell Isolation kit) following the manufacturer\u2019s protocols.<\/p>\n<p>CRISPR editing of primary CD4 T cells<\/p>\n<p>After naive CD4+ T cell isolation, cells were stimulated with anti-CD3 and anti-CD28 Dynabeads (ThermoFisher) at a ratio of 1:1 (cell:Dynabead) in the presence of 5\u2009ng\u2009ml\u22121 rhIL-2 and 10\u2009ng\u2009ml\u22121 rhIL-7 (Biolegend) in 96-well U-bottom plates. After one day, cells were extracted and nucleofected with CRISPR reagents. In brief, 250,000\u2013500,000 stimulated CD4+ T cells were nucleofected with 1\u2009\u00b5l of mRNA (2\u2009\u00b5g\u2009\u00b5l\u22121) encoding the base editor and 1\u2009\u00b5l of sgRNA (40\u2009\u00b5M, Synthego) in an Amaxa 4D nucleofector (P3 protocol: EH-115). After nucleofection, cells were transferred to 96-well U-bottom plates and cultured in cRPMI for 4\u2009h. For experiments targeting PTPRC, cells were cultured for a further 5 days with 5\u2009ng\u2009ml\u22121 rhIL-2 before proceeding to staining and cell sorting. For polarization experiments, cells were split into two wells and cultured with either 10\u2009ng\u2009ml\u22121 rhIL-12, 5\u2009ng\u2009ml\u22121 rhIL-2 and 2\u2009\u00b5g\u2009ml\u22121 anti-IL4 antibody for TH1 conditions or 10\u2009ng\u2009ml\u22121 rhIL-2 and 5\u2009ng\u2009ml\u22121 rhTGF\u03b2 for Treg conditions.<\/p>\n<p>Cell staining with indexing and oligo-conjugated antibodies<\/p>\n<p>Cells were isolated, washed and resuspended in a cell staining buffer (2% FBS in PBS with 2\u2009mM EDTA). About 500,000 to one million cells were stained with fluorophore-conjugated hashing antibodies for 20\u2009min on ice. After staining, cells were washed and conditions pooled together. Pooled samples were then spun down and resuspended in FcX True Stain (Biolegend) (BioLegend) for 5\u2009min at room temperature, followed by staining with the TotalSeq-A Human Universal Cocktail 1.0 (BioLegend) for 30\u2009min on ice before washing and proceeding to single-cell sorting. A full list of antibodies and barcodes can be found in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>.<\/p>\n<p>Bulk DNA isolation<\/p>\n<p>Cells were pelleted, resuspended in RLT+ buffer (Qiagen), flash frozen on dry ice and stored at \u221280\u2009\u00b0C until processing. For DNA isolation, samples were thawed, vortexed and incubated for 5\u2009min at room temperature before proceeding to DNA isolation using a Qiagen RNA\/DNA extraction kit following the manufacturer\u2019s protocols. After isolation, DNA concentrations were measured by spectrometry (Nanovue) and stored at \u221220\u2009\u00b0C until use. For analysis of bulk DNA editing, nested PCR reactions were used to amplify 200\u2013400\u2009bp genomic regions of interest with custom Illumina adapters added for sequencing. Bulk samples were sequenced alongside single-cell libraries. Sequences of all primers can be found in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">7<\/a>.<\/p>\n<p>Bulk flow cytometry<\/p>\n<p>Stimulated and genomically edited total CD4+ T cells were assayed for expression of key protein markers by flow cytometry on day 7 after nucleofection with a panel of fluorophore-conjugated antibodies. For all samples, cells were isolated and washed twice in PBS, and FC receptors were blocked with FcX True Stain (Biolegend) for 15\u2009min on ice followed by staining with directly conjugated antibodies for 30\u2009min on ice. Cells were then washed and samples were analysed on a BD LSR Fortessa. All data were processed using FlowJo before analysis in R. A full list of antibodies, clones and manufacturers can be found in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">8<\/a>.<\/p>\n<p>CRAFTseq<\/p>\n<p>Lysis buffer containing 0.2% TritonX (Millipore Sigma), 1.2\u2009U recombinant RNase inhibitor (Takeda), 1.2\u2009mM DTT, 1\u2009M Betaine, 6\u2009mM dNTP and 9\u2009mM dCTP (ThermoFisher) was added into a 384-well PCR plate using a multi-channel pipette. An Agilent Bravo was used to add 2.5\u2009\u00b5M of a well-specific OligoDT barcode into each well, then to distribute 1\u2009\u00b5l of barcoded lysis buffer to multiple 384-well plates. These lysis plates were sealed and stored at \u221220\u2009\u00b0C until use the next day for sorting. Single cells stained with fluorophore- and oligo-conjugated antibodies were sorted into lysis plates, which were kept sealed on dry ice before being stored at \u221280\u2009\u00b0C until use for a maximum of 4 weeks.<\/p>\n<p>For library generation, plates were thawed and incubated at 72\u2009\u00b0C for 3\u2009min, then kept on ice during further processing. RT-PCR mix (4\u2009\u00b5l) containing 1.8\u2009\u00b5M template switch oligo (Azenta), 0.3\u2009mM dNTP, 0.8\u2009M Betaine, 4.8\u2009mM DTT, 9.7\u2009mM MgCl2, 0.8\u2009U RNase inhibitor, Maxima H Minus reverse transcriptase, 1\u00d7 KAPA HIFI buffer and 2\u2009U KAPA HiFi HotStart DNA polymerase (Roche) was then added to each well using a MANTIS liquid dispenser (Formulatrix). After first-strand synthesis (50\u2009\u00b0C for 1\u2009h and 85\u2009\u00b0C for 5\u2009min), a mixture of ADT primers (0.05\u2009\u00b5M) and genomic DNA-specific primers (0.2\u2009\u00b5M) were added using an I.DOT liquid handler (Dispendix) and amplified using the following program: 98\u2009\u00b0C for 5\u2009min then 18\u201322 cycles of 98\u2009\u00b0C for 20\u2009s, 65\u2009\u00b0C for 20\u2009s and 72\u2009\u00b0C for 6\u2009min, followed by 72\u2009\u00b0C for 5\u2009min and then hold at 4\u2009\u00b0C. After amplification, the samples were split for processing of cDNA, genomic DNA and ADT.<\/p>\n<p>For genomic DNA, using the Bravo, 1\u2009\u00b5l of product per well was taken for further amplification of genomic DNA with nested primers containing a capture oligo with well-specific barcodes. The PCR mix was distributed between plates similarly to the process used for lysis plates and contained 0.3\u2009mM dNTP, 1\u00d7 KAPA Hifi buffer, 2\u2009U KAPA HIFI HotStart Taq and 0.4\u2009\u00b5M region-specific capture primer, 0.4\u2009\u00b5M region-specific P7 primer and 0.4\u2009\u00b5M capture oligo with well-specific barcode. The DNA was amplified using the following program: 98\u2009\u00b0C for 5\u2009min, followed by 20 cycles of 98\u2009\u00b0C for 20\u2009s, 65\u2009\u00b0C for 20\u2009s and 72\u2009\u00b0C for 30\u2009s, followed by 72\u2009\u00b0C for 5\u2009min, then hold at 10\u2009\u00b0C. We then pooled 2\u2009\u00b5l of each well containing nested genomic DNA using the Bravo and purified it with 1.2\u00d7 solid-phase reverse immobilization (SPRI) beads. In brief, room-temperature SPRI beads were incubated at the given ratio by volume with PCR product for 10\u2009min, then placed on a magnet for 5\u2009min. Supernatant was removed and the beads were washed twice with 80% freshly made ethanol, dried for up to 15\u2009min in air and resuspended in nuclease-free water. The resuspended beads were incubated at room temperature for 5\u2009min, then placed back on the magnet for 5\u2009min, and the supernatant was used for following steps. The cleaned DNA was treated with thermolabile ExoI and ExoVII (NEB) to remove primers (37\u2009\u00b0C for 30\u2009min, then 87\u2009\u00b0C for 5\u2009min for ExoI denaturation) and re-cleaned with 1.2\u00d7 SPRI beads.<\/p>\n<p>For multiplexed PAX5 editing, to effectively amplify gDNA amplicons from both region A and region B, two inner PCR reactions were carried out for each region separately, rather than amplifying both regions simultaneously in one reaction.<\/p>\n<p>For cDNA, 2\u2009\u00b5l per well of the original amplification product was pooled using the Bravo and cleaned with 0.6\u00d7 SPRI beads; the supernatant was saved for ADT sequencing. The cleaned cDNA product was treated with ExoI to remove primers (37\u2009\u00b0C for 5\u2009min, 85\u2009\u00b0C for 5\u2009min for ExoI denaturation) and cleaned again with 0.8\u00d7 SPRI. cDNA concentrations were measured on a QuBit and 2\u2009ng was used for tagmentation with a NexteraXT kit (Illumina) following the manufacturer\u2019s directions in half-volume reactions (25\u2009\u00b5l total).<\/p>\n<p>For ADT, saved supernatant from the 0.6\u00d7 SPRI clean-up was re-cleaned with 1.4\u00d7 SPRI (2\u00d7 total) and ExoI was treated as above. The ADT product was quantified on a QuBit.<\/p>\n<p>For the final amplification of pooled samples, 2\u2009ng of tagmented cDNA and 5\u2009ng of genomic DNA and ADT samples underwent a final amplification step from the 3\u2032 end with custom Illumina-compatible primers. The PCR mix contained 0.2\u2009\u00b5M P7 and P5 custom oligos, 1\u00d7 Q5 buffer, 0.5\u2009\u00b5l Q5 Taq, 0.2\u2009\u00b5M dNTP and water to a total volume of 25\u2009\u00b5l. Amplification used the following programs: cDNA 72\u2009\u00b0C for 3\u2009min gap repair, 95\u2009\u00b0C for 30\u2009s, followed by 16 cycles of 95\u2009\u00b0C for 10\u2009s, 55\u2009\u00b0C for 30\u2009s and 72\u2009\u00b0C for 30\u2009s, followed by 72\u2009\u00b0C for 5\u2009min, then hold at 10\u2009\u00b0C. Genomic DNA and ADT: 98\u2009\u00b0C for 3\u2009min, followed by 16 cycles of 98\u2009\u00b0C for 15\u2009s, 65\u2009\u00b0C for 20\u2009s and 72\u2009\u00b0C for 45\u2009s, followed by 72\u2009\u00b0C for 10\u2009min. All samples were re-cleaned using SPRI beads (0.8\u00d7 for cDNA, 1.2\u00d7 for gDNA and 1.6\u00d7 for ADT) for sequencing. All sequences can be found in Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>\u2013<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">7<\/a>. The full protocol can be found in a more readable format deposited on protocols.io (<a href=\"https:\/\/www.protocols.io\/view\/craftseq-d7bk9ikw\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/www.protocols.io\/view\/craftseq-d7bk9ikw<\/a>), in\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">Supplementary Notes<\/a> and Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">16<\/a>\u2013<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">23<\/a>.<\/p>\n<p>Illumina sequencing<\/p>\n<p>Before submission to sequencing at the Genomics Platform at the Broad Institute, sample DNA concentrations were quantified on a QuBit, and distributions were measured using a TapeStation (Agilent) with D5000 high-sensitivity screentapes. All DNA amplicon libraries were sequenced on a MiSeq for 300 cycles (120 R1, 180 R2). ADT amplicon libraries were pooled at 5% with the RNA libraries and sequenced on a NextSeq550 for 75 cycles (26 R1, 50 R2).<\/p>\n<p>Processing and alignment of single-cell data<\/p>\n<p>The I7 and I5 raw FASTQs were merged across different lanes and runs. Transcriptomic reads were then aligned to the human genome (GRCh38 2020-A) using STARsolo<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 41\" title=\"Kaminow, B., Yunusov, D. &amp; Dobin, A. STARsolo: accurate, fast and versatile mapping\/quantification of single-cell and single-nucleus RNA-seq data. Preprint at bioRxiv &#010;                https:\/\/doi.org\/10.1101\/2021.05.05.442755&#010;                &#010;               (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#ref-CR41\" id=\"ref-link-section-d21541635e2501\" rel=\"nofollow noopener\" target=\"_blank\">41<\/a> (v.2.7.6a). The resulting count matrices were imported into R. For all experiments, cells were filtered on at least 500 genes, 1,000 UMIs and fewer than 10% mitochondrial reads. Counts were log(CP10K\u2009+\u20091) normalized and z-score scaled. For visualization, PCA was done on the top 3,000 variable genes, as selected using variance stabilizing transformation. This was followed by batch correction using Harmony<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 42\" title=\"Korsunsky, I. et al. Fast, sensitive and accurate integration of single-cell data with Harmony. Nat. Methods 16, 1289&#x2013;1296 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#ref-CR42\" id=\"ref-link-section-d21541635e2508\" rel=\"nofollow noopener\" target=\"_blank\">42<\/a> and dimension reduction using UMAP (uwot package)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 43\" title=\"Melville, J. et al. The uniform manifold approximation and projection (UMAP) method for dimensionality reduction. arXiv &#010;                https:\/\/doi.org\/10.48550\/arXiv.1802.03426&#010;                &#010;               (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#ref-CR43\" id=\"ref-link-section-d21541635e2512\" rel=\"nofollow noopener\" target=\"_blank\">43<\/a>. For index flow-cytometry analysis, raw fluorescence values were exported from index sorting as csv files and imported into R. Values were log10-normalized and analysed in R.<\/p>\n<p>Kallisto kite (v.0.27.3) was used to build an ADT reference transcriptome corresponding to the ADT panel (Biolegend TotalSeq A) used and to align ADT reads<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 44\" title=\"Booeshaghi, A. S., Min, K. H. J., Gehring, J. &amp; Pachter, L. Quantifying orthogonal barcodes for sequence census assays. Bioinform. Adv. 4, vbad181 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#ref-CR44\" id=\"ref-link-section-d21541635e2521\" rel=\"nofollow noopener\" target=\"_blank\">44<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 45\" title=\"Bray, N. L., Pimentel, H., Melsted, P. &amp; Pachter, L. Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 34, 525&#x2013;527 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#ref-CR45\" id=\"ref-link-section-d21541635e2524\" rel=\"nofollow noopener\" target=\"_blank\">45<\/a>. UMI counts were imported into R, CLR normalized and z-score scaled. For visualization, PCA was done on CLR-normalized and scaled variable ADTs followed by plate correction using Harmony and UMAP dimension reduction on harmonized PCs.<\/p>\n<p>DNA reads were demultiplexed based on cellular barcodes. Then, amplicons were aligned to the amplicon reference using CRISPResso2 (v.2.2.9)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 46\" title=\"Clement, K. et al. CRISPResso2 provides accurate and rapid genome editing sequence analysis. Nat. Biotechnol. 37, 224&#x2013;226 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#ref-CR46\" id=\"ref-link-section-d21541635e2534\" rel=\"nofollow noopener\" target=\"_blank\">46<\/a>. Allele usage statistics were calculated from CRISPResso2 outputs. For all DNA analyses, cells were filtered on a minimum number of reads (20\u201340) and minimum allele frequencies (12\u201325%) on a per-experiment basis. Assuming dizygosity, only the top two alleles after filtering were included in the analysis. If after filtering only one allele was recovered, the cell was assumed to be homozygous. Clustering of DNA editing in the DQB1 experiment (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#Fig2\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>) was performed with supervised k-means clustering with an n of 3. For more information on processing amplicon sequencing data and genotype calling, see\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">Supplementary Notes<\/a> and Supplementary Figs. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">6<\/a>\u2013<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">23<\/a>.<\/p>\n<p>Linear modelling of gene and protein expression<\/p>\n<p>Associations of gene and ADT expression to dosage and conditions were made using negative binomial generalized linear modelling. We modelled counts of genes or ADTs with covariates comparing null to full models. For analyses in cell lines:<\/p>\n<p>$$\\begin{array}{c}X\\sim {\\rm{N}}{\\rm{e}}{\\rm{g}}{\\rm{a}}{\\rm{t}}{\\rm{i}}{\\rm{v}}{\\rm{e}}\\,{\\rm{b}}{\\rm{i}}{\\rm{n}}{\\rm{o}}{\\rm{m}}{\\rm{i}}{\\rm{a}}{\\rm{l}}({\\rm{G}}{\\rm{e}}{\\rm{n}}{\\rm{e}}\/{{\\rm{A}}{\\rm{D}}{\\rm{T}}}_{i})\\\\ {\\rm{N}}{\\rm{u}}{\\rm{l}}{\\rm{l}}\\,{\\rm{m}}{\\rm{o}}{\\rm{d}}{\\rm{e}}{\\rm{l}}:{\\rm{l}}{\\rm{n}}(X)=1+{\\log }_{10}({{\\rm{n}}{\\rm{C}}{\\rm{o}}{\\rm{u}}{\\rm{n}}{\\rm{t}}{\\rm{s}}}_{i})+{\\beta }_{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}_{i}\\\\ {\\rm{F}}{\\rm{u}}{\\rm{l}}{\\rm{l}}\\,{\\rm{m}}{\\rm{o}}{\\rm{d}}{\\rm{e}}{\\rm{l}}\\,{\\rm{g}}{\\rm{e}}{\\rm{n}}{\\rm{o}}{\\rm{t}}{\\rm{y}}{\\rm{p}}{\\rm{e}}:{\\rm{l}}{\\rm{n}}(X)=1+{\\log }_{10}({{\\rm{n}}{\\rm{C}}{\\rm{o}}{\\rm{u}}{\\rm{n}}{\\rm{t}}{\\rm{s}}}_{i})+{\\beta }_{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}_{i}+{\\beta }_{{\\rm{g}}{\\rm{e}}{\\rm{n}}{\\rm{o}}{\\rm{t}}{\\rm{y}}{\\rm{p}}{\\rm{e}}}{{\\rm{G}}{\\rm{e}}{\\rm{n}}{\\rm{o}}{\\rm{y}}{\\rm{t}}{\\rm{p}}{\\rm{e}}}_{i}\\\\ {\\rm{F}}{\\rm{u}}{\\rm{l}}{\\rm{l}}\\,{\\rm{m}}{\\rm{o}}{\\rm{d}}{\\rm{e}}{\\rm{l}}\\,{\\rm{c}}{\\rm{o}}{\\rm{n}}{\\rm{d}}{\\rm{i}}{\\rm{t}}{\\rm{i}}{\\rm{o}}{\\rm{n}}:{\\rm{l}}{\\rm{n}}(X)=1+{\\log }_{10}({{\\rm{n}}{\\rm{C}}{\\rm{o}}{\\rm{u}}{\\rm{n}}{\\rm{t}}{\\rm{s}}}_{i})+{\\beta }_{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}_{i}+{\\beta }_{{\\rm{c}}{\\rm{o}}{\\rm{n}}{\\rm{d}}{\\rm{i}}{\\rm{t}}{\\rm{i}}{\\rm{o}}{\\rm{n}}}{{\\rm{C}}{\\rm{o}}{\\rm{n}}{\\rm{d}}{\\rm{i}}{\\rm{t}}{\\rm{i}}{\\rm{o}}{\\rm{n}}}_{i}\\end{array}$$<\/p>\n<p>where gene\/ADT counts is raw counts in each cell i, nCounts is total gene or ADT count in cell i, plate is plate identity, genotype is dosage at the edited nucleotide (0,1,2) and condition is culture condition (CRISPR or control). For the DQB1 editing experiment, we modelled gene and ADT counts:<\/p>\n<p>$$\\begin{array}{c}X\\sim {\\rm{N}}{\\rm{e}}{\\rm{g}}{\\rm{a}}{\\rm{t}}{\\rm{i}}{\\rm{v}}{\\rm{e}}\\,{\\rm{b}}{\\rm{i}}{\\rm{n}}{\\rm{o}}{\\rm{m}}{\\rm{i}}{\\rm{a}}{\\rm{l}}({{\\rm{G}}{\\rm{e}}{\\rm{n}}{\\rm{e}}\/{\\rm{A}}{\\rm{D}}{\\rm{T}}}_{i})\\\\ {\\rm{N}}{\\rm{u}}{\\rm{l}}{\\rm{l}}\\,{\\rm{m}}{\\rm{o}}{\\rm{d}}{\\rm{e}}{\\rm{l}}:{\\rm{l}}{\\rm{n}}(X)=1+{\\log }_{10}({{\\rm{n}}{\\rm{C}}{\\rm{o}}{\\rm{u}}{\\rm{n}}{\\rm{t}}{\\rm{s}}}_{i})+{\\beta }_{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}_{i}\\\\ {\\rm{F}}{\\rm{u}}{\\rm{l}}{\\rm{l}}\\,{\\rm{m}}{\\rm{o}}{\\rm{d}}{\\rm{e}}{\\rm{l}}:{\\rm{l}}{\\rm{n}}(X)=1+{\\log }_{10}({{\\rm{n}}{\\rm{C}}{\\rm{o}}{\\rm{u}}{\\rm{n}}{\\rm{t}}{\\rm{s}}}_{i})+{\\beta }_{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}_{i}+{\\log }_{2}({\\beta }_{{\\rm{D}}{\\rm{e}}{\\rm{l}}{\\rm{e}}{\\rm{t}}{\\rm{i}}{\\rm{o}}{\\rm{n}}{\\rm{s}}{\\rm{i}}{\\rm{z}}{\\rm{e}}}{{\\rm{D}}{\\rm{e}}{\\rm{l}}{\\rm{e}}{\\rm{t}}{\\rm{i}}{\\rm{o}}{\\rm{n}}{\\rm{s}}{\\rm{i}}{\\rm{z}}{\\rm{e}}}_{i}+1)\\end{array}$$<\/p>\n<p>where deletion size was calculated per cell i. In all cell-line analysis, genes were filtered for non-zero expression in greater than 30% of cells and a mean of 2 reads per cell. For the PTPRC editing experiments in primary human cells, we modelled:<\/p>\n<p>$$\\begin{array}{c}X\\sim {\\rm{N}}{\\rm{e}}{\\rm{g}}{\\rm{a}}{\\rm{t}}{\\rm{i}}{\\rm{v}}{\\rm{e}}\\,{\\rm{b}}{\\rm{i}}{\\rm{n}}{\\rm{o}}{\\rm{m}}{\\rm{i}}{\\rm{a}}{\\rm{l}}({{\\rm{G}}{\\rm{e}}{\\rm{n}}{\\rm{e}}\/{\\rm{A}}{\\rm{D}}{\\rm{T}}}_{i})\\\\ {\\rm{N}}{\\rm{u}}{\\rm{l}}{\\rm{l}}\\,{\\rm{m}}{\\rm{o}}{\\rm{d}}{\\rm{e}}{\\rm{l}}:{\\rm{l}}{\\rm{n}}(X)=1+{\\log }_{10}({{\\rm{n}}{\\rm{C}}{\\rm{o}}{\\rm{u}}{\\rm{n}}{\\rm{t}}{\\rm{s}}}_{i})+{\\beta }_{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}_{i}+{\\beta }_{{\\rm{i}}{\\rm{n}}{\\rm{d}}{\\rm{i}}{\\rm{v}}{\\rm{i}}{\\rm{d}}{\\rm{u}}{\\rm{a}}{\\rm{l}}}{{\\rm{I}}{\\rm{n}}{\\rm{d}}{\\rm{i}}{\\rm{v}}{\\rm{i}}{\\rm{d}}{\\rm{u}}{\\rm{a}}{\\rm{l}}}_{i}\\\\ {\\rm{F}}{\\rm{u}}{\\rm{l}}{\\rm{l}}\\,{\\rm{m}}{\\rm{o}}{\\rm{d}}{\\rm{e}}{\\rm{l}}\\,{\\rm{g}}{\\rm{e}}{\\rm{n}}{\\rm{o}}{\\rm{t}}{\\rm{y}}{\\rm{p}}{\\rm{e}}:{\\rm{l}}{\\rm{n}}(X)=1+{\\log }_{10}({{\\rm{n}}{\\rm{C}}{\\rm{o}}{\\rm{u}}{\\rm{n}}{\\rm{t}}{\\rm{s}}}_{i})+{\\beta }_{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}_{i}+{\\beta }_{{\\rm{i}}{\\rm{n}}{\\rm{d}}{\\rm{i}}{\\rm{v}}{\\rm{i}}{\\rm{d}}{\\rm{u}}{\\rm{a}}{\\rm{l}}}{{\\rm{I}}{\\rm{n}}{\\rm{d}}{\\rm{i}}{\\rm{v}}{\\rm{i}}{\\rm{d}}{\\rm{u}}{\\rm{a}}{\\rm{l}}}_{i}+{\\beta }_{{\\rm{g}}{\\rm{e}}{\\rm{n}}{\\rm{o}}{\\rm{t}}{\\rm{y}}{\\rm{p}}{\\rm{e}}}{{\\rm{G}}{\\rm{e}}{\\rm{n}}{\\rm{o}}{\\rm{t}}{\\rm{y}}{\\rm{p}}{\\rm{e}}}_{i}\\\\ {\\rm{F}}{\\rm{u}}{\\rm{l}}{\\rm{l}}\\,{\\rm{m}}{\\rm{o}}{\\rm{d}}{\\rm{e}}{\\rm{l}}\\,{\\rm{c}}{\\rm{o}}{\\rm{n}}{\\rm{d}}{\\rm{i}}{\\rm{t}}{\\rm{i}}{\\rm{o}}{\\rm{n}}:{\\rm{l}}{\\rm{n}}(X)=1+{\\log }_{10}({{\\rm{n}}{\\rm{C}}{\\rm{o}}{\\rm{u}}{\\rm{n}}{\\rm{t}}{\\rm{s}}}_{i})+{\\beta }_{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}_{i}+{\\beta }_{{\\rm{i}}{\\rm{n}}{\\rm{d}}{\\rm{i}}{\\rm{v}}{\\rm{i}}{\\rm{d}}{\\rm{u}}{\\rm{a}}{\\rm{l}}}{{\\rm{I}}{\\rm{n}}{\\rm{d}}{\\rm{i}}{\\rm{v}}{\\rm{i}}{\\rm{d}}{\\rm{u}}{\\rm{a}}{\\rm{l}}}_{i}+{\\beta }_{{\\rm{c}}{\\rm{o}}{\\rm{n}}{\\rm{d}}{\\rm{i}}{\\rm{t}}{\\rm{i}}{\\rm{o}}{\\rm{n}}}{{\\rm{C}}{\\rm{o}}{\\rm{n}}{\\rm{d}}{\\rm{i}}{\\rm{t}}{\\rm{i}}{\\rm{o}}{\\rm{n}}}_{i}\\end{array}$$<\/p>\n<p>where individual was the individual identity in cell i. Genes were filtered for non-zero expression in greater than 15% of cells. For the IL2RA editing experiments, we modelled:<\/p>\n<p>$$\\begin{array}{c}X\\sim {\\rm{N}}{\\rm{e}}{\\rm{g}}{\\rm{a}}{\\rm{t}}{\\rm{i}}{\\rm{v}}{\\rm{e}}\\,{\\rm{b}}{\\rm{i}}{\\rm{n}}{\\rm{o}}{\\rm{m}}{\\rm{i}}{\\rm{a}}{\\rm{l}}({{\\rm{G}}{\\rm{e}}{\\rm{n}}{\\rm{e}}\/{\\rm{A}}{\\rm{D}}{\\rm{T}}}_{i})\\\\ {\\rm{N}}{\\rm{u}}{\\rm{l}}{\\rm{l}}\\,{\\rm{m}}{\\rm{o}}{\\rm{d}}{\\rm{e}}{\\rm{l}}:{\\rm{l}}{\\rm{n}}(X)=1+{\\log }_{10}({{\\rm{n}}{\\rm{C}}{\\rm{o}}{\\rm{u}}{\\rm{n}}{\\rm{t}}{\\rm{s}}}_{i})+\\,({1|{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}_{i})+({1|{\\rm{I}}{\\rm{n}}{\\rm{d}}{\\rm{i}}{\\rm{v}}{\\rm{i}}{\\rm{d}}{\\rm{u}}{\\rm{a}}{\\rm{l}}}_{i})\\\\ {\\rm{F}}{\\rm{u}}{\\rm{l}}{\\rm{l}}\\,{\\rm{m}}{\\rm{o}}{\\rm{d}}{\\rm{e}}{\\rm{l}}:{\\rm{l}}{\\rm{n}}(X)=1+{\\log }_{10}({{\\rm{n}}{\\rm{C}}{\\rm{o}}{\\rm{u}}{\\rm{n}}{\\rm{t}}{\\rm{s}}}_{i})+({1|{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}_{i})\\\\ \\,\\,\\,+(1|{{\\rm{I}}{\\rm{n}}{\\rm{d}}{\\rm{i}}{\\rm{v}}{\\rm{i}}{\\rm{d}}{\\rm{u}}{\\rm{a}}{\\rm{l}}}_{i})+{\\beta }_{{\\rm{g}}{\\rm{e}}{\\rm{n}}{\\rm{o}}{\\rm{t}}{\\rm{y}}{\\rm{p}}{\\rm{e}}}{{\\rm{G}}{\\rm{e}}{\\rm{n}}{\\rm{o}}{\\rm{t}}{\\rm{y}}{\\rm{p}}{\\rm{e}}}_{i}\\end{array}$$<\/p>\n<p>Random effects were used for individual and plate in these models with multiple runs. The R package lme4 was used to model fixed effects using the glm.nb function, and the R package MASS was used to model random effects using the glmer.nb function.<\/p>\n<p>To determine the significance of these models, we used an LRT comparing the null and full models and calculated a P-value for the test statistic against the chi-squared distribution with one degree of freedom. When indicated, we corrected for multiple hypothesis testing using the Benjamini\u2013Hochberg correction (FDR).<\/p>\n<p>To model the effect of PAX5 multiplexed editing on gene expression, we used negative binomial regression to model all eight edited variants jointly, which yielded a P-value for each variant and gene. We used the Bonferroni correction to account for multiple testing considering the number of variants and the number of tested genes, using a P-value cut-off equal to 0.05\/(8\u2009\u00d7\u2009the number of genes), which equals 0.05\/(8\u2009\u00d7\u20093,125)\u2009=\u20092\u2009\u00d7\u200910\u22126. To further validate the effect of individual variants, we used a likelihood-ratio test in which the full model contains all eight edited variants and covariates and the null model is the full model minus the specific variant of interest. We report LRT P-values in the relevant figures.<\/p>\n<p>To evaluate the calibration of effect sizes and P-values learnt from the linear model and LRT, we bootstrapped the CD45 experiment using 200 replications for each gene. We used the estimated \u03b2 values from this approach to define the 95% confidence intervals and nominal significance (confidence interval not containing 0) for each gene.<\/p>\n<p>                        PAX5 interaction analysis<\/p>\n<p>To test for interactions between our two edited regions in PAX5, regions A and B, we first summarized the effect of B.29 on genome-wide gene expression as a B.29 gene score: \\({S}_{i}={\\sum }_{g}{\\beta }_{g}{E}_{i,g}\\) where Si is the gene score for cell i, \u03b2g is the effect size of B.29 on gene g in negative binomial regression and Ei,g is the expression of gene g in cell i. We then tested the interaction effect between B.29 and A edit outcome (defined as any non-synonymous edit in region A) on the B.29 gene score using a likelihood ratio test to compare the two models below:<\/p>\n<p>$$\\begin{array}{l}{\\rm{N}}{\\rm{u}}{\\rm{l}}{\\rm{l}}\\,{\\rm{m}}{\\rm{o}}{\\rm{d}}{\\rm{e}}{\\rm{l}}:{S}_{i}\\,=\\,{\\beta }_{0}{\\log }_{10}({{\\rm{n}}{\\rm{C}}{\\rm{o}}{\\rm{u}}{\\rm{n}}{\\rm{t}}{\\rm{s}}}_{i})+{\\beta }_{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}_{i}\\\\ \\,\\,\\,\\,\\,\\,+\\,{\\beta }_{A}{{\\rm{A}}{\\rm{e}}{\\rm{d}}{\\rm{i}}{\\rm{t}}{\\rm{e}}{\\rm{d}}}_{i}+{\\beta }_{B}B{.29}_{i}\\\\ {\\rm{F}}{\\rm{u}}{\\rm{l}}{\\rm{l}}\\,{\\rm{m}}{\\rm{o}}{\\rm{d}}{\\rm{e}}{\\rm{l}}:\\,{S}_{i}\\,=\\,{\\beta }_{0}{\\log }_{10}({{\\rm{n}}{\\rm{C}}{\\rm{o}}{\\rm{u}}{\\rm{n}}{\\rm{t}}{\\rm{s}}}_{i})\\\\ \\,+\\,{\\beta }_{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}_{i}+{\\beta }_{A}{{\\rm{A}}{\\rm{e}}{\\rm{d}}{\\rm{i}}{\\rm{t}}{\\rm{e}}{\\rm{d}}}_{i}+{\\beta }_{B}B{.29}_{i}+{\\beta }_{AB}{{\\rm{A}}{\\rm{e}}{\\rm{d}}{\\rm{i}}{\\rm{t}}{\\rm{e}}{\\rm{d}}}_{i}B{.29}_{i}\\end{array}$$<\/p>\n<p>To define individual genes with interaction effects, we tested the interaction between B.29 genotype and A editing outcome:<\/p>\n<p>$$\\begin{array}{l}{\\rm{N}}{\\rm{u}}{\\rm{l}}{\\rm{l}}\\,{\\rm{m}}{\\rm{o}}{\\rm{d}}{\\rm{e}}{\\rm{l}}:{\\rm{l}}{\\rm{n}}(X)\\\\ \\,=\\,{\\beta }_{0}{\\log }_{10}({{\\rm{n}}{\\rm{C}}{\\rm{o}}{\\rm{u}}{\\rm{n}}{\\rm{t}}{\\rm{s}}}_{i})+{\\beta }_{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}_{i}+{\\beta }_{A}{{\\rm{A}}{\\rm{e}}{\\rm{d}}{\\rm{i}}{\\rm{t}}{\\rm{e}}{\\rm{d}}}_{i}+{\\beta }_{B}B{.29}_{i}\\\\ {\\rm{F}}{\\rm{u}}{\\rm{l}}{\\rm{l}}\\,{\\rm{m}}{\\rm{o}}{\\rm{d}}{\\rm{e}}{\\rm{l}}:{\\rm{l}}{\\rm{n}}(X)\\\\ \\,=\\,{\\beta }_{0}{\\log }_{10}({{\\rm{n}}{\\rm{C}}{\\rm{o}}{\\rm{u}}{\\rm{n}}{\\rm{t}}{\\rm{s}}}_{i})+{\\beta }_{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}{{\\rm{p}}{\\rm{l}}{\\rm{a}}{\\rm{t}}{\\rm{e}}}_{i}+{\\beta }_{A}{{\\rm{A}}{\\rm{e}}{\\rm{d}}{\\rm{i}}{\\rm{t}}{\\rm{e}}{\\rm{d}}}_{i}+{\\beta }_{B}B{.29}_{i}\\\\ \\,\\,+\\,{\\beta }_{AB}{{\\rm{A}}{\\rm{e}}{\\rm{d}}{\\rm{i}}{\\rm{t}}{\\rm{e}}{\\rm{d}}}_{i}B{.29}_{i}\\end{array}$$<\/p>\n<p>In this analysis, we only tested the genes that are significantly associated with B.29 genotype in our original negative binomial regression models.<\/p>\n<p>Mixscape analysis<\/p>\n<p>We applied Mixscape<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 12\" title=\"Papalexi, E. et al. Characterizing the molecular regulation of inhibitory immune checkpoints with multimodal single-cell screens. Nat. Genet. 53, 322&#x2013;331 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#ref-CR12\" id=\"ref-link-section-d21541635e7794\" rel=\"nofollow noopener\" target=\"_blank\">12<\/a> separately for each cell type to the FBXO11 and CD45 experiments. In each context, we ran Mixscape with three types of input to the model: experimental condition (CRISPR\/control, genotyping unaware); dosage of 0 versus 1 or 2 (genotyping aware); and dosage of 0 or 1 versus 2 (genotyping aware). For processing the data for Mixscape, we followed the default RNA processing, including TP10K library size normalization, log transformation, variance stabilizing transformation selection of variable genes, variance scaling and centring of data, and PCA. We provided 40 PCs to Mixscape for calculation of the perturbation signature. To run Mixscape, we set the number of nearest neighbours for the analysis to 10, defined a log fold change threshold of 0.05 for inclusion of differentially expressed genes, and required differential expression of 2 or more genes to predict perturbation status.<\/p>\n<p>To identify differentially expressed genes between predicted knockout and non-perturbed predictions by Mixscape, we performed analogous linear modelling of gene expression, as in our model associating genotype dosage with expression. However, in the Mixscape-based approach, expression association with Mixscape-predicted status was performed as below.<\/p>\n<p>For the PTPRC experiment:<\/p>\n<p>$$\\begin{array}{c}X \\sim {\\rm{Negative}}\\;{\\rm{binomial}}({{\\rm{Gene}}}_{i})\\\\ {\\rm{Null}}\\;{\\rm{model}}:{\\rm{ln}}(X)=1+{\\log }_{10}({{\\rm{nCounts}}}_{i})+{\\beta }_{{\\rm{plate}}}{{\\rm{plate}}}_{i}+{\\beta }_{{\\rm{individual}}}{{\\rm{Individual}}}_{i}\\\\ {\\rm{Full}}\\;{\\rm{model}}\\;{\\rm{Mixscape}}:{\\rm{ln}}(X)=1+{\\log }_{10}({{\\rm{nCounts}}}_{i})+{\\beta }_{{\\rm{plate}}}{{\\rm{plate}}}_{i}+{\\beta }_{{\\rm{individual}}}{{\\rm{Individual}}}_{i}+{\\beta }_{{\\rm{Mixscape}}}{{\\rm{Mixscape}}}_{i}\\end{array}$$<\/p>\n<p>For the FBXO11 experiment:<\/p>\n<p>$$\\begin{array}{c}X \\sim {\\rm{Negative}}\\;{\\rm{binomial}}({{\\rm{Gene}}}_{i})\\\\ {\\rm{Null}}\\;{\\rm{model}}:{\\rm{ln}}(X)=1+{\\log }_{10}({{\\rm{nCounts}}}_{i})+{\\beta }_{{\\rm{plate}}}{{\\rm{plate}}}_{i}\\\\ {\\rm{Full}}\\;{\\rm{model}}\\;{\\rm{Mixscape}}:{\\rm{ln}}(X)=1+{\\log }_{10}({{\\rm{nCounts}}}_{i})+{\\beta }_{{\\rm{plate}}}{{\\rm{plate}}}_{i}+{\\beta }_{{\\rm{mixscape}}}{{\\rm{Mixscape}}}_{i}\\end{array}.$$<\/p>\n<p>Demultiplexing donor<\/p>\n<p>Samples were genotyped on an Illumina Multi-Ethnic Genotyping Array (MEGA) or Global Screening Array (GSA). Genotyping quality control, haplotype phasing and whole-genome imputation were done for MEGA as previously described<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 47\" title=\"Sakaue, S. et al. Tissue-specific enhancer&#x2013;gene maps from multimodal single-cell data identify causal disease alleles. Nat. Genet. 56, 615&#x2013;626 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#ref-CR47\" id=\"ref-link-section-d21541635e8505\" rel=\"nofollow noopener\" target=\"_blank\">47<\/a>. Analogous steps were performed for GSA, with further genotyping quality control including PHWE\u2009&gt;\u20091\u2009\u00d7\u200910\u221210, sample relatedness (IBD)\u2009&lt;\u20090.9 and sample heterozygosity rate\u2009&gt;\u20090.217. All imputed variants were filtered on minor allele frequency\u2009&gt;\u20090.01 and imputation accuracy (Rsq)\u2009&gt;\u20090.9 in both datasets. Variants were further pruned to the intersection of filtered variants between the two genotyping arrays. Non-biallelic variants and variants in sex chromosomes or non-exonic regions were also removed. Demuxlet (v.1.0) was run with default parameters to predict donors from genotypes. Cell donor identity was assigned using the output column SNG.1ST, corresponding to the most likely singlet identity. Cells with ambiguous donor assignments were excluded.<\/p>\n<p>Gene set enrichment analysis<\/p>\n<p>We performed gene set enrichment using fgsea on the results from the linear models predicting differentially expressed genes associated with dosage in the CD45 experiment and polarization status in the IL2RA experiment. We included all genes in the enrichment analysis. Gene ranks were defined according to the sign of the effect size of the associated variable (for directionality) multiplied by \u2212log10P. For the CD45 analysis, we included the MSigDB immunological signatures (C7) pathways<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 48\" title=\"Liberzon, A. et al. Molecular signatures database (MSigDB) 3.0. Bioinformatics 27, 1739&#x2013;1740 (2011).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#ref-CR48\" id=\"ref-link-section-d21541635e8527\" rel=\"nofollow noopener\" target=\"_blank\">48<\/a> that were specific to CD4 T cells. We then filtered to the pathways that included between 15 and 500 genes. For the IL2RA analysis, we defined a Treg and TH1 pathway by determining differentially expressed genes in bulk RNA-seq from a previously published source of in vitro polarized T cells<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 37\" title=\"Cano-Gamez, E. et al. Single-cell transcriptomics identifies an effectorness gradient shaping the response of CD4+ T cells to cytokines. Nat. Commun. 11, 1801 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#ref-CR37\" id=\"ref-link-section-d21541635e8539\" rel=\"nofollow noopener\" target=\"_blank\">37<\/a>. To perform differential expression analysis between Treg and TH1 samples, we ran DESeq2 with default parameters on the raw counts matrix. We added genes to the polarization pathways if they were nominally significant (P\u2009&lt;\u20090.01) and positively associated (fold change\u2009&gt;\u20091) with the given polarization condition.<\/p>\n<p>Base editor mRNAs<\/p>\n<p>Base editor mRNAs were generated by in vitro transcription using the HiScribe T7 High-Yield RNA synthesis kit (NEB, E2040S) by the method described previously<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 49\" title=\"Chen, P. J. et al. Enhanced prime editing systems by manipulating cellular determinants of editing outcomes. Cell 184, 5635&#x2013;5652 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#ref-CR49\" id=\"ref-link-section-d21541635e8558\" rel=\"nofollow noopener\" target=\"_blank\">49<\/a>. NEBnext polymerase was used to PCR amplify template plasmids and install a functional T7 promoter and a 120-nucleotide polyadenine tail. Transcription reactions were set up with complete substitution of uracil by N1-methylpseudouridine (Trilink BioTechnologies, N-1080) and co-transcriptional 5\u2032 capping using the CleanCap AG analogue (Trilink BioTechnologies, N-7113) to generate a 5\u2032 Cap1 structure. mRNAs were purified using ethanol precipitation according to the kit&#8217;s instructions, dissolved in nuclease-free water and normalized to a concentration of 2\u2009\u00b5g\u2009\u00b5l\u22121 using Nanodrop RNA quantification of diluted test samples.<\/p>\n<p>Reporting summary<\/p>\n<p>Further information on research design is available in the\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09313-3#MOESM2\" rel=\"nofollow noopener\" target=\"_blank\">Nature Portfolio Reporting Summary<\/a> linked to this article.<\/p>\n","protected":false},"excerpt":{"rendered":"Cell-line cultures and genomic editing HH cutaneous T cell lines (ATCC, CRL-2105), Jurkat E6-1 (ATCC, TIB-152) and Daudi&hellip;\n","protected":false},"author":2,"featured_media":16518,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[25],"tags":[64,63,17818,17819,336,17820,1320,1321,1733,128,17821],"class_list":{"0":"post-16517","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-genetics","8":"tag-au","9":"tag-australia","10":"tag-functional-genomics","11":"tag-gene-regulation-in-immune-cells","12":"tag-genetics","13":"tag-high-throughput-screening","14":"tag-humanities-and-social-sciences","15":"tag-multidisciplinary","16":"tag-rna-sequencing","17":"tag-science","18":"tag-transcriptomics"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts\/16517","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/comments?post=16517"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts\/16517\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/media\/16518"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/media?parent=16517"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/categories?post=16517"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/tags?post=16517"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}