{"id":93656,"date":"2025-10-21T00:20:12","date_gmt":"2025-10-21T00:20:12","guid":{"rendered":"https:\/\/www.newsbeep.com\/ie\/93656\/"},"modified":"2025-10-21T00:20:12","modified_gmt":"2025-10-21T00:20:12","slug":"modeling-heterogeneity-in-single-cell-perturbation-states-enhances-detection-of-response-eqtls","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/ie\/93656\/","title":{"rendered":"Modeling heterogeneity in single-cell perturbation states enhances detection of response eQTLs"},"content":{"rendered":"<p>Cohorts<\/p>\n<p>We used published data (GEO accession no. <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/geo\/query\/acc.cgi?acc=GSE162632\" rel=\"nofollow noopener\" target=\"_blank\">GSE162632<\/a>) to study the effect of 12\u2009h of ex vivo stimulation with either IAV or mock (control) in 90 samples of heterogeneous cell composition (PBMCs) using scRNA-seq. Similarly, we obtained data (EGA accession no. <a href=\"https:\/\/ega-archive.org\/studies\/EGAS00001005376\" rel=\"nofollow noopener\" target=\"_blank\">EGAS00001005376<\/a>) of 120 PBMC samples infected for 24\u2009h with CA, PA, MTB or mock (control) from the European Genome-Phenome Archive under a Lifelines DEEP DAG2+ Project (LLDEEP DAG2+) data access agreement.<\/p>\n<p>Genotype quality control<\/p>\n<p>The original Randolph et al. study<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 24\" title=\"Randolph, H. E. et al. Genetic ancestry effects on the response to viral infection are pervasive but cell type specific. Science 374, 1127&#x2013;1133 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR24\" id=\"ref-link-section-d113710646e2207\" rel=\"nofollow noopener\" target=\"_blank\">24<\/a> (IAV perturbation) used 4\u00d7 low-pass whole-genome sequencing to call genotypes of 89 people\u2014a total of 78,111,311 genome-wide variants. Genotypes with a posterior probability &lt;\u20090.9 were considered as missing data. We removed variants with a missing call rate &gt;10% and only included variants with a minor allele frequency \u22655% and Hardy-Weinberg equilibrium P\u2009&gt;\u20091\u2009\u00d7\u200910\u22125. In total, for the IAV perturbation dataset we had 5,194,816 genetic variants genome-wide that passed quality control (QC), and these variants were used in the eQTL mapping. Furthermore, we selected 508,883 independent variants using PLINK<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 52\" title=\"Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559&#x2013;575 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR52\" id=\"ref-link-section-d113710646e2216\" rel=\"nofollow noopener\" target=\"_blank\">52<\/a> (\u2013indep-pairwise 50 5 0.2), and then we performed PC analysis (PCA) using the R package SNPRelate<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 53\" title=\"Zheng, X. et al. A high-performance computing toolset for relatedness and principal component analysis of SNP data. Bioinformatics 28, 3326&#x2013;3328 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR53\" id=\"ref-link-section-d113710646e2220\" rel=\"nofollow noopener\" target=\"_blank\">53<\/a>.<\/p>\n<p>The original Oelen et al. study<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 22\" title=\"Oelen, R. et al. Single-cell RNA-sequencing of peripheral blood mononuclear cells reveals widespread, context-specific gene expression regulation upon pathogenic exposure. Nat. Commun. 13, 3267 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR22\" id=\"ref-link-section-d113710646e2227\" rel=\"nofollow noopener\" target=\"_blank\">22<\/a> (CA, PA and MTB perturbations) used genotype data from the HumanCytoSNP-12 BeadChip and imputed with the Haplotype Reference Consortium reference panel of 120 people at a total of 7,249,882 genome-wide variants mapped on the Hg19 reference genome. For the main analysis we included variants with missing call rate &lt;10%, minor allele frequency\u2009\u22655% and Hardy-Weinberg equilibrium P\u2009&gt;\u20091\u2009\u00d7\u200910\u22125. The variants that passed the filtering step were then lifted over to the hg38 reference genome using Picard tools. In total, for this dataset, we had 3,962,615 variants after QC, which were used in the eQTL mapping. We selected 292,681 independent variants using PLINK<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 52\" title=\"Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559&#x2013;575 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR52\" id=\"ref-link-section-d113710646e2236\" rel=\"nofollow noopener\" target=\"_blank\">52<\/a> (\u2013indep-pairwise 50 5 0.2), and then we performed PCA using SNPRelate<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 53\" title=\"Zheng, X. et al. A high-performance computing toolset for relatedness and principal component analysis of SNP data. Bioinformatics 28, 3326&#x2013;3328 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR53\" id=\"ref-link-section-d113710646e2240\" rel=\"nofollow noopener\" target=\"_blank\">53<\/a>. For the secondary analysis, we selected the 3,554 variants included in the Supplementary Table 9 of Oelen et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 22\" title=\"Oelen, R. et al. Single-cell RNA-sequencing of peripheral blood mononuclear cells reveals widespread, context-specific gene expression regulation upon pathogenic exposure. Nat. Commun. 13, 3267 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR22\" id=\"ref-link-section-d113710646e2245\" rel=\"nofollow noopener\" target=\"_blank\">22<\/a> and lifted over 2,347 variants from the hg19 to hg38 reference genome.<\/p>\n<p>Single-cell mRNA quality control<\/p>\n<p>We performed quality control of the 236,993 PBMCs from the 12-h perturbation experiment with IAV. These cells had &lt;20% reads that mapped to mitochondrial genes. We removed ten influenza IAV genes from the original 19,248 genes in the count matrix. We selected a total of 208,223 cells with &gt;500 expressed genes for the analysis. For the 24-h perturbations with CA, PA and MTB, there were, in total, 493,061 cells and 33,538 genes measured. As in the IAV perturbation study, we selected 475,333 cells with &lt;20% reads mapping to mitochondrial genes and &gt;500 detected genes.<\/p>\n<p>Then, for each perturbation condition (NP or P) in each study, we normalized the counts for each cell, scaled each gene\u2019s expression, and cosine normalized. We selected the 3,000 most variable genes based on the variance stabilizing transformation and conducted PCA using an implicitly restarted Lanczos method implemented in the irlba R package. Then, we controlled the effects of batch and sample donor using Harmony<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 54\" title=\"Korsunsky, I. et al. Fast, sensitive and accurate integration of single-cell data with Harmony. Nat. Methods 16, 1289&#x2013;1296 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR54\" id=\"ref-link-section-d113710646e2260\" rel=\"nofollow noopener\" target=\"_blank\">54<\/a> (v.0.1.0) (parameters: \u03b8donor\u2009=\u20091, \u03b8batch\u2009=\u20091). The top 20 hPCs were used for downstream analysis. For cell-type annotations, we used the originally published cluster labels.<\/p>\n<p>Estimating the continuous perturbation state of a cell<\/p>\n<p>In contrast to traditional models that consider only the discrete perturbation state imposed by the experimental design, we define a continuous score to quantify the degree of perturbation response of each cell. The underlying assumption is that perturbation may induce changes in the transcriptional profile of a cell that are proportional to the degree of response of the cell. Similarly, experimental conditions may induce changes in the transcriptional profile of NP cells<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"van den Brink, S. C. et al. Single-cell sequencing reveals dissociation-induced gene expression in tissue subpopulations. Nat. Methods 14, 935&#x2013;936 (2017).\" href=\"#ref-CR55\" id=\"ref-link-section-d113710646e2280\">55<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Mattei, D. et al. Enzymatic dissociation induces transcriptional and proteotype bias in brain cell populations. Int. J. Mol. Sci. 21, 7944 (2020).\" href=\"#ref-CR56\" id=\"ref-link-section-d113710646e2280_1\">56<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Machado, L., Relaix, F. &amp; Mourikis, P. Stress relief: emerging methods to mitigate dissociation-induced artefacts. Trends Cell Biol. 31, 888&#x2013;897 (2021).\" href=\"#ref-CR57\" id=\"ref-link-section-d113710646e2280_2\">57<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Barnes, M. G., Grom, A. A., Griffin, T. A., Colbert, R. A. &amp; Thompson, S. D. Gene expression profiles from peripheral blood mononuclear cells are sensitive to short processing delays. Biopreserv. Biobank 8, 153&#x2013;162 (2010).\" href=\"#ref-CR58\" id=\"ref-link-section-d113710646e2280_3\">58<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 59\" title=\"Andriamboavonjy, L. et al. Comparative analysis of methods to reduce activation signature gene expression in PBMCs. Sci. Rep. 13, 23086 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR59\" id=\"ref-link-section-d113710646e2283\" rel=\"nofollow noopener\" target=\"_blank\">59<\/a>. To systematically identify axes of variation in the transcriptional profile of P and NP cells, we use PCA and correct technical or biological effects with Harmony to obtain hPCs. We assume that one or several hPCs may capture the effect of perturbation in a cell.<\/p>\n<p>To define a continuous perturbation score, we used a penalized logistic regression with hPCs as independent variables to predict the log odds of being in the pool of P cells, which serves as a surrogate of the cell\u2019s degree of response to the experimental perturbation. Thus, for each perturbation model, we developed a continuous perturbation score of a cell by modeling the discrete perturbation state of a cell (i) (NP\u2009=\u20090, P\u2009=\u20091) as a function of the top 20 hPCs in a logistic Lasso regression model implemented in the R package glmnet<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 60\" title=\"Friedman, J., Hastie, T. &amp; Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33, 1&#x2013;22 (2010).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR60\" id=\"ref-link-section-d113710646e2293\" rel=\"nofollow noopener\" target=\"_blank\">60<\/a>. We fitted the model and used the LASSO penalty term with a tuning parameter (\u03bb) obtained from cross validation (cv.glmnet parameters: alpha\u2009=\u20091, lambda.1se)<\/p>\n<p>$$\\log \\left(\\frac{{\\pi }_{i}}{1-{\\pi }_{i}}\\right)={\\theta }_{i}+{\\beta }_{\\rm{PC}_{1}}{X}_{i,\\rm{PC}_{1}}+\\cdots +{\\beta }_{\\rm{PC}_{\\textit{K}}}{X}_{i,\\rm{PC}_{\\textit{K}}}$$<\/p>\n<p>\n                    (1)\n                <\/p>\n<p>where \\(K\\) is the number of PCs and \u03c0i is the probability of the cell (i) to be in the pool of P cells.<\/p>\n<p>We use this approach instead of focusing on predefined gene sets such as the interferon-stimulated gene pathway for viral perturbations, which may not accurately represent the changes on the transcriptome in specific tissues, cells or perturbation experiments.<\/p>\n<p>Cell-type-specific perturbation score<\/p>\n<p>For each perturbation experiment, cell-type-specific perturbation scores for each main cell-type presented in PBMCs were generated by subsetting the dataset by cell-type and then performing the same steps as described before: gene normalization, scale, cosine normalization, selection of the 3,000 most variable genes (variance stabilizing transformation), PCA, harmony and Lasso regression using the top 20 corrected expression hPCs as predictors in the logistic Lasso regression model.<\/p>\n<p>Correspondence of the perturbation score with canonical protein marker of response to perturbation<\/p>\n<p>We evaluate the ability of the perturbation score generated using the penalized logistic regression to capture the response to perturbation. We re-analyzed a dataset of PBMCs perturbed with either anti CD3 and anti CD28 and LPS, which profiled cells using scRNA-seq as well as surface protein markers (CITE-seq)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 61\" title=\"Lawlor, N. et al. Single cell analysis of blood mononuclear cells stimulated through either LPS or anti-CD3 and anti-CD28. Front. Immunol. 12, 636720 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR61\" id=\"ref-link-section-d113710646e2553\" rel=\"nofollow noopener\" target=\"_blank\">61<\/a>. Thus, for each condition we calculated the perturbation score using the scRNA-seq data as described before. Then, we evaluate the Pearson correlation between the perturbation score of a cell and the expression of surface protein markers of activation.<\/p>\n<p>Gene pathway enrichment analysis<\/p>\n<p>For each perturbation experiment, we calculated the Pearson correlation between the 3,000 most variable genes and the perturbation score of a cell. We selected genes with a correlation Q\u2009&lt;\u20090.05 (3,000 tests). Then, we used these correlation values to rank the genes and performed a gene pathway enrichment analysis using the fgsea R package<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 27\" title=\"Korotkevich, G. et al. Fast gene set enrichment analysis. Preprint at bioRxiv &#010;                https:\/\/doi.org\/10.1101\/060012&#010;                &#010;               (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR27\" id=\"ref-link-section-d113710646e2568\" rel=\"nofollow noopener\" target=\"_blank\">27<\/a> with a maximum gene set size of 500, a minimum size of 15, and 100,000 permutations. We include the following gene sets from the Pathway Interaction Database stored in MSigDB: Hallmark gene set and large collection of immunological conditions (C7).<\/p>\n<p>Prioritizing eQTLs in PBMCs bulk expression profiles<\/p>\n<p>For each individual participant and perturbation condition, we summed the gene expression counts for each gene across all cells to generate bulk profiles, then we averaged the gene expression from each perturbation condition by individual participant, resulting in the average bulk expression profiles. This allows us to use a traditional eQTL linear regression model with one sample per donor to prioritize eQTLs. For each perturbation experiment, we generated bulk expression profiles for the NP, P and average conditions. We normalized the expression profiles (log2 count per million\u2009+\u20091) and applied an inverse rank normal transformation (qnorm((rank(gene, na.last\u2009=\u2009\u2018keep\u2019) \u2212 0.5)\/sum(!is.na(gene))). Then, we removed the effect of age, sex if available, top five genotype PCs and 15 latent factors using PEER factor analysis<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 62\" title=\"Stegle, O., Parts, L., Durbin, R. &amp; Winn, J. A Bayesian framework to account for complex non-genetic factors in gene expression levels greatly increases power in eQTL studies. PLoS Comput. Biol. 6, e1000770 (2010).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR62\" id=\"ref-link-section-d113710646e2582\" rel=\"nofollow noopener\" target=\"_blank\">62<\/a>. We only included transcripts expressed in at least 50% of the samples. The corrected gene expression was used as the dependent variable in our bulk cis-eQTL mapping. We performed the cis-eQTL mapping by testing each gene with genetic variants within 1\u2009Mb of each gene\u2019s transcription starting site using a linear regression implemented in FastQTL<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 63\" title=\"Ongen, H., Buil, A., Brown, A. A., Dermitzakis, E. T. &amp; Delaneau, O. Fast and efficient QTL mapper for thousands of molecular phenotypes. Bioinformatics 32, 1479&#x2013;1485 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR63\" id=\"ref-link-section-d113710646e2592\" rel=\"nofollow noopener\" target=\"_blank\">63<\/a>. We use 1,000 permutations per locus and select the gene\u2013SNP pairs (eGenes) with a Q\u2009value\u2009&lt;\u20090.05. (Number of tests in IAV, 48,016,811; CA, 46,012,957; PA, 46,012,957; MTB, 44,896,173). In most cases, we prioritized the eQTL from the averaged NP and P profile as this produced the largest number of eQTLs (Q\u2009value\u2009&lt;\u20090.10) (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">3<\/a>). Standard approaches to fit the PME model are computationally expensive and impractical for the systematic genome-wide identification of eGenes in single-cell experiments.<\/p>\n<p>Identifying response eQTL effects<\/p>\n<p>For the single-cell approaches for both PBMCs or cell-type-specific analysis, we modeled the raw expression (E) counts in a cell (i) as a function of genotype allele dosage and an interaction with cell state accounting for confounders and technical batch using a described PME model<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 18\" title=\"Nathan, A. et al. Single-cell eQTL models reveal dynamic T cell state dependence of disease loci. Nature 606, 120&#x2013;128 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR18\" id=\"ref-link-section-d113710646e2617\" rel=\"nofollow noopener\" target=\"_blank\">18<\/a>. For all the analyses, we included age and sex as fixed effects where available. We also controlled for genetic population stratification by including the top five genotype PCs (gPC) as covariates. Similarly, we controlled for unwanted variation in gene expression by including the top five gene expression PCs (ePC). In addition, we included the number of unique molecular identifier (UMIs) and percentage of reads mapping to mitochondrial genes (MT%) as fixed effects, and donor (d) and technical batch (b) as random effects. We include two cell state variables, the discrete perturbation state (Discrete: where P\u2009=\u20091 for perturbation or 0 for no perturbation) and the residual perturbation score (rScore) (3) that avoid the collinearity between the Discrete variable and the perturbation score (Score) from Eq. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#Equ1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>. To estimate the reQTL effect, we fit a model that included the genotype main effect (G) and the interaction terms of G with the Discrete (GxDiscrete) and rScore (GxrScore) variables. In contrast, we only included the interaction term GxDiscrete in the simplified 1df-discrete model. For the cell-type-specific reQTL analysis, we subset the dataset by each principal cell-type and used the cell-type-specific score and applied the same model (2) as described above.<\/p>\n<p>$$\\begin{array}{l}\\log \\left({E}_{i}\\right)\\;=\\;\\theta +{\\beta }_{G}{X}_{d,{\\rm{geno}}}+{\\beta }_{{\\rm{age}}}{X}_{d,{\\rm{age}}}+{\\beta }_{{\\rm{nUMI}}}{X}_{i,{\\rm{nUMI}}}+{\\beta }_{{\\rm{MT}}}{X}_{i,{\\rm{MT}}}\\\\\\qquad\\qquad\\;+{\\beta }_{{\\rm{Discrete}}}{X}_{i,{\\rm{Discrete}}}+{\\beta }_{{\\rm{rScore}}}{X}_{i,{\\rm{rScore}}}+{\\beta }_{\\rm{Gx}{\\rm{rScore}}}{X}_{d,{\\rm{geno}}}{X}_{i,{\\rm{rScore}}}\\\\\\qquad\\quad\\quad\\;\\,+{\\beta }_{\\rm{Gx}{\\rm{Discrete}}}{X}_{d,{\\rm{geno}}}{X}_{i,{\\rm{Discrete}}}+\\mathop{\\sum}\\limits_{k=1}^{5}{\\beta }_{{g\\rm{PC}}_{k}}{X}_{d,{g\\rm{PC}}_{k}}\\\\\\qquad\\qquad\\;+\\mathop{\\sum}\\limits_{k=1}^{5}{\\beta }_{{e\\rm{PC}}_{k}}{X}_{i,{e\\rm{PC}}_{k}}+\\left(d\\right)+({k}_{b}|b),\\end{array}$$<\/p>\n<p>\n                    (2)\n                <\/p>\n<p>$${\\rm{Score}}=\\theta +{\\beta }_{{\\rm{Discrete}}}{X}_{i,{\\rm{Discrete}}}+{r\\;\\rm{Score}}.$$<\/p>\n<p>\n                    (3)\n                <\/p>\n<p>To map reQTLs in bulk profiles (i) (Pseudobulk LME) we modeled the residuals from the PEER factor analysis (rPFA) as a function of the genotype and the GxDiscrete term accounting for donor\u2019s age, sex and both the total numbers of UMIs and mean MT% across all cells per donor using an LME model. We included donor as a random effect in the LME model (equation (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#Equ4\" rel=\"nofollow noopener\" target=\"_blank\">4<\/a>)).<\/p>\n<p>$$\\begin{array}{l}{{\\rm{rPFA}}}_{i}\\;=\\;\\theta +{\\beta }_{G}{X}_{d,{\\rm{geno}}}+{\\beta }_{{\\rm{age}}}{X}_{d,{\\rm{age}}}+{\\beta}_{{\\rm{nUMI}}}{X}_{i,{\\rm{nUMI}}}+{\\beta }_{{\\rm{MT}}}{X}_{i,{\\rm{MT}}}\\\\\\qquad\\qquad+{\\beta }_{{\\rm{Discrete}}}{X}_{i,{\\rm{Discrete}}}+{\\beta}_{{\\rm{GxDiscrete}}}{X}_{d,{\\rm{geno}}}{X}_{i,{\\rm{Discrete}}}+\\left(d\\right)+\\varepsilon .\\end{array}$$<\/p>\n<p>\n                    (4)\n                <\/p>\n<p>We compared the significance of each of these full models against a null model with no interaction terms using the likelihood ratio test and considered significant reQTLs to be those with an LRT Q\u2009value\u2009&lt;\u20090.05. Furthermore, to rule out the possibility of spurious significant results, we performed 500 permutations per each discovered reQTL. For the permutation in the single-cell models, we shuffled values of the perturbation score and the discrete perturbation state randomly within each donor. For the pseudobulk LME model, we shuffled values of the perturbation state randomly across samples.<\/p>\n<p>To exclude spurious associations, for each gene we counted the proportion of LRT P\u2009values from the permutation experiments that were smaller than the discovery LRT P (permutation P) and removed genes with permutation P\u2009&lt;\u20090.05. To exclude genes where the PME model may not be appropriate due to high differential gene expression across cell states, we fit a PME model where we simulated null conditions by shuffling the perturbation score value randomly within each donor. We tested the significance of the interaction term under this null by comparing this model with a model with no GxScore term using the LRT ANOVA. For each eGene, we performed 100 permutations and removed eGenes where we observed a significant P\u2009value (LRT P\u2009&lt;\u20090.01) in more than five out of 100 permutations.<\/p>\n<p>In addition, we performed a secondary analysis for the CA, PA and MTB perturbation experiments based on the 1,825 eGenes prioritized by ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 22\" title=\"Oelen, R. et al. Single-cell RNA-sequencing of peripheral blood mononuclear cells reveals widespread, context-specific gene expression regulation upon pathogenic exposure. Nat. Commun. 13, 3267 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR22\" id=\"ref-link-section-d113710646e3656\" rel=\"nofollow noopener\" target=\"_blank\">22<\/a> and listed in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">9<\/a> and passing QC in our preprocessing. For each perturbation experiment, we estimated reQTL effects for this set of 1,825 eQTLs using each of the models described above (equations (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#Equ1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>) and (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#Equ3\" rel=\"nofollow noopener\" target=\"_blank\">3<\/a>)) and compared the significance against a null model with no interactions terms using the LRT and considered significant those with LRT Q\u2009value\u2009&lt;\u20090.05. Furthermore, for each perturbation experiment, we generated P\u2009values under the uniform distribution based on the number of eQTLs included in our secondary analysis. Then, we calculated the enrichment of significant reQTLs in the 2df-model by comparing the number of tests where we found 2df-model LRT P\u2009&lt;\u20090.001, &gt;0.001 and \u22640.01, &gt;0.01 and \u22640.05, &gt;0.05 and \u22640.1, and &gt;0.1 with those found under the uniform distribution using a \u03c72 test.<\/p>\n<p>Estimating the eQTL effect at levels of the perturbation score<\/p>\n<p>For any given reQTL, we selected cells either in the NP or P condition and calculated the 45th and 55th percentile of the perturbation score in that group. Then, we selected the cells with perturbation score values within the 45th to 55th percentile (50p) and estimated the eQTL effect in those cells by fitting a single-cell PME, where we modeled the raw gene counts (E) in a cell (i) as a function of the genotype (G) and the same cell and donor level fix and random covariates as described above (equation (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#Equ2\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>)) but without interaction terms (equation (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#Equ5\" rel=\"nofollow noopener\" target=\"_blank\">5<\/a>)). We consider the significance of the G term by comparing it with a null model with no G term using the LRT with 1\u2009d.f.<\/p>\n<p>$$\\begin{array}{l}\\log \\left({E}_{i}\\right)\\;=\\;\\theta +{\\beta }_{G}{X}_{d,{\\rm{geno}}}+{\\beta }_{{\\rm{age}}}{X}_{d,{\\rm{age}}}+{\\beta }_{{\\rm{nUMI}}}{X}_{i,{\\rm{nUMI}}}+{\\beta }_{{\\rm{MT}}}{X}_{i,{\\rm{MT}}}\\\\\\qquad\\qquad\\quad+\\mathop{\\sum}\\limits_{k=1}^{5}{\\beta}_{{g\\rm{PC}}_{k}}{X}_{d,{g\\rm{PC}}_{k}}+\\mathop{\\sum}\\limits_{k=1}^{5}{\\beta }_{{e\\rm{PC}}_{k}}{X}_{i,{e\\rm{PC}}_{k}}+\\left(d\\right)+({k}_{b}|b).\\end{array}$$<\/p>\n<p>\n                    (5)\n                <\/p>\n<p>To visualize the eQTL effect at the donor level, we aggregated the gene counts in the cells in the 50p by donor and then normalized the gene expression (log2 counts per million\u2009+\u20091). Then, we plotted the gene expression as a function of the eQTL\u2019s allele dosage.<\/p>\n<p>Delta betas between PBMC and cell-type-specific reQTL analysis<\/p>\n<p>First, based on the PBMCs analysis and for each main cell-type (that is, TCD4+, monocytes), we calculated the difference in the total eQTL between the 50th percentile in the P state versus the 50th percentile in the NP state (Delta 1). Furthermore, based on the cell-type-specific analysis (TCD4+ and TCD4+ perturbation score), we calculated the difference in the total eQTL between the 50th percentile in the P state versus the 50th percentile in the NP state (Delta 2). We only included eGenes with LRT Q\u2009value\u2009&lt;\u20090.05 in the cell-type-specific analysis. Then, for each main cell-type we calculated the Pearson correlation between Delta 1 and Delta 2.<\/p>\n<p>Composite eQTL effect in a cell<\/p>\n<p>We defined the total eQTL in a cell (i) as the sum of the genotype main effect (G) with the discrete interaction effect (GxDiscrete) and the interaction effect with the residual of the perturbation score (GxrScore), weighted by its corresponding value (Discrete, NP\u2009=\u20090, P\u2009=\u20091; Score, continuous perturbation score) (Eq. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#Equ6\" rel=\"nofollow noopener\" target=\"_blank\">6<\/a>).<\/p>\n<p>$${{\\beta }_{{\\rm{total}}}=\\beta }_{G}+{\\beta }_{{\\rm{GxDiscrete}}}{X}_{i}+{\\beta }_{{\\rm{GxrScore}}}{X}_{i}.$$<\/p>\n<p>\n                    (6)\n                <\/p>\n<p>Comparison of the IAV perturbation score and an interferon-alpha score<\/p>\n<p>As an alternative approach to quantify the response to perturbation, we developed a per cell interferon-alpha score based on a predefined gene set (MSigDB: Hallmark interferon-alpha response (INF-alpha))<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 64\" title=\"Liberzon, A. et al. The molecular signatures database hallmark gene set collection. Cell Syst. 1, 417&#x2013;425 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR64\" id=\"ref-link-section-d113710646e4265\" rel=\"nofollow noopener\" target=\"_blank\">64<\/a>. The selected genes (INF-alpha) were normalized, then we removed genes expressed in &lt;1% of the cells and then scaled the expression of the gene (mean\u2009=\u20090, variance\u2009=\u20091). Then, for each cell, we averaged the expression across genes. To compare reQTL detected by each score, we ran a reduced single-cell reQTL model where we tested only for the significance of the interaction effect between the genotype and the score (GxScore). We then evaluated the number of significant reQTL (LRT Q\u2009value\u2009&lt;\u20090.05) as well as the standardized effect size of the main eQTL effect (G) and interaction effects (GxScore) between the two approaches.<\/p>\n<p>Comparison of Poisson mixed effect and negative binomial models<\/p>\n<p>To confirm that the single-cell PME model was well calibrated, we fit a single-cell reQTL model where we modeled the raw expression counts in single cells as a function of genotype allele dosage and interaction terms equivalent to the PME 2df-model described in Eq. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#Equ1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>, using a negative binomial model implemented in the glmmTMB<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 65\" title=\"Brooks, M. E. et al. glmmTMB balances speed and flexibility among packages for zero-inflated generalized linear mixed modeling. R. J. 9, 378 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR65\" id=\"ref-link-section-d113710646e4283\" rel=\"nofollow noopener\" target=\"_blank\">65<\/a> R package. We calculated the correlation between the LRT P\u2009value from the original PME and the negative binomial model.<\/p>\n<p>Downsampling donors and cells<\/p>\n<p>To test the effect of sample size on the power to detect reQTLs in the single-cell models, we generated datasets where we removed cells or removed donors randomly. We generated five datasets and, in each, we removed either 10, 20, 30, 40 or 50% of cells randomly within each donor. For each dataset, we generated three replicates by using different random seeds. Similarly, we generated five datasets with three replicates each where for each set we randomly removed either 10, 20, 30, 40 or 50% of donors. Then, in each downsampled dataset, we estimated reQTL effects with the PME model by testing the originally prioritized eGenes and counting the number of reQTLs that reached an LRT Q\u2009value\u2009&lt;\u20090.05.<\/p>\n<p>Power to detect reQTL at different degrees of perturbation<\/p>\n<p>For each of the perturbation experiments (IAV, CA, PA and MTB), we generated three new datasets with different degrees of perturbation by sampling cells from the P condition (that is, cells exposed to IAV) at different ranges of the score (30th to 70th, 20th to 80th, and full distribution). We constrained the sampling so that each dataset contained the same number of P cells, and we kept all the NP cells. Then, we used the same set of prioritized eQTLs as the original analysis (IAV, 1,892; CA, 1,863; PA, 1,794; MTB, 1,355) and tested for reQTLs in single cells using both the 2df-model and the reduced 1df-discrete model. We compared the number of reQTLs that reached LRT Q\u2009value\u2009&lt;\u20090.05 between the two models.<\/p>\n<p>Similarly, for each perturbation experiment, we generated three additional new datasets by sampling P cells at different levels of the perturbation score (&lt;33rd, &gt;66th and the full distribution), again constraining the number of cells to be fixed between the datasets, and keeping all NP cells. We performed a reQTL analysis using both 2df-model and 1df-discrete models and compared the number of reQTLs that reached LRT Q\u2009value\u2009&lt;\u20090.05 between the two models.<\/p>\n<p>Comparison of the 2df-model with CellRegMap to find reQTLs in single cells<\/p>\n<p>We compared our reQTL framework with CellRegMap<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 19\" title=\"Cuomo, A. S. E. et al. CellRegMap: a statistical framework for mapping context&#x2010;specific regulatory variants using scRNA&#x2010;seq. Mol. Syst. Biol. 18, e10663 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR19\" id=\"ref-link-section-d113710646e4327\" rel=\"nofollow noopener\" target=\"_blank\">19<\/a> (CRM), a statistical framework that can identify eQTLs with dynamic effects across cell-states in single cells. CRM can use PCs as cell states; however, in our comparison, we applied CRM using the residual effect of the perturbation score and the discrete perturbation state as these are the same cell-states used in our 2df-model. In the CA perturbation experiment, we defined a set of ground-truth reQTLs. These were reQTLs detected by the original analysis by Oelen et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 22\" title=\"Oelen, R. et al. Single-cell RNA-sequencing of peripheral blood mononuclear cells reveals widespread, context-specific gene expression regulation upon pathogenic exposure. Nat. Commun. 13, 3267 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR22\" id=\"ref-link-section-d113710646e4331\" rel=\"nofollow noopener\" target=\"_blank\">22<\/a> and detected in our pseudobulk reQTL approach. Similarly, we defined a set of negative controls as those reQTLs not reported by Oelen et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 22\" title=\"Oelen, R. et al. Single-cell RNA-sequencing of peripheral blood mononuclear cells reveals widespread, context-specific gene expression regulation upon pathogenic exposure. Nat. Commun. 13, 3267 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR22\" id=\"ref-link-section-d113710646e4335\" rel=\"nofollow noopener\" target=\"_blank\">22<\/a> or our pseudobulk analysis (LRT Q\u2009value\u2009&gt;\u20090.2). In addition, for every ground-true reQTL, we randomly selected three negative controls with mean gene expression within the range of the mean gene expression of positive control \u00b1 (mean positive\/2). Based on these criteria, we had 35 positive reQTL and 105 negative controls, and we tested for reQTLs using our 2df-model and CRM with the default parameters. We corrected for multiple comparisons using a Bonferroni correction and compared the sensitivity and specificity of the two models.<\/p>\n<p>reQTLs with heterogeneous effects across cell types<\/p>\n<p>We tested for heterogeneous effects by comparing the effect size of each of the interaction terms between cell types. For each perturbation experiment, we selected cells from each main cell type present in the experiment (IAV: B, CD4+ T, CD8+ T, Monocytes, NK; CA-PA-MTB: B, CD4+ T, CD8+ T, Monocytes, NK, DC) and fitted the single-cell 2df-model with the GxDiscrete and GxScore terms. Then, for each reQTL, we used the summary statistics to compare the effect sizes of each of the interaction\u2019s terms across cell types. We tested whether the variability in the observed effect sizes is larger than the expected based on sampling variability alone using the Cochran\u2019s Q test (Q) implemented in the metafor R package (rma, method\u2009=\u2009\u2018FE,\u2019 weighted\u2009=\u2009TRUE). We defined response eQTLs with heterogeneous effects as those with a Cochrane\u2019s Q-test Q\u2009value\u2009&lt;\u20090.10 in at least one of the interaction terms.<\/p>\n<p>$$\\begin{array}{ccc}{\\beta }_{{\\rm{meta}}}=\\left(\\mathop{\\sum }\\limits_{k=1}^{K}\\frac{{\\beta }_{k}}{s{{e}_{k}}^{2}}\\right)\\left\/\\left(\\mathop{\\sum }\\limits_{k=1}^{K}\\frac{1}{s{{e}_{k}}^{2}}\\right)\\right.&amp;{\\rm{and}}&amp; Q=\\mathop{\\sum }\\limits_{k=1}^{K}\\frac{1}{s{{e}_{k}}^{2}}{\\left({\\beta }_{k}-{\\beta}_{{\\rm{meta}}}\\right)}^{2}\\end{array},$$<\/p>\n<p>where \\(K\\) is the number of cell types in the perturbation experiment, \\({\\beta }_{k}\\) is the effect size for each cell-type, and \\({\\beta }_{{\\rm{meta}}}\\) is the average effect size weighted by the inverse of the variance of the estimate on each cell type.<\/p>\n<p>Overlap with previous response eQTLs in response to IAV<\/p>\n<p>We compared the overlap between the reQTLs found in the IAV dataset using the 2df-model (all PBMCs and monocyte-specific) and the reQTLs found after IAV perturbation in primary monocytes reported originally in Table S2 by Quach et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 29\" title=\"Quach, H. et al. Genetic adaptation and Neandertal admixture shaped the immune system of human populations. Cell 167, 643&#x2013;656 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR29\" id=\"ref-link-section-d113710646e4754\" rel=\"nofollow noopener\" target=\"_blank\">29<\/a>. There were 286 eGenes present in both analyses. We compared the overlap of eGenes with significant reQTLs (LRT Q\u2009value\u2009&lt;\u20090.05) found in our PBMC analysis and the monocyte-specific analysis of the IAV dataset. Similarly, we selected eGenes with reported reQTLs P\u2009value from Quach et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 29\" title=\"Quach, H. et al. Genetic adaptation and Neandertal admixture shaped the immune system of human populations. Cell 167, 643&#x2013;656 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR29\" id=\"ref-link-section-d113710646e4764\" rel=\"nofollow noopener\" target=\"_blank\">29<\/a>. In the case of several reQTLs per gene reported in ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 29\" title=\"Quach, H. et al. Genetic adaptation and Neandertal admixture shaped the immune system of human populations. Cell 167, 643&#x2013;656 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR29\" id=\"ref-link-section-d113710646e4768\" rel=\"nofollow noopener\" target=\"_blank\">29<\/a>, we chose the result with the lowest reQTL P\u2009value.<\/p>\n<p>Colocalization analysis<\/p>\n<p>To find instances where a perturbation-dependent reQTL shared its causal variant with a GWAS locus, we performed a colocalization analysis using coloc<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 66\" title=\"Giambartolomei, C. et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 10, e1004383 (2014).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR66\" id=\"ref-link-section-d113710646e4784\" rel=\"nofollow noopener\" target=\"_blank\">66<\/a> (coloc.abf), a Bayesian approach that estimates the posterior probability of having a shared true causal variant between the eQTL and the GWAS locus. We used summary statistics from nine GWAS on immune traits (type\u20091 diabetes<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 67\" title=\"Chiou, J. et al. Interpreting type 1 diabetes risk with genetics and single-cell epigenomics. Nature 594, 398&#x2013;402 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR67\" id=\"ref-link-section-d113710646e4788\" rel=\"nofollow noopener\" target=\"_blank\">67<\/a>, multiple sclerosis<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 68\" title=\"International Multiple Sclerosis Genetics Consortium (IMSGC). Analysis of immune-related loci identifies 48 new susceptibility variants for multiple sclerosis. Nat. Genet. 45, 1353&#x2013;1360 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR68\" id=\"ref-link-section-d113710646e4792\" rel=\"nofollow noopener\" target=\"_blank\">68<\/a>, asthma<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 69\" title=\"Demenais, F. et al. Multiancestry association study identifies new asthma risk loci that colocalize with immune-cell enhancer marks. Nat. Genet. 50, 42&#x2013;53 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR69\" id=\"ref-link-section-d113710646e4796\" rel=\"nofollow noopener\" target=\"_blank\">69<\/a>, RA<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 1\" title=\"Ishigaki, K. et al. Multi-ancestry genome-wide association analyses identify novel genetic mechanisms in rheumatoid arthritis. Nat. Genet. 54, 1640&#x2013;1651 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR1\" id=\"ref-link-section-d113710646e4800\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>, SLE<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 33\" title=\"Bentham, J. et al. Genetic association analyses implicate aberrant regulation of innate and adaptive immunity genes in the pathogenesis of systemic lupus erythematosus. Nat. Genet. 47, 1457&#x2013;1464 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR33\" id=\"ref-link-section-d113710646e4805\" rel=\"nofollow noopener\" target=\"_blank\">33<\/a>, inflammatory bowel disease<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 70\" title=\"Liu, J. Z. et al. Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations. Nat. Genet. 47, 979&#x2013;986 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR70\" id=\"ref-link-section-d113710646e4809\" rel=\"nofollow noopener\" target=\"_blank\">70<\/a>, Crohn\u2019s disease<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 70\" title=\"Liu, J. Z. et al. Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations. Nat. Genet. 47, 979&#x2013;986 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR70\" id=\"ref-link-section-d113710646e4813\" rel=\"nofollow noopener\" target=\"_blank\">70<\/a>, ulcerative colitis<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 70\" title=\"Liu, J. Z. et al. Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations. Nat. Genet. 47, 979&#x2013;986 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR70\" id=\"ref-link-section-d113710646e4817\" rel=\"nofollow noopener\" target=\"_blank\">70<\/a>, psoriasis<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 71\" title=\"Tsoi, L. C. et al. Identification of 15 new psoriasis susceptibility loci highlights the role of innate immunity. Nat. Genet. 44, 1341&#x2013;1348 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR71\" id=\"ref-link-section-d113710646e4821\" rel=\"nofollow noopener\" target=\"_blank\">71<\/a>) and three GWAS on nonimmune traits (coronary artery disease<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 72\" title=\"van der Harst, P. &amp; Verweij, N. Identification of 64 novel genetic loci provides an expanded view on the genetic architecture of coronary artery disease. Circ. Res. 122, 433&#x2013;443 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR72\" id=\"ref-link-section-d113710646e4825\" rel=\"nofollow noopener\" target=\"_blank\">72<\/a>, type\u20092 diabetes<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 73\" title=\"Xue, A. et al. Genome-wide association analyses identify 143 risk variants and putative regulatory mechanisms for type 2 diabetes. Nat. Commun. 9, 2941 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR73\" id=\"ref-link-section-d113710646e4830\" rel=\"nofollow noopener\" target=\"_blank\">73<\/a>, depression<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 74\" title=\"Howard, D. M. et al. Genome-wide association study of depression phenotypes in UK Biobank identifies variants in excitatory synaptic pathways. Nat. Commun. 9, 1470 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#ref-CR74\" id=\"ref-link-section-d113710646e4834\" rel=\"nofollow noopener\" target=\"_blank\">74<\/a>).<\/p>\n<p>We defined eQTLs as those with a nominal Q\u2009value\u2009&lt;\u20090.05 in the analysis with FastQTL. We ran coloc with the default parameters and defined colocalization as those loci with a posterior probability H4 (PPH4)\u2009&gt;\u20090.5. Furthermore, for those colocalized loci, we generated the 95% credible set and obtained the posterior probability of each variant within the credible set (snpH4) under the single causal variant assumption. We calculated the enrichment of colocalization in eGenes with reQTL effect compared to eGenes with no reQTL effect using Fisher\u2019s exact test.<\/p>\n<p>In addition, to find colocalization at different levels of the perturbation score, we defined tertiles of perturbation within the NP (T1\u2013T3) and P (T4\u2013T6) conditions. For example, in the IAV experiment, T1 was defined as NP cells with perturbation score values &lt;33rd percentile (\u22120.90), T2 was defined as NP cells with the perturbation score \u226533rd and below 66th percentile (\u22120.75) and T3 was defined as NP cells with a perturbation score \u2265 66th percentile. Then, for each tertile, we generated pseudobulk gene expression profiles, tested for cis-eQTL and ran coloc as described before.<\/p>\n<p>Reporting summary<\/p>\n<p>Further information on research design is available in the <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02344-6#MOESM2\" rel=\"nofollow noopener\" target=\"_blank\">Nature Portfolio Reporting Summary<\/a> linked to this article.<\/p>\n","protected":false},"excerpt":{"rendered":"Cohorts We used published data (GEO accession no. GSE162632) to study the effect of 12\u2009h of ex vivo&hellip;\n","protected":false},"author":2,"featured_media":93657,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[25],"tags":[3810,5209,4730,4731,5208,22786,1437,254,5207,61,60,82],"class_list":{"0":"post-93656","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-genetics","8":"tag-agriculture","9":"tag-animal-genetics-and-genomics","10":"tag-biomedicine","11":"tag-cancer-research","12":"tag-gene-function","13":"tag-gene-regulation","14":"tag-general","15":"tag-genetics","16":"tag-human-genetics","17":"tag-ie","18":"tag-ireland","19":"tag-science"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/posts\/93656","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/comments?post=93656"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/posts\/93656\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/media\/93657"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/media?parent=93656"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/categories?post=93656"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/tags?post=93656"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}