{"id":467663,"date":"2026-02-14T05:39:10","date_gmt":"2026-02-14T05:39:10","guid":{"rendered":"https:\/\/www.newsbeep.com\/us\/467663\/"},"modified":"2026-02-14T05:39:10","modified_gmt":"2026-02-14T05:39:10","slug":"genome-wide-association-analyses-highlight-the-role-of-the-intestinal-molecular-environment-in-human-gut-microbiota-variation","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/us\/467663\/","title":{"rendered":"Genome-wide association analyses highlight the role of the intestinal molecular environment in human gut microbiota variation"},"content":{"rendered":"<p>Ethical considerations<\/p>\n<p>The current study has been approved by the Swedish Ethical Review Authority (DNR 2022-06137-01, DNR 2024-01992-02). All participants in the respective studies below provided written informed consent. The Swedish Ethical Review Board approval numbers are: SCAPIS (DNR 2010-228-31M), SIMPLER (DNR 2009\/2066-32, DNR 2009\/1935-32, DNR 2010\/0148-32, DNR 2014\/892-31\/3), MDC (DNR 532\/2006, DNR 51-90) and MOS (DNR 2012-594). The PPP-Botnia study received approval from the Ethics Committee of Helsinki University (approval number 608\/2003). The HUNT study was approved by the local ethical review board (Regional committee for medical and health research ethics, Central Norway; REK-656785).<\/p>\n<p>Discovery studiesSCAPIS<\/p>\n<p>SCAPIS<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 56\" title=\"Bergstrom, G. et al. The Swedish CArdioPulmonary BioImage Study: objectives and design. J. Intern. Med. 278, 645&#x2013;659 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR56\" id=\"ref-link-section-d69842288e5301\" rel=\"nofollow noopener\" target=\"_blank\">56<\/a> is a multicenter cohort comprising 30,154 people aged 50\u201365\u2009years. For this analysis, 8,733 participants of European ancestry from the Malm\u00f6 and Uppsala sites with both gut microbiome and genotype data were included. At baseline, participants provided blood samples during the first visit and were asked to collect stool samples at home, storing them at \u221220\u2009\u00b0C until samples were brought to the study center at the second visit for storage at \u221280\u2009\u00b0C. DNA extracted from whole blood was used for genotyping. Birth year and sex were obtained from the Swedish population register. Information on dispensed antibiotics (Anatomical Therapeutic Chemical code J01) in the past 6\u2009months was obtained from the Swedish Prescribed Drug Register. BMI was defined as weight divided by height squared (kg\u2009m\u22122). Habitual alcohol and fiber intakes were estimated from a food frequency questionnaire (g\u2009day\u22121)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 57\" title=\"Christensen, S. E. et al. Two new meal- and web-based interactive food frequency questionnaires: validation of energy and macronutrient intake. J. Med. Internet Res. 15, e109 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR57\" id=\"ref-link-section-d69842288e5309\" rel=\"nofollow noopener\" target=\"_blank\">57<\/a>. Smoking behavior was assessed using a questionnaire and defined as current, former and never smoker.<\/p>\n<p>SIMPLER-V\u00e4stmanland and SIMPLER-Uppsala<\/p>\n<p>The Swedish Infrastructure for Medical Population-Based Life-Course and Environmental Research (SIMPLER; <a href=\"https:\/\/www.simpler4health.se\/w\/sh\/en\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/www.simpler4health.se\/w\/sh\/en<\/a>) includes data from two large, ongoing population-based studies: the Cohort of Swedish Men (COSM) and the Swedish Mammography Cohort (SMC)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 58\" title=\"Harris, H. et al. The Swedish mammography cohort and the cohort of Swedish men: study design and characteristics of two population-based longitudinal cohorts. OA Epidemiol. 1, 16 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR58\" id=\"ref-link-section-d69842288e5328\" rel=\"nofollow noopener\" target=\"_blank\">58<\/a>. The COSM initially enrolled 48,850 men born between 1918 and 1952 living in V\u00e4stmanland and \u00d6rebro counties in 1997. The SMC enrolled 66,651 women by sending invitations to all women born between 1914 and 1948 living in Uppsala and V\u00e4stmanland counties between 1987 and 1990. The current analysis is based on a subsample selected randomly from these studies who were invited for clinical examination with genotype and gut microbiome data: SIMPLER-V\u00e4stmanland (SIMPLER-V) and SIMPLER-Uppsala (SIMPLER-U). SIMPLER-V includes 4,515 COSM and SMC participants from V\u00e4stmanland examined between 2010 and 2019. SIMPLER-U includes 981 women from the county of Uppsala, examined between 2003 and 2009 (no stool collected) and re-examined between 2015 and 2019 (stool collected). Participants were asked to collect stool samples at home and store them at \u221220\u2009\u00b0C until they were brought to the test center, where samples were stored at \u221280\u2009\u00b0C. For 115 SIMPLER-V participants, the examination was conducted at home. DNA for genotyping was extracted from whole-blood samples. Information on dispensed antibiotics in the past 6\u2009months was obtained from the Swedish Prescribed Drug Register.<\/p>\n<p>Malm\u00f6 offspring study<\/p>\n<p>The Malm\u00f6 offspring study (MOS) includes participants aged \u226518\u2009years who are children or grandchildren of participants from the Malm\u00f6 Diet and Cancer Study (MDC)\u2014cardiovascular cohort, a subset of the larger MDC<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 59\" title=\"Brunkwall, L. et al. The Malmo Offspring Study (MOS): design, methods and first results. Eur. J. Epidemiol. 36, 103&#x2013;116 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR59\" id=\"ref-link-section-d69842288e5340\" rel=\"nofollow noopener\" target=\"_blank\">59<\/a>. Data collection in MOS began in 2013 and included 4,721 participants by 2020. The current study included 1,788 participants with genotype and gut microbiome data who attended baseline measurements between 2013 and 2017. Stool samples were collected and stored in home freezers (\u221220\u2009\u00b0C) until they were brought to the study sites, where they were stored at \u221280\u2009\u00b0C in the biobank. DNA for genotyping was extracted from whole-blood samples. Demographic information was collected using a questionnaire. Antibiotic use was self-reported and was also derived from the Swedish Prescribed Drug Register. Participants who were also part of SCAPIS were excluded from the MOS data.<\/p>\n<p>Replication cohortNorwegian Tr\u00f8ndelag Health Study<\/p>\n<p>The Tr\u00f8ndelag Health (HUNT) study is a long-term population-based health investigation conducted in the Tr\u00f8ndelag county, Norway<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 60\" title=\"Krokstad, S. et al. Cohort profile: the HUNT study, Norway. Int. J. Epidemiol. 42, 968&#x2013;977 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR60\" id=\"ref-link-section-d69842288e5357\" rel=\"nofollow noopener\" target=\"_blank\">60<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 61\" title=\"&#xC5;svold, B. O. et al. Cohort profile update: the HUNT study, Norway. Int. J. Epidemiol. 52, e80&#x2013;e91 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR61\" id=\"ref-link-section-d69842288e5360\" rel=\"nofollow noopener\" target=\"_blank\">61<\/a>. Four surveys have been used to collect data and biological samples from participants between 1984 and 2019. Approximately 230,000 people have participated in at least one survey. Of these, around 88,000 participants have undergone genotyping<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 62\" title=\"Brumpton, B. M. et al. The HUNT study: a population-based cohort for genetic research. Cell Genom. 2, 100193 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR62\" id=\"ref-link-section-d69842288e5364\" rel=\"nofollow noopener\" target=\"_blank\">62<\/a>. Among the 56,042 participants in the HUNT4 survey, 13,268 submitted stool samples for gut microbiome analysis on a filter paper. We included data from 12,652 HUNT4 participants of European descent having both genetic and gut microbiome data available. Sequencing and bioinformatic processing were performed analogously to SCAPIS and MOS at Cmbio (Copenhagen, Denmark).<\/p>\n<p>BMI and age distribution were compared between studies with density plots. A map depicting the study sites was generated with the maps v.3.4.2.1\u2009R package. Other studies (MDC, PPP-Botnia) are described in the <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">Supplementary Note<\/a>.<\/p>\n<p>Genetic analysisGenotyping and imputation<\/p>\n<p>DNA extraction, genotyping, pre-imputation quality control and imputation were performed separately in each cohort (SCAPIS, SIMPLER, MOS and HUNT) using high-density Illumina genotyping arrays and standard pipelines for variant calling and quality filtering. Quality control steps removed samples with poor genotyping quality, sex discrepancies, non-European ancestry and markers with high missingness or implausible allele frequencies. Imputation was performed using standard algorithms (EAGLE, minimac, PBWT) at established imputation servers against the Haplotype Reference Consortium (HRC) r1.1 panel. Detailed protocols for each cohort are provided in the <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">Supplementary Note<\/a>.<\/p>\n<p>Validation of genotypes using Sanger sequencing<\/p>\n<p>Direct genotyping using Sanger sequencing was performed to confirm the variants in <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/snp\/?term=rs10836441\" rel=\"nofollow noopener\" target=\"_blank\">rs10836441<\/a> (OR51E1\u2013OR51E2 locus) and <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/snp\/?term=rs4556017\" rel=\"nofollow noopener\" target=\"_blank\">rs4556017<\/a> (MUC12 locus). Details are given in the <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">Supplementary Note<\/a>.<\/p>\n<p>Stool DNA extraction and metagenomic sequencingSCAPIS, MOS and HUNT<\/p>\n<p>Stool DNA extraction and quality control for SCAPIS and MOS were performed by Cmbio and described in Sayols-Baixeras et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 63\" title=\"Sayols-Baixeras, S. et al. Streptococcus species abundance in the gut is linked to subclinical coronary atherosclerosis in 8973 participants from the SCAPIS cohort. Circulation 148, 459&#x2013;472 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR63\" id=\"ref-link-section-d69842288e5435\" rel=\"nofollow noopener\" target=\"_blank\">63<\/a>. In brief, samples were randomized on the box level, and DNA was extracted using the NucleoSpin 96 Soil extraction kit (Macherey\u2013Nagel). DNA extraction quality was evaluated using agarose gel electrophoresis. One negative and one positive (mock) control were added to each batch. DNA was quantified with fluorometric techniques both after DNA extraction and after library preparation. DNA extraction and quality control in samples from HUNT have been described in detail in Grahnemo et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 64\" title=\"Grahnemo, L. et al. Cross-sectional associations between the gut microbe Ruminococcus gnavus and features of the metabolic syndrome. Lancet Diabetes Endocrinol. 10, 481&#x2013;483 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR64\" id=\"ref-link-section-d69842288e5439\" rel=\"nofollow noopener\" target=\"_blank\">64<\/a>. In brief, three 6-mm disks were punched out from each filter card into a well. DNA was isolated using the Microbiome MagMAX Ultra kit (Thermo Fisher Scientific) after bead-beating. For all three studies, genomic DNA was fragmented and used for library construction using the NEBNext Ultra Library Prep Kit from Illumina. The prepared DNA libraries were purified and evaluated for fragment size distribution. Libraries from stool DNA were sequenced using the Illumina Novaseq 6000 instrument using 2\u2009\u00d7\u2009150-base-pair paired-end reads, generating on average 26.0, 25.3 and 22.9\u2009million read pairs, respectively, in SCAPIS, MOS and HUNT, with 97.8% of the sequenced bases having Phred quality score &gt;20 in SCAPIS and MOS, and more than 85% had a Phred quality score \u226530 in HUNT.<\/p>\n<p>SIMPLER study<\/p>\n<p>SIMPLER stool samples were thawed, a pea-size amount was aliquoted, and 800\u2009\u00b5l of DNA\/RNA Shield (Zymo Research) was added. These aliquots were refrozen and sent to the Centre for Translational Microbiome Research at the Karolinska Institute in Stockholm, Sweden for DNA extraction and metagenomic sequencing. DNA was extracted with the MagPure Stool kit (Magen Biotechnology). Each batch had one negative (DNA\/RNA Shield) and one positive control (Zymo mock). Stool DNA was fragmented and used for library construction using the MGI Easy FS DNA Library Prep Set kit. The prepared DNA libraries were evaluated with a TapeStation D1000 kit (Agilent), and the quantity was determined by QuantIT HighSensitivity dsDNA Assay on a Tecan Spark (Tecan). Equimolarly pooled libraries were circularized using the MGI Easy Circularization kit (MGI Tech) and sequenced using 2\u2009\u00d7\u2009150\u2009bp paired-end reads on the DNBSEQ G400 or T7 sequencing instrument (MGI) with an average yield of 51\u2009million reads\/sample.<\/p>\n<p>Microbial taxonomic profiling<\/p>\n<p>Read pairs mapped to the human reference genome GRCh38.p14 were removed using Bowtie2 (v.2.4.2)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 65\" title=\"Langmead, B. &amp; Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357&#x2013;359 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR65\" id=\"ref-link-section-d69842288e5460\" rel=\"nofollow noopener\" target=\"_blank\">65<\/a> in SCAPIS, MOS and HUNT, and against GRCh38 using Kraken\u20092 (ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 66\" title=\"Wood, D. E., Lu, J. &amp; Langmead, B. Improved metagenomic analysis with Kraken 2. Genome Biol. 20, 257 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR66\" id=\"ref-link-section-d69842288e5464\" rel=\"nofollow noopener\" target=\"_blank\">66<\/a>) in SIMPLER. Remaining bioinformatic processing, calculation of relative abundances and microbial taxonomic annotation were performed for all studies, including HUNT, at Cmbio using the CHAMP profiler based on the Human Microbiome Reference HMR05 catalog<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 12\" title=\"Pita, S. et al. CHAMP delivers accurate taxonomic profiles of the prokaryotes, eukaryotes, and bacteriophages in the human microbiome. Front. Microbiol. 15, 1425489 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR12\" id=\"ref-link-section-d69842288e5468\" rel=\"nofollow noopener\" target=\"_blank\">12<\/a> (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">Supplementary Note<\/a>). The taxonomic annotation was based on the Genome Taxonomy Database (GTDB) release 214 (release date: 28 April 2023). A rarefied species abundance table was produced by random sampling, without replacement, of 190,977 gene counts per sample in SCAPIS and MOS, and 641,964 gene counts per sample in SIMPLER. In total, 4,248 species were detected in the rarefied data in SCAPIS, 3,430 in MOS and 4,192 in SIMPLER-V, and 3,523 in SIMPLER-U. The alpha diversity measures\u2014Shannon index, inverse Simpson index and richness\u2014were calculated using rarefied data with the diversity function of the vegan R package (R v.4.3.1). Only the 921 species with prevalence &gt;5% in all four studies were kept for the species-level analyses. Those detected in fewer than 50% of samples in at least one cohort based on nonrarefied data were converted into a binary present\/absent variable. Those detected in more than 50% of samples in all four studies were rank-based inverse normal (RIN) transformed. Alpha diversity measures were also RIN-transformed, and, for significant findings, were also analyzed on a nontransformed scale for increased interpretability. The RIN transformation was performed separately for each cohort.<\/p>\n<p>Analysis of scRNA-seq data<\/p>\n<p>Gene expression data in cells derived from human duodenum, ileum and colon were obtained from Hickey et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 22\" title=\"Hickey, J. W. et al. Organization of the human intestine at single-cell resolution. Nature 619, 572&#x2013;584 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR22\" id=\"ref-link-section-d69842288e5483\" rel=\"nofollow noopener\" target=\"_blank\">22<\/a>, and mean gene expression was generated per their annotated clusters. The expression in EECs from human duodenal and ileal organoids was assessed as described<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 23\" title=\"Smith, C. A. et al. Single-cell transcriptomics of human organoid-derived enteroendocrine cell populations from the small intestine. J. Physiol. 603, 7751&#x2013;7763 (2025).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR23\" id=\"ref-link-section-d69842288e5487\" rel=\"nofollow noopener\" target=\"_blank\">23<\/a>. Briefly, a yellow fluorescent protein was inserted downstream of the Chromogranin A promoter by CRISPR\u2013Cas9 to label EECs. Fluorescent EECs were then isolated using flow cytometry and analyzed by 10\u00d7 scRNA-seq. Gene expression in EECs from the murine gastrointestinal tract was analyzed with scRNA-seq, as described in Smith et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 24\" title=\"Smith, C. A. et al. Single-cell transcriptomic atlas of enteroendocrine cells along the murine gastrointestinal tract. PLoS ONE 19, e0308942 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR24\" id=\"ref-link-section-d69842288e5491\" rel=\"nofollow noopener\" target=\"_blank\">24<\/a>.<\/p>\n<p>Statistical analysisGWAS of microbiome composition<\/p>\n<p>GWAS was performed separately for microbial alpha diversity and 921 species using REGENIE<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 67\" title=\"Mbatchou, J. et al. Computationally efficient whole-genome regression for quantitative and binary traits. Nat. Genet. 53, 1097&#x2013;1103 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR67\" id=\"ref-link-section-d69842288e5507\" rel=\"nofollow noopener\" target=\"_blank\">67<\/a> v.3.3 for each cohort (SCAPIS, SIMPLER-V, SIMPLER-U, MOS). A subset of the genotype datasets was created for the first REGENIE step to fit whole-genome regression models including only quality-controlled directly genotyped SNPs with MAF\u2009&gt;\u20091% and Hardy-Weinberg equilibrium P\u2009&lt;\u20091\u2009\u00d7\u200910\u221215. For the second step, all variants with an information score &gt;0.7 were included in association analyses performed using logistic regression for binary variables and genetic variants with MAF\u2009&gt;\u20095% in all four cohorts, and linear regression for RIN-transformed variables and genetic variants with MAF\u2009&gt;\u20091% in all four cohorts. Covariates were sex, age, age2, plate and genetic principal components (PC) 1\u201310. The PCs were calculated in unrelated samples, separately for each cohort, with PLINK<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 68\" title=\"Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR68\" id=\"ref-link-section-d69842288e5516\" rel=\"nofollow noopener\" target=\"_blank\">68<\/a> using an LD-pruned dataset, and all samples were then projected onto these components. In SCAPIS and MOS, plate represents metagenomics DNA extraction plate, whereas in SIMPLER it means the metagenomic aliquoting plate. Plate, age and sex were included to increase precision and power. For SCAPIS, the site was accounted for by the plate variable because plates were nested into the site variable. Based on previous nonlinear associations between age and microbiome<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 69\" title=\"Odamaki, T. et al. Age-related changes in gut microbiota composition from newborn to centenarian: a cross-sectional study. BMC Microbiol. 16, 90 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR69\" id=\"ref-link-section-d69842288e5520\" rel=\"nofollow noopener\" target=\"_blank\">69<\/a> and our results from a naive linear model for the association between age and microbial species, we opted to include age also as age2. REGENIE accounts for population stratification, but to account for any residual bias, we also included genetic PCs\u20091\u201310 in the model<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 70\" title=\"Mbatchou, J. Could regenie distinguish quantitative and binary covariates automatically? #61 GitHub &#010;                https:\/\/github.com\/rgcgithub\/regenie\/issues\/61#issuecomment-735800868&#010;                &#010;               (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR70\" id=\"ref-link-section-d69842288e5527\" rel=\"nofollow noopener\" target=\"_blank\">70<\/a>. Cohort-specific results were meta-analyzed using the inverse-variance weighted fixed-effects method in METAL<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 71\" title=\"Willer, C. J., Li, Y. &amp; Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190&#x2013;2191 (2010).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR71\" id=\"ref-link-section-d69842288e5531\" rel=\"nofollow noopener\" target=\"_blank\">71<\/a> v.2011-03-25. Independent loci were determined using LD clumping (r2 0.001, window 10\u2009Mb) in PLINK<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 68\" title=\"Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR68\" id=\"ref-link-section-d69842288e5539\" rel=\"nofollow noopener\" target=\"_blank\">68<\/a> v.2.00-alpha-5-20230923 with SCAPIS dosages used to determine the correlation structure. Variant-alpha diversity associations with P\u2009&lt;\u20091.7\u2009\u00d7\u200910\u22128 and variant-species associations with P\u2009&lt;\u20095.4\u2009\u00d7\u200910\u221211 were considered study-wide-significant. This threshold was based on a Bonferroni correction of the conventional genome-wide threshold of 5\u2009\u00d7\u200910\u22128 for three alpha diversity metrics and 921 species tested. Confidence intervals for the I2 statistic were calculated using the metagen function of the meta v.6.5-0\u2009R package. The loci were annotated using the Open Targets Genetics<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 72\" title=\"Mountjoy, E. et al. An open approach to systematically prioritize causal variants and genes at all published human GWAS trait-associated loci. Nat. Genet. 53, 1527&#x2013;1533 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR72\" id=\"ref-link-section-d69842288e5561\" rel=\"nofollow noopener\" target=\"_blank\">72<\/a> v.22.10 database (variant index, variant to gene and variant to trait annotations). Heritability was determined using SumHer<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 73\" title=\"Speed, D. &amp; Balding, D. J. SumHer better estimates the SNP heritability of complex traits from summary statistics. Nat. Genet. 51, 277&#x2013;284 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR73\" id=\"ref-link-section-d69842288e5565\" rel=\"nofollow noopener\" target=\"_blank\">73<\/a> v.6 according to the GCTA heritability model, with SCAPIS dosages used to determine the correlation structure.<\/p>\n<p>Sensitivity analyses<\/p>\n<p>Sensitivity analyses were performed for the 149 genome-wide locus-species associations by (1) excluding participants with antibiotic use in the 6\u2009months before sampling, (2) excluding participants with self-reported IBD, (3) retaining an unrelated subset where no participant had third degree relatedness or closer with any other participant using a KING-robust kinship estimator threshold of 0.0442, (4) retaining one random spouse in SIMPLER and one random participant living at the same address in MOS to assess cohabitation (SCAPIS was removed for this analysis), (5) using centered log ratio plus RIN transformation for species analyzed using linear regression, (6) using Firth correction for species analyzed using logistic regression, (7) removing age2 from the covariates, (8) analyzing SCAPIS-Uppsala and SCAPIS-Malm\u00f6 as two separate cohorts in the meta-analysis and (9\u201312) adding BMI, alcohol intake, smoking or fiber intake, respectively, as covariates. The analyses adding alcohol, smoking and fiber were performed in SCAPIS only, where data on these variables were nearly complete.<\/p>\n<p>External replication<\/p>\n<p>Associations passing the study-wide threshold were assessed in HUNT by applying the same models as in the Swedish cohorts and using REGENIE with the same model specifications. We further assessed the validity of our findings using summary statistics from the published FINRISK<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 9\" title=\"Qin, Y. et al. Combined effects of host genetics and diet on human gut microbiota and incident disease in a single population cohort. Nat. Genet. 54, 134&#x2013;142 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR9\" id=\"ref-link-section-d69842288e5587\" rel=\"nofollow noopener\" target=\"_blank\">9<\/a> and Dutch Microbiome Project<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\" title=\"Lopera-Maya, E. A. et al. Effect of host genetics on the gut microbiome in 7,738 participants of the Dutch Microbiome Project. Nat. Genet. 54, 143&#x2013;151 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR7\" id=\"ref-link-section-d69842288e5591\" rel=\"nofollow noopener\" target=\"_blank\">7<\/a> studies. Details are given in the <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">Supplementary Note<\/a>.<\/p>\n<p>GWAS of higher taxa<\/p>\n<p>We also performed GWAS of 455 genera, 106 families, 50 orders, 21 classes, 17 phyla and 3 superkingdoms. Relative abundances were created for these higher-level taxa by summation of their respective species-level relative abundances. The 364 taxa detected in 5\u201350% of samples in each cohort were analyzed using logistic regression (absence\/presence), and 288 taxa with prevalence &gt;50% were analyzed using RIN-transformed relative abundances and linear regression. Study-wide significance was considered at P\u2009&lt;\u20095.4\u2009\u00d7\u200910\u221211, the same level as for species.<\/p>\n<p>GWAS of functional modules<\/p>\n<p>Functional gut metabolic and gut\u2013brain modules were attributed to species that contained at least two-thirds of the genes needed for the functionality of that module. If an alternative reaction pathway within a module existed, only one such pathway was required. All reaction pathways were required for modules with fewer than four steps. Module abundances were defined as the sum of the relative abundances of all species in a module. Similar to the GWAS of the species, two modules detected in 5\u201350% of samples in each cohort were analyzed using logistic regression (absence\/presence) and 115 modules with prevalence &gt;50% were analyzed using RIN-transformed relative abundances and linear regression. Study-wide significance was considered at P\u2009&lt;\u20094.3\u2009\u00d7\u200910\u221210.<\/p>\n<p>Interaction analysis for ABO, secretor status and Lewis blood groups<\/p>\n<p>Blood groups A, B, AB and O were determined based on allele combinations of ABO genetic variants <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/snp\/?term=rs505922\" rel=\"nofollow noopener\" target=\"_blank\">rs505922<\/a> and <a href=\"http:\/\/www.ncbi.nlm.nih.gov\/snp\/?term=rs8176746\" rel=\"nofollow noopener\" target=\"_blank\">rs8176746<\/a> (ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 74\" title=\"Garvert, L. et al. The association between genetically determined ABO blood types and major depressive disorder. Psychiatry Res. 299, 113837 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR74\" id=\"ref-link-section-d69842288e5650\" rel=\"nofollow noopener\" target=\"_blank\">74<\/a>), secretor status based on FUT2 genetic variant <a href=\"http:\/\/www.ncbi.nlm.nih.gov\/snp\/?term=rs601338\" rel=\"nofollow noopener\" target=\"_blank\">rs601338<\/a> (ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 75\" title=\"Kim, J. et al. Relationship between ABO blood group alleles and pancreatic cancer is modulated by secretor (FUT2) genotype, but not Lewis antigen (FUT3) genotype. Cancer Epidemiol. Biomarkers Prev. 32, 1242&#x2013;1248 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR75\" id=\"ref-link-section-d69842288e5665\" rel=\"nofollow noopener\" target=\"_blank\">75<\/a>) and Lewis status (positive, negative) based on allele combinations of FUT3 variants <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/snp\/?term=rs812936\" rel=\"nofollow noopener\" target=\"_blank\">rs812936<\/a>, <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/snp\/?term=rs28362459\" rel=\"nofollow noopener\" target=\"_blank\">rs28362459<\/a> and <a href=\"http:\/\/www.ncbi.nlm.nih.gov\/snp\/?term=rs3894326\" rel=\"nofollow noopener\" target=\"_blank\">rs3894326<\/a> (ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 75\" title=\"Kim, J. et al. Relationship between ABO blood group alleles and pancreatic cancer is modulated by secretor (FUT2) genotype, but not Lewis antigen (FUT3) genotype. Cancer Epidemiol. Biomarkers Prev. 32, 1242&#x2013;1248 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR75\" id=\"ref-link-section-d69842288e5694\" rel=\"nofollow noopener\" target=\"_blank\">75<\/a>). Blood groups A and AB were combined into antigen A, and blood groups B and AB into antigen B. Mixed models were run for each cohort with species associated with ABO, FUT2 or FUT3\u2013FUT6 at the study-wide significance level as outcome using the lmer (for species assessed with linear regression in the GWAS) and glmer (for species assessed with logistic regression in the GWAS) functions of the lmerTest v.3.1-3 R package. The interaction between antigen (ABO A, B or Lewis) and secretor status was estimated with covariates sex, age, age2, plate and genetic PCs\u20091\u201310. First-degree relatedness, determined by KING<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 76\" title=\"Manichaikul, A. et al. Robust relationship inference in genome-wide association studies. Bioinformatics 26, 2867&#x2013;2873 (2010).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR76\" id=\"ref-link-section-d69842288e5709\" rel=\"nofollow noopener\" target=\"_blank\">76<\/a> kinship coefficient \u22650.177, was used as a random effect. For the logistic mixed models, random and fixed effects coefficients were optimized in the penalized iteratively reweighted least squares step (setting nAGQ\u2009=\u20090). Cohort-specific results were meta-analyzed with the rma function of the metafor v.4.4-0 R package using the fixed-effect inverse-variance weighted method. Study-wide significance was considered at Bonferroni-corrected P\u2009&lt;\u20093.3\u2009\u00d7\u200910\u22123.<\/p>\n<p>GWAS of GLP-1<\/p>\n<p>After overnight fasting, GLP-1 levels were measured in MDC and PPP-Botnia study participants (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">Supplementary Note<\/a>) before and 2\u2009h after a 75-g oral glucose load. GWAS of GLP-1 was performed in 2,588 people with fasting and 2,613 with 2-h GLP-1 in MDC, and in 926 people with fasting and 898 with 2-h GLP-1 in PPP-Botnia. GLP-1 levels were log-transformed before analysis. SNPTEST<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 77\" title=\"Marchini, J., Howie, B., Myers, S., McVean, G. &amp; Donnelly, P. A new multipoint method for genome-wide association studies by imputation of genotypes. Nat. Genet. 39, 906&#x2013;913 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR77\" id=\"ref-link-section-d69842288e5730\" rel=\"nofollow noopener\" target=\"_blank\">77<\/a> v.2.5.6 was used for genome-wide association analyses, using the frequentist score method adjusted for age, sex and the genetic PC1-4. Results were filtered based on MAF\u2009&gt;\u20090.01, Hardy-Weinberg equilibrium P\u2009&gt;\u20095\u2009\u00d7\u200910\u22127, and imputation info scores &gt;0.4. A fixed-effect meta-analysis was performed using GWAMA<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 78\" title=\"Magi, R. &amp; Morris, A. P. GWAMA: software for genome-wide association meta-analysis. BMC Bioinf. 11, 288 (2010).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR78\" id=\"ref-link-section-d69842288e5739\" rel=\"nofollow noopener\" target=\"_blank\">78<\/a>.<\/p>\n<p>Functional mapping<\/p>\n<p>Genetic variants associated with microbial alpha diversity or species at the genome-wide significant level were mapped to functional pathways using FUMA<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 26\" title=\"Watanabe, K., Taskesen, E., van Bochoven, A. &amp; Posthuma, D. Functional mapping and annotation of genetic associations with FUMA. Nat. Commun. 8, 1826 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR26\" id=\"ref-link-section-d69842288e5751\" rel=\"nofollow noopener\" target=\"_blank\">26<\/a> v.1.5.2. One (out of 2,353) variant without an rsID was removed. If a genetic variant was associated with several traits or was multiallelic, the trait or allele pair with the lowest P was used as input.<\/p>\n<p>Colocalization<\/p>\n<p>Pairwise colocalization analyses were performed to investigate whether microbial richness and the eight study-wide significant species colocalized in the identified study-wide significant loci and with sex hormone binding globulin, WHRadjBMI, LDL cholesterol, IBD, glucose and stool frequency. Details are provided in the <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">Supplementary Note<\/a>.<\/p>\n<p>Mendelian randomization<\/p>\n<p>We performed two-sample MR analyses to investigate bidirectional effects between specific species (C.\u2009saudiense, T.\u2009sanguinis, Intestinibacter sp9005540355) and BMI, WHR and LDL cholesterol. Details are provided in the <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">Supplementary Note<\/a>.<\/p>\n<p>Plasma metabolomics<\/p>\n<p>The plasma metabolomics analysis in SCAPIS has been described elsewhere<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 79\" title=\"Dekkers, K. F. et al. An online atlas of human plasma metabolite signatures of gut microbiome composition. Nat. Commun. 13, 5370 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR79\" id=\"ref-link-section-d69842288e5798\" rel=\"nofollow noopener\" target=\"_blank\">79<\/a> and in the <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">Supplementary Note<\/a>. Associations of genetic variants with plasma metabolites were analyzed using the same REGENIE pipeline as for the microbiome, adjusting for age, age2, sex, delivery batch and genetic PCs 1\u201310. Metabolites detected in fewer than 100 samples were excluded. Those detected in 5\u201350% of samples were analyzed by logistic regression, and those in \u226550% of samples were RIN-transformed and analyzed by linear regression. We report one lead SNP per study-wide locus; when several species were associated, we selected the lead SNP among those replicated in HUNT, prioritizing the lowest P\u2009value in Swedish cohorts. FDR correction (Benjamini\u2013Hochberg) of 5% was applied.<\/p>\n<p>Stool metabolomics<\/p>\n<p>To find stool metabolites associated with the study-wide significant loci, we downloaded GWAS of stool metabolites summary statistics (only P\u2009&lt;\u200910\u22125 available) from Zierer et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 80\" title=\"Zierer, J. et al. The fecal metabolome as a functional readout of the gut microbiome. Nat. Genet. 50, 790&#x2013;795 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR80\" id=\"ref-link-section-d69842288e5823\" rel=\"nofollow noopener\" target=\"_blank\">80<\/a> (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">16<\/a>) and lifted the genomic coordinates over to GRCh37 using Ensembl Variation 112 for variants with an rsID and <a href=\"https:\/\/genome.ucsc.edu\/cgi-bin\/hgLiftOver\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/genome.ucsc.edu\/cgi-bin\/hgLiftOver<\/a> for variants without an rsID. Genetic variants that could not be lifted over were removed (247 out of 46,765). We assessed the same lead variants per study-wide locus as described for the genetic association with plasma metabolites. A lookup was performed for genetic variants within 100\u2009kb of the locus region corresponding to the study-wide significant lead variant.<\/p>\n<p>Short-chain fatty acids<\/p>\n<p>In MOS, a panel of nine plasma SCFAs was measured<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 81\" title=\"Fristedt, R., Ruppert, V., Trower, T., Cooney, J. &amp; Landberg, R. Quantitation of circulating short-chain fatty acids in small volume blood samples from animals and humans. Talanta 272, 125743 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#ref-CR81\" id=\"ref-link-section-d69842288e5846\" rel=\"nofollow noopener\" target=\"_blank\">81<\/a>. Laboratory method for SCFA measurement is described in the <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">Supplementary Note<\/a>. The association of genetic variants with SCFAs was assessed with the same REGENIE pipeline as described above for the microbiome, with age, age2, sex, SCFA measurement batch and genetic PCs 1\u201310 as covariates. SCFAs were RIN-transformed and assessed using linear regression. We assessed the same lead SNPs per study-wide locus as described for the genetic association with plasma metabolites. FDR correction (Benjamini\u2013Hochberg) of 5% was applied.<\/p>\n<p>Reporting summary<\/p>\n<p>Further information on research design is available in the <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02512-2#MOESM2\" rel=\"nofollow noopener\" target=\"_blank\">Nature Portfolio Reporting Summary<\/a> linked to this article.<\/p>\n","protected":false},"excerpt":{"rendered":"Ethical considerations The current study has been approved by the Swedish Ethical Review Authority (DNR 2022-06137-01, DNR 2024-01992-02).&hellip;\n","protected":false},"author":2,"featured_media":467664,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[34],"tags":[2342,13114,258,8869,5713,13113,257,23710,97,3870,21730],"class_list":{"0":"post-467663","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-health","8":"tag-agriculture","9":"tag-animal-genetics-and-genomics","10":"tag-biomedicine","11":"tag-cancer-research","12":"tag-epidemiology","13":"tag-gene-function","14":"tag-general","15":"tag-genetic-association-study","16":"tag-health","17":"tag-human-genetics","18":"tag-microbiology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts\/467663","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/comments?post=467663"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts\/467663\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/media\/467664"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/media?parent=467663"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/categories?post=467663"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/tags?post=467663"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}