{"id":779,"date":"2025-07-17T16:09:18","date_gmt":"2025-07-17T16:09:18","guid":{"rendered":"https:\/\/www.newsbeep.com\/au\/779\/"},"modified":"2025-07-17T16:09:18","modified_gmt":"2025-07-17T16:09:18","slug":"combined-genome-wide-association-study-of-facial-traits-in-europeans-increases-explained-variance-and-improves-prediction","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/au\/779\/","title":{"rendered":"Combined genome-wide association study of facial traits in Europeans increases explained variance and improves prediction"},"content":{"rendered":"<p>C-GWAS and replication highlight 62 novel face loci<\/p>\n<p>The C-GWAS discovery phase included 11,662 individuals of European descent from five cohorts (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a> and see the \u201cMethods\u201d section): the Rotterdam Study (RS, N\u2009=\u20094242), the TwinsUK study (N\u2009=\u20091080), two US studies from Pennsylvania and Indiana (US-P: N\u2009=\u20091990 and US-I: N\u2009=\u2009784), and the Avon Longitudinal Study of Parents and their Children (ALSPAC, N\u2009=\u20093566). From digital 3D facial images, we mapped 44 a-priori defined facial landmarks from which 946 inter-landmark facial distances were obtained (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig1\" rel=\"nofollow noopener\" target=\"_blank\">1a<\/a> and Supplementary Table\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>) spanning eight regions of the face: forehead, eyes, upper nose (root and bridge), lower nose (wing and tip), mouth, upper cheek, lower cheek, and chin (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig1\" rel=\"nofollow noopener\" target=\"_blank\">1b<\/a>). Following General Procrustes analysis and outlier removal, the landmark coordinates were confirmed to be normally distributed across all cohorts (Bonferroni corrected p\u2009&gt;\u20090.05, see the \u201cMethods\u201d section), with specific examples from RS and TwinsUK illustrated in Supplementary Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>. The 946 distances, adjusted for covariate effects, were rank-normalized across all 5 cohorts and used as phenotypes in the subsequent genetic analyses (see the \u201cMethods\u201d section).<\/p>\n<p>Fig. 1: C-GWAS identified 188 genetic loci with study-wide significant face association.<a class=\"c-article-section__figure-link\" data-test=\"img-link\" data-track=\"click\" data-track-label=\"image\" data-track-action=\"view figure\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-61761-7\/figures\/1\" rel=\"nofollow noopener\" target=\"_blank\"><img decoding=\"async\" aria-describedby=\"Fig1\" src=\"https:\/\/www.newsbeep.com\/au\/wp-content\/uploads\/2025\/07\/41467_2025_61761_Fig1_HTML.png\" alt=\"figure 1\" loading=\"lazy\" width=\"685\" height=\"427\"\/><\/a><\/p>\n<p>a Localization of 44 landmarks on the face. b Eight facial regions and their corresponding landmarks. c QQ plots for C-GWAS combined p-values (one-sided), MinGWAS adjusted p-values (one-sided), and associated p-values of the most inflated single-trait GWAS (two-sided). Grey dots represent results from the GWAS for the facial trait L21_L23, which had the highest inflation among all single-trait GWASs. LDSC intercepts and \\({\\lambda }_{{{\\rm{G}}}{{\\rm{C}}}}\\) for the three types of results are indicated. The solid line indicates the expected distribution of p-values under the null hypothesis. d Venn diagram illustrates the overlap between the 188 loci identified by C-GWAS and the 440 loci reported by previous GWASs obtained by our literature survey. e Miami plots display the results of C-GWAS combined p-values (one-sided) and MinGWAS adjusted p-values (one-sided). Solid and dashed lines represent the study-wide significance (\\(5\\times {10}^{-8}\\)) and suggestive significance (\\(1\\times {10}^{-5}\\)) thresholds, respectively. Loci with the study-wide significance are highlighted using three types of marks, including purple dots for overlapping loci between C-GWAS and MinGWAS, green crosses for independent loci from C-GWAS or MinGWAS, and orange triangles for novel loci identified by C-GWAS that were not reported in previous GWASs. The 3D template facial image in this figure is adapted from White et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 82\" title=\"White, J. D. et al. MeshMonk: Open-source large-scale intensive 3D phenotyping. Sci. Rep. 9, 6085 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR82\" id=\"ref-link-section-d2722786e1099\" rel=\"nofollow noopener\" target=\"_blank\">82<\/a> published under an Open Access license (CC BY 4.0), see <a href=\"http:\/\/creativecommons.org\/licenses\/by\/4.0\/\" rel=\"nofollow noopener\" target=\"_blank\">http:\/\/creativecommons.org\/licenses\/by\/4.0\/<\/a>.<\/p>\n<p>C-GWAS was conducted in two stages. First, 432 pairs of GWAS meta-analyses for symmetrical distances were combined into 432 single-trait GWAS outputs. Next, these were combined with 82 additional GWAS meta-analyses for non-symmetrical distances (total 514) into one C-GWAS output (see the \u201cMethods\u201d section). SNP-based heritability of 514 facial traits GWAS was estimated using LD score regression<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 34\" title=\"Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291&#x2013;295 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR34\" id=\"ref-link-section-d2722786e1121\" rel=\"nofollow noopener\" target=\"_blank\">34<\/a> (LDSC) at an average of 0.23 and a range from 0.06 to 0.36 (Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>). Higher heritability was observed for the nose and forehead, whereas lower values for the mouth and cheek (Supplementary Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>). Notably, the mouth and chin exhibited significant heritability divergence between internal variation and distances to other facial regions, highlighting the composite nature of facial variation, driven by cranial structure and soft tissue thickness<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 35\" title=\"Qian, W. et al. Genetic evidence for facial variation being a composite phenotype of cranial variation and facial soft tissue thickness. J. Genet. Genom.=Yi chuan xue bao 49, 934&#x2013;942 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR35\" id=\"ref-link-section-d2722786e1131\" rel=\"nofollow noopener\" target=\"_blank\">35<\/a>.<\/p>\n<p>The C-GWAS method ensures that under the null hypothesis of no association, combined p-values follow a uniform distribution<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 31\" title=\"Xiong, Z. et al. Combining genome-wide association studies highlight novel loci involved in human facial variation. Nat. Commun. 13, 7832 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR31\" id=\"ref-link-section-d2722786e1141\" rel=\"nofollow noopener\" target=\"_blank\">31<\/a>, allowing the standard genome-wide significance threshold of 5\u2009\u00d7\u200910\u22128 to be used as the study-wide threshold. Similarly, minimal p-values from 514 single-trait GWASs were adjusted within C-GWAS (see the \u201cMethods\u201d section), referred to as MinGWAS, ensuring they also follow a uniform distribution under the null. This adjustment enables direct comparison between C-GWAS, MinGWAS, and single-trait GWAS results. We assessed statistical inflation using LDSC and the genomic control factor \u03bbGC (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig1\" rel=\"nofollow noopener\" target=\"_blank\">1c<\/a>). The LDSC intercept quantifies inflation from non-polygenic effects, such as population stratification. 514 single-trait GWASs showed no significant inflation (LDSC intercept\u2009&lt;\u20091.03), although \u03bbGC showed slight inflation with an average of 1.06 and a range from 1.03 to 1.08 (Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>), likely due to polygenic effects, consistent with previous GWAS findings for highly polygenic complex traits. Both MinGWAS and C-GWAS showed no LDSC inflation (LDSC intercept\u2009&lt;\u20091.01) but higher lambda values (C-GWAS \u03bbGC\u2009=\u20091.25, MinGWAS \u03bbGC\u2009=\u20091.21) due to the increased power of multi-trait analysis. After excluding all study-wide significant regions (\u00b1500\u2009kbp), \u03bbGC decreased but remained elevated (C-GWAS \u03bbGC\u2009=\u20091.19, MinGWAS \u03bbGC\u2009=\u20091.16, Supplementary Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">3<\/a>), suggesting persistent minor polygenic effects even after removing significant regions.<\/p>\n<p>Compared with our previous face C-GWAS of 78 traits obtained from 13 landmarks in 10,115 individuals<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 31\" title=\"Xiong, Z. et al. Combining genome-wide association studies highlight novel loci involved in human facial variation. Nat. Commun. 13, 7832 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR31\" id=\"ref-link-section-d2722786e1193\" rel=\"nofollow noopener\" target=\"_blank\">31<\/a>, the current C-GWAS of 946 facial traits from 44 landmarks in 11,662 individuals gained 184% extra statistical power as estimated using the increase in the mean \u03c72 statistic method described in Turley et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 32\" title=\"Turley, P. et al. Multi-trait analysis of genome-wide association summary statistics using MTAG. Nat. Genet. 50, 229&#x2013;237 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR32\" id=\"ref-link-section-d2722786e1201\" rel=\"nofollow noopener\" target=\"_blank\">32<\/a>. C-GWAS identified 188 distinct genetic loci (separated by &gt;500\u2009kb) at study-wide significance (p\u2009&lt;\u20095\u2009\u00d7\u200910\u22128), four times more than MinGWAS (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig1\" rel=\"nofollow noopener\" target=\"_blank\">1e<\/a>). As the significance of C-GWAS results increases, the proportion of C-GWAS results surpassing MinGWAS rises (Supplementary Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">4<\/a>). Specifically, when the C-GWAS p-value is below 10\u22125, over 95% of the results are more significant than MinGWAS, and increases to over 99% at study-wide significance. One of the abundant examples, rs970797 at 2q31.1 MTX2 showing p\u2009=\u20092.86\u2009\u00d7\u200910\u221293 with C-GWAS, while p\u2009=\u20097.9\u2009\u00d7\u200910\u221221 with MinGWAS. These findings suggest that most face-associated SNPs affect multiple facial dimensions simultaneously, rather than being specific to a single trait.<\/p>\n<p>Of the 188 identified genetic loci, 62 (33%) were novel and had not been reported in previous facial GWASs. Among the remaining 126 loci overlapping with previous face GWASs, 26 (20%) were previously genome-wide but not study-wide significant, confirming the increased power of our approach (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig1\" rel=\"nofollow noopener\" target=\"_blank\">1d<\/a>). Among the 188 loci, in addition to the top SNPs at each locus, 41 (21.8%) hosted 65 additional independent study-wide significant SNPs identified through conditional analysis using a modified version of GCTA-COJO<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 36\" title=\"Yang, J. et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat. Genet. 44, 369&#x2013;375 (2012). S361-363.\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR36\" id=\"ref-link-section-d2722786e1243\" rel=\"nofollow noopener\" target=\"_blank\">36<\/a> (Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">3<\/a>, see the \u201cMethods\u201d section), resulting in a total of 253 lead SNPs (Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">4<\/a>). Of these, 97 (38%) were novel SNPs, with 64 located in 62 novel loci and 33 located in 29 previously reported facial loci but in low LD (r2\u2009&lt;\u20090.1) with the previously reported top SNPs at these loci. The 2q36.1 PAX3 and 14q32.2 BCL11B harboured the largest number of independently associated SNPs, with 6 and 5 SNPs, respectively (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig2\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>). These findings reinforce that multiple independent lead SNPs within the same locus can distinctly influence diverse facial attributes<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 21\" title=\"White, J. D. et al. Insights into the genetic architecture of the human face. Nat. Genet. 53, 45&#x2013;53 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR21\" id=\"ref-link-section-d2722786e1267\" rel=\"nofollow noopener\" target=\"_blank\">21<\/a>.<\/p>\n<p>Fig. 2: Multiple independent lead SNPs within the same face-associated locus.<a class=\"c-article-section__figure-link\" data-test=\"img-link\" data-track=\"click\" data-track-label=\"image\" data-track-action=\"view figure\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-61761-7\/figures\/2\" rel=\"nofollow noopener\" target=\"_blank\"><img decoding=\"async\" aria-describedby=\"Fig2\" src=\"https:\/\/www.newsbeep.com\/au\/wp-content\/uploads\/2025\/07\/41467_2025_61761_Fig2_HTML.png\" alt=\"figure 2\" loading=\"lazy\" width=\"685\" height=\"306\"\/><\/a><\/p>\n<p>Detailed illustration for two regional Manhattan plots, a for 5 lead SNPs at 14q32.2, and b for 6 lead SNPs at 2q36.1, was presented in three layers. The top layer displayed the distribution and intensity of effects from each lead SNP on the face using the regional percentage of variance explained (regional-PVE, see the \u201cMethods\u201d section) across 8 facial regions. The middle layer is a zoomed view of the C-GWAS combined p-values (one-sided), with multiple lead SNPs and their LD counterparts (p\u2009&lt;\u20091\u2009\u00d7\u200910\u22125) highlighted in different colours. Each lead SNP is annotated with its rsID and represented by a diamond shape. Protein-coding genes (black texts) and non-coding RNAs (grey texts) in the same region are annotated below the plots according to their chromosomal positions (GRCh37). The bottom layer displays the LD (r2) patterns of multiple lead SNPs and their LD counterparts. LDs are calculated in RS, and hierarchical clustering is applied. Multiple lead SNPs are marked with arrows corresponding to their colours. The 3D template facial image in this figure is adapted from White et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 82\" title=\"White, J. D. et al. MeshMonk: Open-source large-scale intensive 3D phenotyping. Sci. Rep. 9, 6085 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR82\" id=\"ref-link-section-d2722786e1302\" rel=\"nofollow noopener\" target=\"_blank\">82<\/a> published under an Open Access license (CC BY 4.0), see <a href=\"http:\/\/creativecommons.org\/licenses\/by\/4.0\/\" rel=\"nofollow noopener\" target=\"_blank\">http:\/\/creativecommons.org\/licenses\/by\/4.0\/<\/a>.<\/p>\n<p>Intraclass correlation coefficient (ICC) analysis of allelic effects demonstrated high consistency across the five European cohorts for both, previously known and novel lead SNPs (Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">5<\/a>), with a specific example of effects on nasion width from RS and ALSPAC illustrated in Supplementary Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">5<\/a>. MAMBA<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 37\" title=\"McGuire, D. et al. Model-based assessment of replicability for genome-wide association meta-analysis. Nat. Commun. 12, 1964 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR37\" id=\"ref-link-section-d2722786e1330\" rel=\"nofollow noopener\" target=\"_blank\">37<\/a> analysis further assessing posterior probability of replicability (PPR) across all five European cohorts revealed high PPR values approaching 1 (Supplementary Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">6<\/a>). The associations with more significant meta-analysis p-values corresponded to higher PPR values, further corroborating the robustness of our findings. External cross-ancestry replication analyses in 9674 Chinese individuals revealed high replication rates of 69.8% in all lead SNPs and 39.6% in the novel lead SNPs (FDR corrected p\u2009&lt;\u20090.05, Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">4<\/a>). Further details on covariate impact and pan-ethnic effects of lead SNPs are provided in Supplementary Notes\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>.<\/p>\n<p>Biological annotations and pleiotropy of face-associated SNPs<\/p>\n<p>A stratified LDSC<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 38\" title=\"Finucane, H. K. et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat. Genet. 47, 1228&#x2013;1235 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR38\" id=\"ref-link-section-d2722786e1361\" rel=\"nofollow noopener\" target=\"_blank\">38<\/a> of C-GWAS summary statistics across various cell and tissue types identified 15 significantly enriched cell types (Bonferroni threshold p\u2009=\u20094.9\u2009\u00d7\u200910\u22124, Supplementary Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">7<\/a>, Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">6<\/a>). The most substantial enrichments were notably observed in adult dermal fibroblast primary cells (p\u2009=\u20093.3\u2009\u00d7\u200910\u221211) and cranial neural crest cells (CNCC, p\u2009=\u20092.1\u2009\u00d7\u200910\u221210). A multi-omics integration analysis (CNCC regulatory network<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 39\" title=\"Feng, Z. et al. hReg-CNCC reconstructs a regulatory network in human cranial neural crest cells and annotates variants in a developmental context. Commun. Biol. 4, 442 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR39\" id=\"ref-link-section-d2722786e1387\" rel=\"nofollow noopener\" target=\"_blank\">39<\/a>, multi-tissue eQTLs and chromatin interactions) revealed 365 candidate genes potentially functionally affecting human facial variation (see the \u201cMethods\u201d section, Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">4<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">7<\/a>). GO enrichment analysis of these genes highlighted 177 significant GO-term biological processes (Bonferroni corrected p\u2009&lt;\u20090.001, Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">8<\/a>) clustered into 16 groups (Supplementary Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">8<\/a> and Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">9<\/a>), covering not only well-established craniofacial morphogenesis processes, such as embryo, skeleton, limb, bone, ear, brain, neuron, eye, differentiation, growth, and the Wnt pathway, but also processes in the key cell types highly relevant to CNCC formation including epithelium, mesenchyme, and mesoderm (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig3\" rel=\"nofollow noopener\" target=\"_blank\">3a<\/a>). Additional enrichment in seemingly unrelated processes like vasculum, heart, muscle, urogenital, and digestive systems further emphasize a pleiotropic nature of facial genetic factors. Further details on contributing cell types and candidate gene robustness are provided in the Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">10<\/a>, Supplementary Notes\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">3<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">4<\/a>.<\/p>\n<p>Fig. 3: Enriched biological processes and four clusters of face candidate genes.<a class=\"c-article-section__figure-link\" data-test=\"img-link\" data-track=\"click\" data-track-label=\"image\" data-track-action=\"view figure\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-61761-7\/figures\/3\" rel=\"nofollow noopener\" target=\"_blank\"><img decoding=\"async\" aria-describedby=\"Fig3\" src=\"https:\/\/www.newsbeep.com\/au\/wp-content\/uploads\/2025\/07\/41467_2025_61761_Fig3_HTML.png\" alt=\"figure 3\" loading=\"lazy\" width=\"685\" height=\"630\"\/><\/a><\/p>\n<p>a The figure details the enriched biological processes and the specificity of 4 face candidate gene sets annotated by 253 lead SNPs. The outer bar chart contains 177 biological processes with significant enrichment Bonferroni corrected p-values (one-sided hypergeometric test), divided into 16 categories. The dashed line indicates the significance threshold (p\u2009=\u20090.001). The inner radar chart reflects the mean fold enrichment of genes in each gene set across the biological processes in each of the 16 categories. Different colours denote different gene sets. The dashed line represents the mean fold enrichment under the null distribution (no enrichment). The outer ring annotations include the titles of the 16 categories, the top three most significant enriched biological processes in each category, and biological processes in the top 5% of fold enrichment of genes in 4 gene sets across the 177 biological processes, highlighted in the corresponding colour of the gene set. b Hierarchical clustering of 225 face candidate gene sets annotated by 253 lead SNPs, identified 4 clusters based on their effects on facial regions (in terms of regional-PVE across 8 facial regions, see the \u201cMethods\u201d section). Face maps showed the total effects of the annotated genes on the face for gene sets corresponding to four clusters. The 3D template facial image in this figure is adapted from White et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 82\" title=\"White, J. D. et al. MeshMonk: Open-source large-scale intensive 3D phenotyping. Sci. Rep. 9, 6085 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR82\" id=\"ref-link-section-d2722786e1443\" rel=\"nofollow noopener\" target=\"_blank\">82<\/a> published under an Open Access license (CC BY 4.0), see <a href=\"http:\/\/creativecommons.org\/licenses\/by\/4.0\/\" rel=\"nofollow noopener\" target=\"_blank\">http:\/\/creativecommons.org\/licenses\/by\/4.0\/<\/a>.<\/p>\n<p>Hierarchical clustering analysis revealed four distinct sets of candidate genes predominantly affecting different sets of facial regions (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig3\" rel=\"nofollow noopener\" target=\"_blank\">3b<\/a> and Supplementary Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">9<\/a>) and showed specificity in biological processes (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig3\" rel=\"nofollow noopener\" target=\"_blank\">3a<\/a> and Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">9<\/a>). For example, genes influencing the nose were more enriched in limb morphogenesis, while those affecting the chin and mouth were more enriched in morphogenesis of branching structures, particularly axons. These findings offer a preliminary map of how embryonic development, driven by shared genetic factors, potentially influences distinct facial features. The extensive pleiotropy was further supported by a GWAS Catalog look-up, where 144 (57%) of the 253 lead SNPs, or their high LD counterparts (r2\u2009&gt;\u20090.6), were associated with 13 phenotypic categories of non-facial traits (Supplementary Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">10<\/a>). The top-associated categories are anthropometric, brain imaging, metabolic and appearances traits, such as height, waist-to-hip ratio, sulcal depth, cortical surface area, total testosterone levels, contactin-2 levels, male pattern baldness, and hair colour (Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">11<\/a>). These results collectively suggest that the phenotypic spectrum of face-associated SNPs is more extensive than previously may have thought.<\/p>\n<p>Enhanced proportion of genetically explained facial variance and facial PRS profiles<\/p>\n<p>Across the 514 facial traits, the combined proportion of variance explained (PVE) from all 253 lead SNPs together ranged from 2.25% to 7.89% (Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>), a 2.25-fold average increase from our earlier study<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 3\" title=\"Xiong, Z. et al. Novel genetic loci affecting facial shape variation in humans. Elife 8, e49898 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR3\" id=\"ref-link-section-d2722786e1499\" rel=\"nofollow noopener\" target=\"_blank\">3<\/a>, which were based on 31 SNPs and 78 facial traits and ranged from 0.65% to 4.62%. To further explore the explanatory power of the lead SNPs, we extended the analysis to assess PVE across the entire face (face-PVE, including all 514 traits) and within specific facial regions (regional-PVE, including up to 32 traits per region, see the \u201cMethods\u201d section). Combining 253 lead SNPs resulted in a face-PVE of 4.5%, with the highest regional-PVE in the upper nose (6.0%) and lower nose (5.6%), and the lowest in the lower cheek (3.2%) and mouth (3.6%) (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig4\" rel=\"nofollow noopener\" target=\"_blank\">4a<\/a>). Notably, the novel SNPs tended to have more widespread effects across multiple facial regions rather than being confined to a specific area (Supplementary Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">11<\/a>). Our comprehensive mapping of genetic effects on distinct facial dimensions highlighting the polygenic and multidimensional nature of facial shape (Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">12<\/a>). Detailed single-trait PVE, face-PVE and regional-PVE analyses are provided in the Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">5<\/a>.<\/p>\n<p>Fig. 4: Facial effect of face-associated SNPs with a focus on the nose.<a class=\"c-article-section__figure-link\" data-test=\"img-link\" data-track=\"click\" data-track-label=\"image\" data-track-action=\"view figure\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-61761-7\/figures\/4\" rel=\"nofollow noopener\" target=\"_blank\"><img decoding=\"async\" aria-describedby=\"Fig4\" src=\"https:\/\/www.newsbeep.com\/au\/wp-content\/uploads\/2025\/07\/41467_2025_61761_Fig4_HTML.png\" alt=\"figure 4\" loading=\"lazy\" width=\"685\" height=\"613\"\/><\/a><\/p>\n<p>a Illustration of the total effects of 253 lead SNPs on the face, based on regional-PVE across 8 facial regions. The 3D template facial image in a is adapted from White et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 82\" title=\"White, J. D. et al. MeshMonk: Open-source large-scale intensive 3D phenotyping. Sci. Rep. 9, 6085 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR82\" id=\"ref-link-section-d2722786e1533\" rel=\"nofollow noopener\" target=\"_blank\">82<\/a> published under an Open Access license (CC BY 4.0), see <a href=\"http:\/\/creativecommons.org\/licenses\/by\/4.0\/\" rel=\"nofollow noopener\" target=\"_blank\">http:\/\/creativecommons.org\/licenses\/by\/4.0\/<\/a>. b Enlarged frontal and lateral views of the nose with 14 landmark points highlighted in c and d, and Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">6<\/a>. The scale indicates the p-value of deviation of the landmark from the average in c and d, which is derived from the association test between PRS and landmark coordinates using linear regression. PRS analysis and its effects on the nose are illustrated for: c for L21\u2013L24 and L23\u2013L26, and d for L7\u2013L8 and L8\u2013L9. Each figure is divided into three sections: PRS effects on the nose (left, enlarged nasal view), population profiling (middle, violin plots), and gene contributions (right). In the nasal views, landmarks significantly associated with standardized PRS are marked with orange dots (3 times positive effect offset) or blue dots (3 times negative effect offset). Connecting lines visually represent nasal variation, with scaled distances shown in the top left. Nasal images reconstructed at mean PRS\u2009\u00b1\u20093\u2009SD are framed in orange (+3SD) and blue (\u22123SD) and generated using a 3D graph auto-encoder (see the \u201cMethods\u201d section). The overall effect of increasing PRS is described in text at the top left. The violin plots show standardized PRS results for four major populations from the 1000 Genomes Project (African, AFR, N\u2009=\u2009504; European, EUR, N\u2009=\u2009504; East Asian, EAS, N\u2009=\u2009503; South Asian, SAS, N\u2009=\u2009489), with the band indicating the median, the box representing the first and third quartiles, and whiskers extending 1.5 times the interquartile range. Gene contribution plots on the right list the top regions accounting for 50% of PRS variance. The 3D template facial image in b\u2013d was generated from the average covariate-adjusted facial shape of all RS participants (N\u2009=\u20094242).<\/p>\n<p>Building on these findings, we constructed polygenic risk score (PRS) profiles for 2000 individuals across four major continental populations (African, AFR; European, EUR; East Asian, EAS; South Asian, SAS) using data from the 1000 Genomes Project<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 40\" title=\"Genomes Project C et al. A global reference for human genetic variation. Nature 526, 68&#x2013;74 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR40\" id=\"ref-link-section-d2722786e1603\" rel=\"nofollow noopener\" target=\"_blank\">40<\/a>, excluding admixed samples from AFR and the American population. The PRS was based on 382 face-associated SNPs, including 253 lead SNPs from our C-GWAS and 129 SNPs from previous facial shape GWASs, for 23 most genetically explainable and independent facial traits (see the \u201cMethods\u201d section, Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">13<\/a>). Facial PRS profiles largely agree with established anthropological knowledge concerning nose shape variation among continental populations (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig4\" rel=\"nofollow noopener\" target=\"_blank\">4b\u2013d<\/a>, Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">14<\/a> and Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">6<\/a>). A comparative analysis of computed tomography scans of 388 adults of AFR, Asian (ASN), and EUR ancestry<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 41\" title=\"Simmons-Ehrhardt, T. L., Monson, K. L., Flint, T. &amp; Saunders, C. P. Quantitative accuracy and 3D biometric matching of 388 statistically estimated facial approximations of live subjects. Forensic Imaging 21, 200377 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR41\" id=\"ref-link-section-d2722786e1620\" rel=\"nofollow noopener\" target=\"_blank\">41<\/a> further confirmed significant correlations between mean differences in facial PRS profiles and phenotypic traits between EUR and AFR (Pearson\u2019s r\u2009=\u20090.75), as well as between EUR and ASN (r\u2009=\u20090.81) (Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">7<\/a>). Specifically, considering the average facial PRS profiles of EUR as reference (z\u2009=\u20090), AFR showed notably smaller nose root (z\u2009=\u2009\u22121.86), less protruded (z\u2009=\u2009\u22121.76) but more upturned (z\u2009=\u20090.74) nose tips, alongside broader nose wings (z\u2009=\u20090.8) and shorter nose bridges (z\u2009=\u2009\u22120.82). EAS were characterized by significantly smaller (z\u2009=\u2009\u22121.77) and less (z\u2009=\u2009\u22122.42) protruded nose tips, along with smaller nose root (z\u2009=\u2009\u22121.12). Meanwhile, SAS displayed similar nose shapes to EUR (all |z\u2009|\u2009&lt;0.5), which is remarkable given their closer geographic proximity to EAS. The latter finding is in line with the study of Zaidi et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 42\" title=\"Zaidi, A. A., Mattern, B. C., Claes, P., McEvoy, B., Hughes, C. &amp; Shriver, M. D. Investigating the case of human nose shape and climate adaptation. PLoS Genet. 13, e1006616 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR42\" id=\"ref-link-section-d2722786e1665\" rel=\"nofollow noopener\" target=\"_blank\">42<\/a> finding that the average nose differences between SAS and EUR were smaller than other intercontinental comparisons. Furthermore, our findings are largely consistent with numerous facial photogrammetric studies reviewed by Wen et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 5\" title=\"Wen, Y. F., Wong, H. M., Lin, R., Yin, G. &amp; McGrath, C. Inter-ethnic\/racial facial variations: a systematic review and bayesian meta-analysis of photogrammetric studies. PLoS ONE 10, e0134525 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR5\" id=\"ref-link-section-d2722786e1669\" rel=\"nofollow noopener\" target=\"_blank\">5<\/a>, reinforcing the correlation between genetic facial profiles and physical facial anthropometry.<\/p>\n<p>Re-identification of individuals from 3D images with facial PRS profiles<\/p>\n<p>Motivated by these findings, we explored the feasibility of re-identification of individuals\u2019 images from their facial PRS profiles, focusing on the nose region, which manifested the highest genetically explainable phenotypic variance (Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">14<\/a>). This was achieved by calculating the cosine similarity scores between PRS profiles of the nose (referred to as nose PRS profile) and 3D image-derived phenotypic nose profiles (referred to as nose 3D profiles) in the RS. When focusing on the top 1% of the nose 3D profiles that most closely mirrored an individual\u2019s nose PRS profile, we observed a 3.4% probability of accurately identifying the correct individual\u2019s nose 3D profile based on its nose PRS profile (cumulative matching characteristic, CMC1%\u2009=\u20093.4%, that is, with a 1% selection threshold, the matching probability was 3.4%). The matching accuracy significantly improved with less stringent selection thresholds, achieving 24% at CMC10% and 73% at CMC50% (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">5a<\/a>).<\/p>\n<p>Fig. 5: Individual re-identification using phenotypic and PRS nose profiles.<a class=\"c-article-section__figure-link\" data-test=\"img-link\" data-track=\"click\" data-track-label=\"image\" data-track-action=\"view figure\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-61761-7\/figures\/5\" rel=\"nofollow noopener\" target=\"_blank\"><img decoding=\"async\" aria-describedby=\"Fig5\" src=\"https:\/\/www.newsbeep.com\/au\/wp-content\/uploads\/2025\/07\/41467_2025_61761_Fig5_HTML.png\" alt=\"figure 5\" loading=\"lazy\" width=\"685\" height=\"474\"\/><\/a><\/p>\n<p>a Cumulative Match Characteristic (CMC) curve for the individual re-identification model using the PRS of the five nose traits, as detailed in Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig4\" rel=\"nofollow noopener\" target=\"_blank\">4<\/a> and Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">6<\/a>. The curve is based on 50 rounds of 10-fold cross-validation, highlighting CMC1%, CMC10%, and CMC50% in orange crosses and labelled in the top left. b Receiver operating characteristic (ROC) curve for the individual re-identification model using the PRS of the same five nose traits, based on 50 rounds of 10-fold cross-validation. The average AUC is labelled on the top left. c Distribution of AUC values across different prediction validation scenarios in two European cohorts, RS and TwinsUK. The density plot shows the distribution of AUC null values obtained through 10,000 replicates, where each replicate involved PRS constructed from a number and MAF-matched SNPs randomly selected across the genome. AUC values achieved by our PRS models under different validation scenarios were compared with the null distribution using arrowed lines.<\/p>\n<p>Utilizing a binary variable of true and false matches, the similarity score achieved a moderate AUC of 0.67 (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">5b<\/a>, see the \u201cMethods\u201d section). Because RS was part of the C-GWAS discovery dataset, aiming to establish a benchmark, we generated null AUC values through 10,000 replicates, where each replicate employed PRS constructed from randomly selected SNPs that were matched by number and MAF. The AUC of 0.67 in RS significantly exceeded the null distribution, confirming that our matching accuracy surpasses expectations by random chance (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">5c<\/a>). Next, we employed the TwinsUK data in this re-identification analysis. Although the TwinsUK dataset was also part of the C-GWAS discovery dataset, we assessed the potential decrease in AUC when applying the PRS to TwinsUK as a separate dataset. By considering 216 SNPs that overlapped between both datasets, the internal validation within the reference sample of RS yielded an AUC of 0.62. When this PRS was then applied to the TwinsUK dataset, we obtained an AUC of 0.60 (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">5c<\/a>). This small AUC reduction of 0.02 suggests a negligible winner\u2019s curse effect within the training cohort (i.e., RS), as confirmed by additional simulation analyses (Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">8<\/a>). The significant deviation from the null distribution of both AUC outcomes, from RS and TwinsUK, implies that our genetic model captures meaningful effects beyond random noise. While the AUC values obtained here may seem modest, it is important to emphasize that these results were achieved in homogeneous populations and based solely on SNP data for nose traits alone. Expanding the number of genetically predictable facial phenotypes beyond the nose, even if each has limited accuracy, could collectively improve the overall accuracy in individual re-identification.<\/p>\n<p>Face-associated SNPs experienced positive selection in Europeans<\/p>\n<p>Among the 253 lead SNPs identified in our facial C-GWAS, 69% exhibited a frequency difference &gt;\u20090.2 among EUR, EAS, and AFR, as per the 1000 Genomes Project data (Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">4<\/a>). This proportion is significantly higher than the average observed in random samples of the same number of SNPs across the genome after 10,000 replicates (p\u2009=\u20090.023, Supplementary Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">12<\/a>). The average of fixation index (FST), which reflects genetic variation between groups and can indicate local positive selection, was significantly higher between EUR and EAS (p\u2009=\u20090.017) and EUR and AFR (p\u2009=\u20090.011) than expected from randomly sampled SNPs (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig6\" rel=\"nofollow noopener\" target=\"_blank\">6a<\/a>). However, this pattern was not observed in the EAS-AFR comparison (p\u2009=\u20090.32, Supplementary Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">12<\/a>). Moreover, the mean Population Branch Statistic (PBS) for the 253 lead SNPs compared to randomly selected SNPs was statistically significant in EUR (p\u2009&lt;\u20090.01, Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig6\" rel=\"nofollow noopener\" target=\"_blank\">6a<\/a>), but not in EAS (p\u2009=\u20090.44) and AFR (p\u2009=\u20090.25). These findings suggest a more pronounced influence of positive selection on face-associated SNPs in Europeans.<\/p>\n<p>Fig. 6: Face-associated SNPs under positive selection in Europeans.<a class=\"c-article-section__figure-link\" data-test=\"img-link\" data-track=\"click\" data-track-label=\"image\" data-track-action=\"view figure\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-61761-7\/figures\/6\" rel=\"nofollow noopener\" target=\"_blank\"><img decoding=\"async\" aria-describedby=\"Fig6\" src=\"https:\/\/www.newsbeep.com\/au\/wp-content\/uploads\/2025\/07\/41467_2025_61761_Fig6_HTML.png\" alt=\"figure 6\" loading=\"lazy\" width=\"685\" height=\"329\"\/><\/a><\/p>\n<p>a Simulations analyses show the positive selection. Grey bars and curves show the expected distribution density from 10,000 simulations under the null hypothesis. The dashed line indicates the one-sided 0.95 cumulative density threshold. Observed values from 253 lead SNPs are highlighted in orange with one-sided empirical p-values. For illustration of the 2q12.1 region (b) and 9p22.3-p22.2 region c under natural selection, Left: C-GWAS combined p-values within 250\u2009kb of the top SNP plotted against PBS results in EUR. Dashed lines indicate suggestive significance threshold for C-GWAS and the top 1% threshold for PBS, respectively. The top SNP is highlighted with a purple diamond, and LD counterparts are in colour scale. The SNP with both a significant facial effect and a selection signal is annotated. Middle: World map of allele frequency for annotated SNPs based on 1000 Genomes Project sub-populations. Base map is from Natural Earth data v1.4.0 (public domain). Right: Regional-PVE effects of annotated SNP. The 3D template facial image in this figure is adapted from White et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 82\" title=\"White, J. D. et al. MeshMonk: Open-source large-scale intensive 3D phenotyping. Sci. Rep. 9, 6085 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR82\" id=\"ref-link-section-d2722786e1810\" rel=\"nofollow noopener\" target=\"_blank\">82<\/a> published under an Open Access license (CC BY 4.0), see <a href=\"http:\/\/creativecommons.org\/licenses\/by\/4.0\/\" rel=\"nofollow noopener\" target=\"_blank\">http:\/\/creativecommons.org\/licenses\/by\/4.0\/<\/a>.<\/p>\n<p>Although our targeted SNPs may have higher MAF in EUR because they were identified through a European-based C-GWAS, a recent large-scale face GWAS conducted in East Asians<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 22\" title=\"Zhang, M. et al. Genetic variants underlying differences in facial morphology in East Asian and European populations. Nat. Genet. 54, 403&#x2013;411 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR22\" id=\"ref-link-section-d2722786e1832\" rel=\"nofollow noopener\" target=\"_blank\">22<\/a> identified 244 face-associated SNPs and found evidence of positive selection on these SNPs in Europeans. This suggests that facial differences between Europeans and East Asians, such as more protruded and narrow noses in Europeans, may result from adaptation and positive selection in European populations. The consistency of findings across independent GWAS in different continental populations indicates these results are not merely artefacts of GWAS-based allele frequency biases.<\/p>\n<p>Of the 188 genetic loci where\u00a0the 253 lead SNPs are located, 22 displayed strong signals of positive selection (top 1% of the genome, Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">15<\/a>), highlighting their importance in the evolutionary history of facial variation in modern humans. Among these, two loci were at the extreme tail of statistical significance: 2q12.1, containing POU3F3 primarily associated with the nose region, and 9p22.3-p22.2 containing BNC2 primarily associated with the chin region (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig6\" rel=\"nofollow noopener\" target=\"_blank\">6b and c<\/a>). Notably, BNC2 was previously identified as a skin-colour gene based on association<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 43\" title=\"Jacobs, L. C. et al. Comprehensive candidate gene study highlights UGT1A and BNC2 as new genes determining continuous skin color variation in Europeans. Hum. Genet. 132, 147&#x2013;158 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR43\" id=\"ref-link-section-d2722786e1855\" rel=\"nofollow noopener\" target=\"_blank\">43<\/a> and functional evidence<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 44\" title=\"Visser, M., Palstra, R. J. &amp; Kayser, M. Human skin color is influenced by an intergenic DNA polymorphism regulating transcription of the nearby BNC2 pigmentation gene. Hum. Mol. Genet. 23, 5750&#x2013;5762 (2014).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR44\" id=\"ref-link-section-d2722786e1859\" rel=\"nofollow noopener\" target=\"_blank\">44<\/a>, and is located in a genomic region of Neanderthal ancestry (introgressed segments)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 45\" title=\"Gittelman, R. M. et al. Archaic hominin admixture facilitated adaptation to out-of-Africa environments. Curr. Biol.26, 3375&#x2013;3382 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR45\" id=\"ref-link-section-d2722786e1863\" rel=\"nofollow noopener\" target=\"_blank\">45<\/a>.<\/p>\n<p>Facial PRS profiles of Neanderthals differ more from Europeans than from Africans<\/p>\n<p>Next, we explored whether face-associated SNPs identified via C-GWAS in Europeans could help genetically reconstruct facial traits in archaic humans. Modern human populations outside Africa carry ~1.5\u20132.1% Neanderthal-derived ancestry<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 46\" title=\"Green, R. E. et al. A draft sequence of the Neandertal genome. Science 328, 710&#x2013;722 (2010).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR46\" id=\"ref-link-section-d2722786e1875\" rel=\"nofollow noopener\" target=\"_blank\">46<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 47\" title=\"Prufer, K. et al. The complete genome sequence of a Neanderthal from the Altai Mountains. Nature 505, 43&#x2013;49 (2014).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR47\" id=\"ref-link-section-d2722786e1878\" rel=\"nofollow noopener\" target=\"_blank\">47<\/a>. We annotated our 253 lead SNPs with previously identified introgressed segments of Neanderthal in the genomes of EUR and EAS<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Sankararaman, S. et al. The genomic landscape of Neanderthal ancestry in present-day humans. Nature 507, 354&#x2013;357 (2014).\" href=\"#ref-CR48\" id=\"ref-link-section-d2722786e1882\">48<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Vernot, B. &amp; Akey, J. M. Resurrecting surviving Neandertal lineages from modern human genomes. Science 343, 1017&#x2013;1021 (2014).\" href=\"#ref-CR49\" id=\"ref-link-section-d2722786e1882_1\">49<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 50\" title=\"Yuan, K. et al. Refining models of archaic admixture in Eurasia with ArchaicSeeker 2.0. Nat. Commun. 12, 6232 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR50\" id=\"ref-link-section-d2722786e1885\" rel=\"nofollow noopener\" target=\"_blank\">50<\/a>. On average, 65 SNPs (25.7%) in EUR and 57 SNPs (22.5%) in EAS overlapped with Neanderthal introgressed segments, including 30 SNPs (11.9%) shared between both populations (Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">16<\/a>). Lead SNPs in EUR-specific segments predominantly influenced the upper nose and forehead, while those in EAS-specific segments primarily affected the chin, and shared lead SNPs were mostly associated with variation in the lower nose (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig7\" rel=\"nofollow noopener\" target=\"_blank\">7a<\/a> and Supplementary Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">13<\/a>). This pattern suggests that face-associated SNPs located in Neanderthal introgressed segments have shaped facial variation differently across modern human populations, for instance, prominent brow ridges in EUR<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 51\" title=\"Weaver, T. D. Out of Africa: modern human origins special feature: the meaning of neandertal skeletal morphology. Proc. Natl Acad. Sci. USA 106, 16028&#x2013;16033 (2009).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR51\" id=\"ref-link-section-d2722786e1899\" rel=\"nofollow noopener\" target=\"_blank\">51<\/a> and flatter chin structures in EAS<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 52\" title=\"Bergmann, I., Hublin, J. J., Gunz, P. &amp; Freidline, S. E. How did modern morphology evolve in the human mandible? The relationship between static adult allometry and mandibular variability in Homo sapiens. J. Hum. Evol. 157, 103026 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR52\" id=\"ref-link-section-d2722786e1903\" rel=\"nofollow noopener\" target=\"_blank\">52<\/a>. Distinct genetic loci contributed to these population-specific introgression effects (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig7\" rel=\"nofollow noopener\" target=\"_blank\">7b<\/a>). Notably, chromosome 6p21.1 contains SUPT3H (EUR-specific) associated with the upper nose, and RUNX2 (EAS-specific) associated with the upper cheek (Supplementary Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">14<\/a>). Additionally, TBX15 on chromosome 1p12, represented by three lead SNPs with broad facial effects, significantly contributed to EAS-specific introgression influence, aligning with previous findings linking TBX15 to Denisovan introgression associated with lip thickness in Latin Americans<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 11\" title=\"Bonfante, B. et al. A GWAS in Latin Americans identifies novel face shape loci, implicating VPS13B and a Denisovan introgressed region in facial variation. Sci. Adv. 7, eabc6160 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR11\" id=\"ref-link-section-d2722786e1926\" rel=\"nofollow noopener\" target=\"_blank\">11<\/a>. Importantly, our analysis relies solely on C-GWAS lead SNPs overlapping\u00a0within previously reported introgressed segments, without considering ancestral\/derived allelic states. To explore this further, we examined whether the lead SNP in the introgressed segments meets the condition of archaic variants as previously defined<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 53\" title=\"Zeberg, H. et al. A Neanderthal sodium channel increases pain sensitivity in present-day humans. Curr. Biol.30, 3465&#x2013;3469 e3464 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR53\" id=\"ref-link-section-d2722786e1930\" rel=\"nofollow noopener\" target=\"_blank\">53<\/a>, which would increase confidence that the lead SNP represents a true introgressed variant. However, the presence of such variants in modern populations is not necessarily due to hybridization between Neanderthals and Early Sapiens; it could also be due to incomplete lineage sorting. Furthermore, the limited number of high-confidence introgressed face-associated SNPs identified using this approach restricts our ability to draw robust conclusions (Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">17<\/a> and Supplementary Note\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">9<\/a>).<\/p>\n<p>Fig. 7: Inferring the relationship between archaic and modern human facial shape using face-associated SNPs.<a class=\"c-article-section__figure-link\" data-test=\"img-link\" data-track=\"click\" data-track-label=\"image\" data-track-action=\"view figure\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-61761-7\/figures\/7\" rel=\"nofollow noopener\" target=\"_blank\"><img decoding=\"async\" aria-describedby=\"Fig7\" src=\"https:\/\/www.newsbeep.com\/au\/wp-content\/uploads\/2025\/07\/41467_2025_61761_Fig7_HTML.png\" alt=\"figure 7\" loading=\"lazy\" width=\"685\" height=\"484\"\/><\/a><\/p>\n<p>a Regional-PVE effects of lead SNPs in Neanderthal introgressed segments on the face, divided into EUR-specific, overlapping, and EAS-specific groups. b Top gene regions contributing to total effects for the three SNP groups in a, showing cumulative contributions accounting for 50% of the total. c Violin plots of squared correlations between facial PRS profiles of archaic humans and four major populations in the 1000 Genomes Project (African, AFR, N\u2009=\u2009504; European, EUR, N\u2009=\u2009504; East Asian, EAS, N\u2009=\u2009503; South Asian, SAS, N\u2009=\u2009489), grouped by Neanderthals, Neanderthal-Denisovan admixed individual, and Denisovan. The band indicates the median, the box indicates the first and third quartiles, and the whiskers indicate 1.5 times of interquartile range from box. d The mean absolute frequency differences between 10 archaic human samples and 2000 samples from four major populations in the 1000 Genomes Project (same as c) were illustrated for different sets of SNPs, with higher values indicating greater dissimilarity (see the \u201cMethods\u201d section). The dashed line corresponds to the 357 SNPs used to construct the archaic human facial PRS profiles. The violin plots represent replicates of 10,000 random sets of SNPs. The band indicates the median, the box indicates the first and third quartiles, and the whiskers indicate 1.5 times of interquartile range from the box. Labels on the x-axis denote the different SNP sets used: \u201cALL\u201d represents random 357 genome-wide SNPs, \u201cEUR segments\u201d and \u201cEAS segments\u201d include the SNPs among the 357 overlapping introgression segments specific to Europeans and East Asians, respectively. The 3D template facial image in this figure is adapted from White et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 82\" title=\"White, J. D. et al. MeshMonk: Open-source large-scale intensive 3D phenotyping. Sci. Rep. 9, 6085 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR82\" id=\"ref-link-section-d2722786e1986\" rel=\"nofollow noopener\" target=\"_blank\">82<\/a> published under an Open Access license (CC BY 4.0), see <a href=\"http:\/\/creativecommons.org\/licenses\/by\/4.0\/\" rel=\"nofollow noopener\" target=\"_blank\">http:\/\/creativecommons.org\/licenses\/by\/4.0\/<\/a>.<\/p>\n<p>Next, we constructed facial PRS profiles based on 357 face-associated SNPs available in 10 archaic humans (8 Neanderthals, 1 Denisovan, and 1 Neanderthal\u2013Denisovan admixed individual) using genomic data obtained from the Allen Ancient DNA Resource<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 54\" title=\"Mallick, S. et al. The Allen Ancient DNA Resource (AADR) a curated compendium of ancient human genomes. Sci. Data 11, 182 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR54\" id=\"ref-link-section-d2722786e2009\" rel=\"nofollow noopener\" target=\"_blank\">54<\/a> (see the \u201cMethods\u201d section, Supplementary Table\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">3<\/a> and Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">18<\/a>). When comparing archaic humans to the four major modern human populations in the 1000 Genomes Project, archaic facial PRS profiles are more different from three non-African groups, including EUR, than from AFR (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig7\" rel=\"nofollow noopener\" target=\"_blank\">7c<\/a>). This finding aligns with current understanding that Africans are ancestral to all modern humans, making them genetically closer to the common ancestors of both modern and archaic humans. However, this finding may appear as a paradox given that the introgression happened in non-Africans. To explore this further, we calculated the mean absolute differences in allele frequencies for the 357 SNPs between archaic humans and modern human populations, and compared these to 10,000 random sets of number-matched random SNPs. Differences between archaic humans and AFR were consistently smaller than those between archaic humans and all non-Africans from EUR, EAS, and SAS, even when focusing on SNPs within introgressed segments (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig7\" rel=\"nofollow noopener\" target=\"_blank\">7d<\/a>). These findings suggest that the similarity in facial PRS profiles between archaic humans and AFR is likely due to shared ancestral alleles. However, we acknowledge a potential bias stemming from the predominantly European GWAS origin of these 357 facial SNPs. Future studies should prioritize facial GWAS in diverse populations, particularly in African cohorts, to reduce such biases and clarify the evolutionary history underlying facial shape genetics.<\/p>\n<p>Facial PRS profiles of Neanderthals align with Neanderthal skulls<\/p>\n<p>Neanderthal skulls have been discovered and show a series of distinct features that are different compared to skulls of modern humans<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 51\" title=\"Weaver, T. D. Out of Africa: modern human origins special feature: the meaning of neandertal skeletal morphology. Proc. Natl Acad. Sci. USA 106, 16028&#x2013;16033 (2009).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR51\" id=\"ref-link-section-d2722786e2033\" rel=\"nofollow noopener\" target=\"_blank\">51<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 55\" title=\"Schwartz, J. H. &amp; Tattersall, I. Fossil evidence for the origin of Homo sapiens. Am. J. Phys. Anthropol. 143, 94&#x2013;121 (2010).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR55\" id=\"ref-link-section-d2722786e2036\" rel=\"nofollow noopener\" target=\"_blank\">55<\/a>. To validate Neanderthal facial PRS profiles, we mapped them to 16 Neanderthal facial features reflected by skulls, excluding features not reflected in bone structure, such as the nose tip and certain aspects of the nose bridge (Supplementary Table\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">4<\/a>, see the \u201cMethods\u201d section). Of the 16 facial features analysed, 15 showed concordances between (a) the direction of the difference predicted by facial PRS profiles in Neanderthals relative to Europeans, and (b) the direction of the reported feature differences for Neanderthals relative to modern humans, examples include a wider face, more protruded brow ridge, flatter cheekbones, and lower palate (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig8\" rel=\"nofollow noopener\" target=\"_blank\">8a<\/a>). Permutation tests (see the \u201cMethods\u201d section) indicated that observing 15 concordant features is statistically highly significant, exceeding all 10,000 random permutations (p\u2009&lt;\u20090.0001, Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig8\" rel=\"nofollow noopener\" target=\"_blank\">8b<\/a>). Only one feature (wider palate in Neanderthals) showed inconsistent classification due to conflicting directions from multiple overlapping PRS (p\u2009=\u20090.0126). No discordant results were observed (p\u2009=\u20090.0012). The statistical significance of these findings indicates that the observed concordance between European-based PRS predictions for Neanderthals and their known facial features relative to modern humans is unlikely to have occurred by chance. To further assess the robustness of these results, we conducted simulations to evaluate how data limitations for archaic humans, such as pseudohaploid data, small sample sizes, and low call rates, might affect PRS accuracy (Supplementary Notes\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">10<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">11<\/a>). The results showed that pseudohaploid data and small sample sizes did not bias PRS averages, although they did increase the uncertainty. Additionally, imputing genotypes for low call rate samples yielded PRS averages highly consistent with those from high call rate samples. These findings suggest that, despite moderate PRS r2 values that may not accurately represent individual facial profile and the inherent uncertainty of archaic DNA data, the broader patterns in PRS profiles, such as directionality of averages, remained robust, particularly for extreme values, reinforcing the reliability of archaic PRS profiles for characterizing population-level facial difference.<\/p>\n<p>Fig. 8: Facial PRS profiles of archaic humans.<a class=\"c-article-section__figure-link\" data-test=\"img-link\" data-track=\"click\" data-track-label=\"image\" data-track-action=\"view figure\" href=\"https:\/\/www.nature.com\/articles\/s41467-025-61761-7\/figures\/8\" rel=\"nofollow noopener\" target=\"_blank\"><img decoding=\"async\" aria-describedby=\"Fig8\" src=\"https:\/\/www.newsbeep.com\/au\/wp-content\/uploads\/2025\/07\/41467_2025_61761_Fig8_HTML.png\" alt=\"figure 8\" loading=\"lazy\" width=\"685\" height=\"503\"\/><\/a><\/p>\n<p>a Mapping facial PRS profiles of Neanderthals to 16 facial features reflected by skulls. Left panel: PRS (absolute values\u2009&gt;\u20090.4) are sorted and summarized by their facial effects. Right panel: 16 known Neanderthal facial features categorized by different colour according to facial regions, with lines indicating enhancement (solid) or reduction (dashed) predicted by the corresponding PRS. Triangles represent consistent directional predictions, including upward (enhancement) or downward (reduction) those, and circles denote inconsistencies. The diamonds highlight PRS not corresponding to known features. b Comparison of concordance in Neanderthal facial features based on phenotypic and genetic differences. The bar graph assessed the concordance between phenotypic descriptions from archaeological studies and genetic predictions using PRS for 16 facial features of Neanderthals. Each bar represents the number of facial features displaying concordant (orange), discordant (purple), or inconsistent (green) results based on the alignment between skull morphology differences and the direction of mean PRS differences for Neanderthals relative to EUR (see the \u201cMethods\u201d section). The analysis involved 10,000 permutations using 357 MAF-matched SNPs to generate null distributions and assess the statistical significance of the observed patterns with three one-sided significance levels noted at the top. c Same form of illustration as a but displayed the difference between Denisovans and Neanderthals using the PRS difference (Denisovan PRS minus Neanderthal PRS average).<\/p>\n<p>Next, we compared the facial PRS profiles between Neanderthals (8 samples) and EUR from the 1000 Genomes Project using a t-test, which showed significant differences for 18 out of 23 facial traits (FDR\u2009&lt;\u20090.05, Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">14<\/a>). Notably, Neanderthals had a significantly wider and flatter upper cheek region (p\u2009=\u20098.5\u2009\u00d7\u200910\u22126) than EUR, consistent with fossil evidence showing the broad facial structure and relatively straight cheekbones of Neanderthals relative to modern humans<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 51\" title=\"Weaver, T. D. Out of Africa: modern human origins special feature: the meaning of neandertal skeletal morphology. Proc. Natl Acad. Sci. USA 106, 16028&#x2013;16033 (2009).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR51\" id=\"ref-link-section-d2722786e2115\" rel=\"nofollow noopener\" target=\"_blank\">51<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 56\" title=\"Tattersall, I. &amp; Schwartz, J. H. Morphology, paleoanthropology, and Neanderthals. Anat. Rec. 253, 113&#x2013;117 (1998).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR56\" id=\"ref-link-section-d2722786e2118\" rel=\"nofollow noopener\" target=\"_blank\">56<\/a>. Additionally, Neanderthals had a more retracted nose tip (p\u2009=\u20091.34\u2009\u00d7\u200910\u22125) than EUR, which represents a previously unreported finding since no soft tissue evidence exists for Neanderthals. While methods to predict nose shape from skull data have been developed for modern humans<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 57\" title=\"Rynn, C., Wilkinson, C. M. &amp; Peters, H. L. Prediction of nasal morphology from the skull. Forensic Sci. Med Pathol. 6, 20&#x2013;34 (2010).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR57\" id=\"ref-link-section-d2722786e2128\" rel=\"nofollow noopener\" target=\"_blank\">57<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 58\" title=\"Ryu, J. Y. et al. Craniofacial anthropometric investigation of relationships between the nose and nasal aperture using 3D computed tomography of Korean subjects. Sci. Rep. 10, 16077 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR58\" id=\"ref-link-section-d2722786e2131\" rel=\"nofollow noopener\" target=\"_blank\">58<\/a>, they are trained and validated with head image data of soft and hard tissues of modern humans and are therefore not applicable to archaic humans. More specifically, average Neanderthal PRS profiles predicted shorter (z\u2009=\u2009\u20133.14), narrower (z\u2009=\u2009\u20131.91), but more protruded (z\u2009=\u20091.35) nose bridges, less protruded (z\u2009=\u2009\u20131.99) but more upturned (z\u2009=\u20091.5) nose tips, alongside broader nose wings (z\u2009=\u20091.91), differing more from the nose PRS profiles of EUR (z\u2009=\u20090) than from those of AFR.<\/p>\n<p>Although genomic data are currently only available for a single Denisovan individual, we also performed facial PRS analysis for Denisovans, reflecting preliminary findings. Denisovan PRS fell within the EUR PRS distribution for 17 out of 23 facial traits, with six traits at the extreme tail (p\u2009&lt;\u20090.05, Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">14<\/a>). Notable differences from EUR included a shorter (z\u2009=\u2009\u20133.66) nose bridge, less protruded nose tips (z\u2009=\u2009\u20133.12), wider orbits (z\u2009=\u20094.19), a higher brow ridge (z\u2009=\u20092.12), a higher chin (z\u2009=\u2009\u20132.37), and a smaller tear trough (z\u2009=\u2009\u20131.94). Denisovan PRS also differed from Neanderthals for 10 out of 16 mapped traits, including a higher, more protruded brow ridge, a nose with more European characteristics (larger nose root, narrower wings, and lower tip), more protruded cheekbones, a flatter infraorbital region, and a higher, flatter chin compared to Neanderthals (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#Fig8\" rel=\"nofollow noopener\" target=\"_blank\">8c<\/a>). Although complete Denisovan skulls have not been discovered yet, a Denisovan partial mandible (Xiahe mandible, &gt;160\u2009kya) found on the Tibetan Plateau<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 10\" title=\"Chen, F. et al. A late Middle Pleistocene Denisovan mandible from the Tibetan Plateau. Nature 569, 409&#x2013;412 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR10\" id=\"ref-link-section-d2722786e2188\" rel=\"nofollow noopener\" target=\"_blank\">10<\/a> shows a higher anterior mandible than modern humans but lower than Neanderthals, with a smaller symphyseal angle, indicating a flatter, more retracted chin. Our chin PRS profiles for Denisovans aligned with three of these findings but inconsistently predicted a higher anterior mandible than Neanderthals (Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">19<\/a>). Inspecting previously reported differences in DNA methylated regions among modern humans, Neanderthals and Denisovan<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\" title=\"Gokhman, D. et al. Reconstructing Denisovan anatomy using DNA methylation maps. Cell 179, 180&#x2013;192 e110 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#ref-CR8\" id=\"ref-link-section-d2722786e2196\" rel=\"nofollow noopener\" target=\"_blank\">8<\/a>, we found consistent results with Denisovan facial PRS profiles for 9 out of 12 overlapping predictions, including face width, face length, palate height, etc., but inconsistencies were noted for midface prominence and palate width, etc (Supplementary Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">15<\/a> and Supplementary Data\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41467-025-61761-7#MOESM3\" rel=\"nofollow noopener\" target=\"_blank\">19<\/a>).<\/p>\n<p>While these results indicate genetic evidence for facial differences between archaic and modern humans in specific directions, as well as between Neanderthals and Denisovans, we emphasize the limitations of these findings given the small sample size of currently available genome data from archaic humans.<\/p>\n","protected":false},"excerpt":{"rendered":"C-GWAS and replication highlight 62 novel face loci The C-GWAS discovery phase included 11,662 individuals of European descent&hellip;\n","protected":false},"author":2,"featured_media":780,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[25],"tags":[1315,64,63,1316,1317,1318,336,1319,1320,1321,128],"class_list":{"0":"post-779","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-genetics","8":"tag-3-d-reconstruction","9":"tag-au","10":"tag-australia","11":"tag-evolutionary-genetics","12":"tag-gene-ontology","13":"tag-genetic-variation","14":"tag-genetics","15":"tag-genome-wide-association-studies","16":"tag-humanities-and-social-sciences","17":"tag-multidisciplinary","18":"tag-science"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts\/779","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/comments?post=779"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts\/779\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/media\/780"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/media?parent=779"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/categories?post=779"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/tags?post=779"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}