{"id":607354,"date":"2026-04-26T11:48:28","date_gmt":"2026-04-26T11:48:28","guid":{"rendered":"https:\/\/www.newsbeep.com\/us\/607354\/"},"modified":"2026-04-26T11:48:28","modified_gmt":"2026-04-26T11:48:28","slug":"telomere-to-telomere-genome-assemblies-and-population-resequencing-of-diploid-and-allotetraploid-peanut-varieties","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/us\/607354\/","title":{"rendered":"Telomere-to-telomere genome assemblies and population resequencing of diploid and allotetraploid peanut varieties"},"content":{"rendered":"<p>Assembly and annotation of six representative peanut genomes<\/p>\n<p>To determine the genome structure and genetic diversity of A. hypogaea, we sequenced six representative accessions: two diploid progenitors (A. duranensis, <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/nuccore\/V14167\" rel=\"nofollow noopener\" target=\"_blank\">V14167<\/a>; A. ipaensis, <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/nuccore\/K30076\" rel=\"nofollow noopener\" target=\"_blank\">K30076<\/a>), and four allotetraploid cultivars (Laiyang Silihong, S245; Chedouzi, HN873; Huayu23, HN51; Yunnan Rainbow Peanut, S83) (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>). These accessions exhibited diverse phenotypes, including differences in tiller angle, flower characteristics, and seed size, shape and number (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>). Heterozygosity rates ranged from 0.14% to 0.36% (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>). Using PacBio sequencing, we generated 1.42\u2009Tb (average depth of 116\u00d7) of HiFi reads (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>), assembling genomes with an average contig N50 of 85.94\u2009Mb using Hifiasm (v.0.19.5)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 22\" title=\"Cheng, H., Concepcion, G. T., Feng, X., Zhang, H. &amp; Li, H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat. Methods 18, 170&#x2013;175 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#ref-CR22\" id=\"ref-link-section-d299435376e954\" rel=\"nofollow noopener\" target=\"_blank\">22<\/a> (Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Tab1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>). High-throughput chromatin conformation capture (Hi-C) data (average depth of 112\u00d7) were used to corrected and orient contigs (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a> and Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">3<\/a>), and Oxford Nanopore Technologies (ONT) ultralong-read sequencing was used to fill gaps (average depth of 70\u00d7; Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">4<\/a>), resulting in T2T peanut genomes ranging from 1.18\u2009Gb to 2.63\u2009Gb (Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Tab1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>). Telomeres were identified using 7-bp telomere repeats (CCCATTT at the 5\u2032 end and TTTAGGG at the 3\u2032 end). Assembly quality was assessed using metrics including BUSCO<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 23\" title=\"Sim&#xE3;o, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. &amp; Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210&#x2013;3212 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#ref-CR23\" id=\"ref-link-section-d299435376e981\" rel=\"nofollow noopener\" target=\"_blank\">23<\/a> completeness score of 98.67% and long terminal repeat (LTR) assembly index (LAI)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 24\" title=\"Ou, S., Chen, J. &amp; Jiang, N. Assessing genome assembly quality using the LTR Assembly Index (LAI). Nucleic Acids Res. 46, e126 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#ref-CR24\" id=\"ref-link-section-d299435376e985\" rel=\"nofollow noopener\" target=\"_blank\">24<\/a> score of 21.76, indicating high completeness (Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Tab1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>). The accuracy was validated by a quality value<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 25\" title=\"Rhie, A., Walenz, B. P., Koren, S. &amp; Phillippy, A. M. Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies. Genome Biol. 21, 245 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#ref-CR25\" id=\"ref-link-section-d299435376e992\" rel=\"nofollow noopener\" target=\"_blank\">25<\/a> of 47.52 and a mapping rate of 99.76% for Illumina short reads (Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Tab1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a> and Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">5<\/a>). In addition, k-mer distribution curves demonstrated high completeness in the duplicated regions (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>). Transcriptome sequencing achieved a mapping rate of 92.93% (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">6<\/a>). Our T2T genomes demonstrated superior quality and completeness compared to previously published genomes (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">3<\/a>). Using ab initio, homology and transcript-based evidence, we predicted 34,406 to 75,143 protein-coding genes across the accessions, with an average functional annotation rate of 98.75% (Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Tab1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a> and Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">7<\/a>). The average gene length was 2,654\u2009bp, with coding sequences (CDSs) between 1,080\u2009bp and 1,123\u2009bp in length (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">8<\/a>). BUSCO evaluation showed that 98.88% of single-copy genes were completely annotated, further confirming the accuracy of gene annotations (Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Tab1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>).<\/p>\n<p>Fig. 1: Phenotypes and genome characterization.<img decoding=\"async\" aria-describedby=\"figure-1-desc ai-alt-disclaimer-figure-1-1\" src=\"https:\/\/www.newsbeep.com\/us\/wp-content\/uploads\/2026\/04\/41588_2026_2577_Fig1_HTML.png\" alt=\"Fig. 1: Phenotypes and genome characterization.\" loading=\"lazy\" width=\"685\" height=\"297\"\/>The alternative text for this image may have been generated using AI.<\/p>\n<p>a\u2013f, From top to bottom: plant structures, pods, seeds and Circos plot depicting the genomic features of the <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/nuccore\/V14167\" rel=\"nofollow noopener\" target=\"_blank\">V14167<\/a> (a), <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/nuccore\/K30076\" rel=\"nofollow noopener\" target=\"_blank\">K30076<\/a> (b), S245 (c), HN873 (d), HN51 (e), and S83 (f) accessions. The scale for the chromosomes (outer bars) is Mb. The triangles represent telomeres, and the bars represent centromeres. From outside to inside, the different tracks of the Circos plots represent GC content, gene density, TE density, and distributions of Gypsy, Copia and DNA TEs. The gene and TE densities were calculated based on a 200-kb window size.<\/p>\n<p>Table 1 Overview of assembly and annotation statistics for diploid and tetraploid peanutsTEs and centromeres imply asymmetry evolution<\/p>\n<p>Transposable elements (TEs) are crucial sources of lineage-specific genomic innovation and have vital roles in peanut genome evolution. TEs accounted for an average of 76.27% of each genome, ranging from 74.65% to 77.69% (Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Tab1\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a> and Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">9<\/a>). The TE content in the Bt subgenomes (average 77.77%) was consistently higher than that in the At subgenomes (average 75.08%), and a similar difference was also observed between the B (77.69%) and A (74.65%) diploid genomes. Specifically, Gypsy elements constituted 52.40% (~605.50\u2009Mb) of the A(t) genome\/subgenomes, whereas they accounted for 58.91% of the B(t) genome\/subgenomes (~865.57\u2009Mb) (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">9<\/a>). Analysis of base substitution rates using a rate of 1.64\u2009\u00d7\u200910\u22128 revealed distinct rates of LTR expansion between the At and Bt subgenomes. The Gypsy family in the A(t) genome\/subgenomes expanded approximately 0.20 million years ago (Ma), before tetraploidization, whereas two expansion peaks occurred at 0.69\u2009Ma and 0.27\u2009Ma in the B(t) genome\/subgenomes (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig2\" rel=\"nofollow noopener\" target=\"_blank\">2a<\/a>). We observed different insertion times for Gypsy-type LTRs in the At and Bt subgenomes. The Athila, Retand, CRM and Ogre types experienced two expansions in the B(t) genome\/subgenomes, but only the Reina type underwent two expansions in the A(t) genome\/subgenomes (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">4<\/a>). We categorized Gypsy and Copia TEs based on their insertion times into four periods: S1 (1\u201311,000 years ago), S2 (11,001\u2013101,000 years ago), S3 (101,001\u2013301,000 years ago) and S4 (&gt;301,000 years ago). Counts for the A genome were 284, 1,445, 5,323 and 9,553 from S1 to S4; for the B genome, these counts were 467, 1,439, 3,433 and 15,873 (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">10<\/a>). This indicates that TE insertion events occurred more frequently in the A genome during the S3 period. In the tetraploid subgenomes, counts in the At subgenomes were 343, 1,396, 4,655, and 9,747, whereas those in the Bt subgenomes were 416, 1,392, 3,314 and 15,592, respectively. Gypsy-type TEs were primary contributors to TE insertions in the A genome during the S3 phase, with counts of 4,388 in the A genome and 3,861 in the At subgenomes, compared to 2,417 and 2,259 in the B genome and Bt subgenomes, respectively. Comparative analysis of the two main types of Gypsy TEs, Athila and Retand, revealed shorter insertion times in the A(t) genome\/subgenomes than in the B(t) genome\/subgenomes (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">5<\/a>). Full-length Athila phylogenetic trees indicate distinct evolutionary characteristics after the divergence of the A(t) and B(t)genome\/subgenomes (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig2\" rel=\"nofollow noopener\" target=\"_blank\">2b<\/a>). Notably, several Athila and Retand TEs were absent from tetraploid subgenomes but present in the diploid genomes (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">5<\/a>).<\/p>\n<p>Fig. 2: Asymmetric subgenome evolution.<img decoding=\"async\" aria-describedby=\"figure-2-desc ai-alt-disclaimer-figure-2-1\" src=\"https:\/\/www.newsbeep.com\/us\/wp-content\/uploads\/2026\/04\/41588_2026_2577_Fig2_HTML.png\" alt=\"Fig. 2: Asymmetric subgenome evolution.\" loading=\"lazy\" width=\"685\" height=\"977\"\/>The alternative text for this image may have been generated using AI.<\/p>\n<p>a, Insertion times obtained using the LTR retrotransposons (Gypsy and Copia) in the A. hypogaea At (S245A, HN873A, HN51A and S83A), A. hypogaea Bt (S245B, HN873B, HN51B and S83B), A. duranensis (<a href=\"https:\/\/www.ncbi.nlm.nih.gov\/nuccore\/V14167\" rel=\"nofollow noopener\" target=\"_blank\">V14167<\/a>) and A. ipaensis (<a href=\"https:\/\/www.ncbi.nlm.nih.gov\/nuccore\/K30076\" rel=\"nofollow noopener\" target=\"_blank\">K30076<\/a>) genomes. b, Phylogenetic analysis of the expansion of Athila LTR retrotransposons across the entire A(t) and B(t) genome\/subgenomes. c, The phylogenetic relationships between the A(t) and B(t) genome\/subgenomes, analyzed using the CentO repeat unit. d, Maximum likelihood phylogenetic tree based on alignments of the full-length CRM within the centromeric regions of the A(t) and B(t) genome\/subgenomes. e, Phylogenetic relationships and insertion time of full-length Retand elements in the centromeric regions of the A(t) and B(t) genome\/subgenomes. f, Synteny analysis and timing of insertion of full-length Retand elements in the At subgenomes and the diploid genome (<a href=\"https:\/\/www.ncbi.nlm.nih.gov\/nuccore\/V14167\" rel=\"nofollow noopener\" target=\"_blank\">V14167<\/a>) in the centromeric region on chr. 4.<\/p>\n<p>In the diploid genomes, the total lengths of centromeres were 44.70\u2009Mb and 43.30\u2009Mb; by contrast, the At subgenomes showed an increase in average length of 46.11\u2009Mb, whereas the Bt subgenomes experienced a significant reduction, with an average length of 25.70\u2009Mb (range: 23.18\u201329.81\u2009Mb) (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">11<\/a>). This difference was primarily due to a reduction in the number of repetitive units in the Bt subgenomes. Phylogenetic analysis based on centromere monomers revealed that monomers from the A(t) genome\/subgenomes clustered together, as did those from the B(t) genome\/subgenomes, indicating independent evolution of CentO units (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig2\" rel=\"nofollow noopener\" target=\"_blank\">2c<\/a>). Centromeric repetitive elements (CRMs) were interspersed between the At and Bt subgenomes, suggesting synchronous insertion and evolution of CRMs (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig2\" rel=\"nofollow noopener\" target=\"_blank\">2d<\/a>). The A(t) and B(t) genome\/subgenomes showed different Retand element activity, with a later burst insertion observed in the A(t) genome\/subgenomes but absent from the B(t) genome\/subgenomes (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig2\" rel=\"nofollow noopener\" target=\"_blank\">2e<\/a>). The noncollinearity results for Retand elements on chromosome 4 indicated that following tetraploid formation, the A genome and At subgenomes experienced different reshaping in centromere regions (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig2\" rel=\"nofollow noopener\" target=\"_blank\">2f<\/a> and Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">5<\/a>).<\/p>\n<p>SVs during peanut evolution and domestication<\/p>\n<p>T2T genomes enable precise identification of SVs and rearrangements through comparison of diploid and tetraploid chromosomes. Using SyRI (v.1.5.3)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 26\" title=\"Goel, M., Sun, H., Jiao, W. B. &amp; Schneeberger, K. SyRI: finding genomic rearrangements and local sequence differences from whole-genome assemblies. Genome Biol. 20, 277 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#ref-CR26\" id=\"ref-link-section-d299435376e1789\" rel=\"nofollow noopener\" target=\"_blank\">26<\/a> and SVMU (v.4.0.0beta2)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 27\" title=\"Chakraborty, M., Emerson, J. J., Macdonald, S. J. &amp; Long, A. D. Structural variants exhibit widespread allelic heterogeneity and shape variation in complex traits. Nat. Commun. 10, 4872 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#ref-CR27\" id=\"ref-link-section-d299435376e1793\" rel=\"nofollow noopener\" target=\"_blank\">27<\/a>, we detected 153,947 insertions, 182,467 deletions, 6,272 copy number variations (CNVs), 2,351 translocations and 644 inversions in four tetraploid peanut accessions compared to diploid genomes <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/nuccore\/V14167\" rel=\"nofollow noopener\" target=\"_blank\">V14167<\/a> and <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/nuccore\/K30076\" rel=\"nofollow noopener\" target=\"_blank\">K30076<\/a> (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">12<\/a> and Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">6a,b<\/a>). Specifically, the At subgenomes exhibited 128,260 insertions, 145,963 deletions, 4,495 CNVs, 2,174 translocations and 520 inversions, whereas the Bt subgenomes showed only 25,687 insertions, 36,504 deletions, 1,777 CNVs, 177 translocations and 124 inversions. Genes located within 2\u2009kb upstream or downstream of the SV breakpoints (termed SV genes) were identified, with an average of 9,682 SV genes per accession (ranging from 9,522 to 10,006); 45.42% of the SV genes were TE-SV genes (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">13<\/a> and Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">7a,b<\/a>). Further comparisons among tetraploid cultivated accessions revealed a total of 17,216 insertions, 28,554 deletions, 1,692 CNVs, 177 inversions and 215 translocations when comparing S83 to S245, HN873 and HN51 (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">12<\/a> and Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">6c<\/a>). The At subgenomes averaged 8,370 insertions and 14,712 deletions, whereas the Bt subgenomes had 8,864 insertions and 13,842 deletions. SV gene analysis indicated a total of 4,886, 4,533 and 4,392 SV genes for the S245 versus S83, HN873 versus S83, and HN51 versus S83 comparisons, respectively, with an average of 2,015 classified as TE-SV genes (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">13<\/a> and Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">7c<\/a>). To validate the accuracy of SV identification, we confirmed 202 large SVs via PCR amplification (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">14<\/a> and Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">8<\/a>). Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses of SV genes showed significant associations with pathways involved in flavonoid biosynthesis, brassinosteroid biosynthesis and fatty acid metabolism (Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">15<\/a>\u2013<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">17<\/a>). We also identified conserved SVs across all tetraploid subgenomes, indicating rearrangements from or before tetraploidization, particularly in the A(t) genome\/subgenomes. By contrast, fewer specific rearrangements were noted in the B(t) genome\/subgenomes, with a notable 4.48-fold difference in size between At:A and Bt:B (Supplementary Figs. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">9<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">10<\/a>).<\/p>\n<p>Analysis of the relationship between SVs and domestication revealed unique variations among the tetraploid accessions. Compared to the landrace HN873, S245, HN51 and S83 had significantly more unique SVs and genes. Specifically, S245 contained 7,431 SVs (861 genes), S83 had 6,059 SVs (841 genes) and HN51 had 4,156 SVs (448 genes), whereas HN873 had 2,593 SVs (253 genes). In the GO enrichment analysis, the HN873 genes were significantly enriched in terms including calcium ion transport, metal ion transport and and monoatomic cation transport, whereas HN51 was linked to tissue development terms such as embryonic morphogenesis, flower development, pollen tube growth, post-embryonic development, shoot system development, vascular transport and phloem transport (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">18<\/a>). Unique SV genes in S245 and S83 were enriched in fatty acid synthesis and metabolism, with S245 also enriched in defense responses and nutrient level regulation, whereas S83 was related to root development and endoderm formation. These findings suggest that SVs have a crucial role in shaping the domestication of different peanut varieties through artificial selection.<\/p>\n<p>A 13.23-Mb insertion on chr. 14 was found in the <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/nuccore\/K30076\" rel=\"nofollow noopener\" target=\"_blank\">K30076<\/a>, S83, HN51 and HN873 accessions (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig3\" rel=\"nofollow noopener\" target=\"_blank\">3a\u2013e<\/a>), with next-generation sequencing showing the insertion to be present in all 161 var. hirsuta but only 33 of 161 var. hypogaea accessions (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">11<\/a>). This insertion could contribute to phenotypic differences between var. hirsuta and var. hypogaea. KEGG analysis of the 109 genes within the insertion regions highlighted enrichments in lipid metabolism, lignin biosynthesis and responses to environmental stressors (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig3\" rel=\"nofollow noopener\" target=\"_blank\">3f,h<\/a> and Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">12<\/a>). Genes of chr. 14.1804 homologous to OsLAC (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig3\" rel=\"nofollow noopener\" target=\"_blank\">3g,i<\/a>), which plays an important part in regulation of plant architecture and seed development in rice<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 28\" title=\"Zhang, Y. C. et al. Overexpression of microRNA OsmiR397 improves rice yield by increasing grain size and promoting panicle branching. Nat. Biotechnol. 31, 848&#x2013;852 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#ref-CR28\" id=\"ref-link-section-d299435376e1904\" rel=\"nofollow noopener\" target=\"_blank\">28<\/a>, were expressed in peanut lateral branch, axillary bud and hypocotyl (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">12<\/a>). Another gene, chr. 14.1829, which is homologous to WOX3A\/B<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 29\" title=\"Cho, S. H., Kang, K., Lee, S. H., Lee, I. J. &amp; Paek, N. C. OsWOX3A is involved in negative feedback regulation of the gibberellic acid biosynthetic pathway in rice (Oryza sativa). J. Exp. Bot. 67, 1677&#x2013;1687 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#ref-CR29\" id=\"ref-link-section-d299435376e1913\" rel=\"nofollow noopener\" target=\"_blank\">29<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 30\" title=\"Yoo, S. C., Cho, S. H. &amp; Paek, N. C. Rice WUSCHEL-related homeobox 3A (OsWOX3A) modulates auxin-transport gene expression in lateral root and root hair development. Plant Signal. Behav. 8, e25929 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#ref-CR30\" id=\"ref-link-section-d299435376e1916\" rel=\"nofollow noopener\" target=\"_blank\">30<\/a>, was specifically expressed in the embryo, axillary buds and stem apices (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig3\" rel=\"nofollow noopener\" target=\"_blank\">3g,i<\/a> and Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">12<\/a>), suggesting that the insertion may have a role in the unique phenotype of var. hirsuta.<\/p>\n<p>Fig. 3: SVs among tetraploid peanuts.<img decoding=\"async\" aria-describedby=\"figure-3-desc ai-alt-disclaimer-figure-3-1\" src=\"https:\/\/www.newsbeep.com\/us\/wp-content\/uploads\/2026\/04\/41588_2026_2577_Fig3_HTML.png\" alt=\"Fig. 3: SVs among tetraploid peanuts.\" loading=\"lazy\" width=\"685\" height=\"916\"\/>The alternative text for this image may have been generated using AI.<\/p>\n<p>a, Coverage of SVs on chr. 14 of the <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/nuccore\/K30076\" rel=\"nofollow noopener\" target=\"_blank\">K30076<\/a>, S83, HN51, HN873 and S245 genomes using Illumina (next-generation sequencing) and HiFi (third-generation sequencing) reads. b, A large SV (deletion) was detected in S245 during collinearity analysis of S83, <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/nuccore\/K30076\" rel=\"nofollow noopener\" target=\"_blank\">K30076<\/a> and S245. c\u2013e, Hi-C heatmaps constructed using reads from the S245 accession and the reference genomes for S83 (c), S245 (d) and <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/nuccore\/K30076\" rel=\"nofollow noopener\" target=\"_blank\">K30076<\/a> (e). f, GO enrichment analysis of genes located in the SV regions of the S83. g, Functional homologous rice genes of the SV genes in S83. h, GO enrichment analysis of genes located in the SV regions of <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/nuccore\/K30076\" rel=\"nofollow noopener\" target=\"_blank\">K30076<\/a>. i, Functional homologous rice genes of the SV genes in <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/nuccore\/K30076\" rel=\"nofollow noopener\" target=\"_blank\">K30076<\/a>. NGS, next-generation sequencing; TGS, third-generation sequencing; UV, ultraviolet.<\/p>\n<p>Population structure<\/p>\n<p>A total of 521 globally collected peanut accessions were sequenced, yielding an average depth of 16.76\u00d7 and 96.71% genome coverage (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">19<\/a>). Using the S245 genome as a reference, we identified 101,334,387 high-quality single-nucleotide polymorphisms (SNPs), averaging 39 SNPs per kilobase (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">20<\/a>). Phylogenetic and population structure analyses classified the accessions into five genetic groups: wild (G1), var. fastigiata\u2009+\u2009admixture (G2), var. vulgaris (G3), var. hypogaea (G4) and var. hypogaea + var. hirsuta (G5) (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig4\" rel=\"nofollow noopener\" target=\"_blank\">4a<\/a> and Supplementary Figs. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">13<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">14a<\/a>). Principal component analysis (PCA) confirmed the five distinct clusters corresponding to lineages G1\u2013G5 (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig4\" rel=\"nofollow noopener\" target=\"_blank\">4b<\/a>). The G5 population showed significant differentiation from G2 and G3 but not from G4. Genetic diversity analysis indicated that the most pronounced differences were between G2 and other groups, whereas G4 and G5 exhibited minimal differences (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig4\" rel=\"nofollow noopener\" target=\"_blank\">4c<\/a>). The linkage disequilibrium (LD) decay rate was lowest in G1 and highest in G2 (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig4\" rel=\"nofollow noopener\" target=\"_blank\">4d<\/a>). Divergence times estimates indicated that cultivated peanuts diverged into var. fastigiata (G2+G3) and var. hypogaea (G4+G5) around 9,400 years ago, with further differentiation occurring among groups at approximately 7,200 and 8,900 years ago, respectively (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">14b<\/a>).<\/p>\n<p>Fig. 4: Population structure and introgression analyses.<img decoding=\"async\" aria-describedby=\"figure-4-desc ai-alt-disclaimer-figure-4-1\" src=\"https:\/\/www.newsbeep.com\/us\/wp-content\/uploads\/2026\/04\/41588_2026_2577_Fig4_HTML.png\" alt=\"Fig. 4: Population structure and introgression analyses.\" loading=\"lazy\" width=\"685\" height=\"689\"\/>The alternative text for this image may have been generated using AI.<\/p>\n<p>a, Phylogenetic tree and population structure analysis of 521 peanut accessions. b, PCA plot of the first two principal components (PC1 and PC2) of the five groups. c, Population differentiation (FST) and genetic diversity (\u03c0) across the five groups. d, LD decay estimation for the five groups. e\u2013g, Introgression analysis (fd) among different groups: G2 to G3 (top 1% threshold line: 0.64 (e)), G2 to G4 (top 1% threshold line: 0.49 (f)) and G3 to G4 (top 1% threshold line: 0.94 (g)) on different chromosomes, with G5 as the outgroup. Adm, admixture.<\/p>\n<p>Selected regions and introgressed tracts<\/p>\n<p>Genomic regions under selection were analyzed by calculating population fixation statistic (FST) and cross-population composite likelihood ratio (XP-CLR) values between groups (Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">21<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">22<\/a>). The top 1% of FST and XP-CLR intervals were retained to identify selected genes within the G2\u2013G3, G2\u2013G4, G2\u2013G5, G3\u2013G4, G3\u2013G5 and G4\u2013G5 groups. TE insertions were categorized into five time periods for assessment of their impact on these selected sweep regions; this showed that recent TE insertions had smaller effects compared to earlier ones (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">15<\/a>). After filtering of TE intervals, the average length of selective sweep regions in the top 1% of FST was 69.39\u2009Mb, containing an average of 2,463 genes (ranging from 697 to 3,838) (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">21<\/a>). Notably, an average 4.98\u2009Mb of differentially selected regions were identified in the Bt subgenomes, compared to only 2.47\u2009Mb in the At subgenomes, indicating asymmetric evolution. Four-taxon fd statistics detected introgressed regions, with significant gene flow primarily observed in the At subgenome, especially in the G2\u2013G3 and G4\u2013G5 groups (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig4\" rel=\"nofollow noopener\" target=\"_blank\">4e\u2013g<\/a>, Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">16<\/a> and Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">23<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">24<\/a>). Compared to diploids, 22.48%, 51.97% and 49.19% of SNPs were retained in the top 1% of FST, XP-CLR and introgression regions in tetraploids, respectively (Supplementary Figs. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">17<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">18<\/a>). GO and KEGG analyses identified selected genes related to fatty acid synthesis, flavonoid biosynthesis and nutrient metabolism (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">19<\/a>).<\/p>\n<p>Identification of peanut haplotypes linked to higher oil content<\/p>\n<p>Genome-wide association study (GWAS) analysis identified significant signals on chromosome 8 associated with peanut oil content (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">5a\u2013c<\/a>), pinpointing 151 genes in the 200-kb regions. Among these, we identified a candidate gene, chr. 8.2620 (named AhWRI1), located approximately 54,934\u2009bp from the peak at chr. 8:54293412 (P\u2009=\u20093.02\u2009\u00d7\u200910\u22128). Gene structure analysis revealed a base change (A to C) in the promoter region and a nonsynonymous SNP in the first exon (T to G) that altered an amino acid from arginine to methionine (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">5d\u2013f<\/a>). Expression analysis of AhWRI1 indicated predominant expression during seed development (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">20<\/a>). Based on 153 accessions with RNA sequencing (RNA-seq) data, the Hap1 type had an average fragments per kilobase million (FPKM) value of 16.30 and an oil content of 48.41%, whereas the Hap2 type had an average FPKM value of 24.81 and an oil content of 54.10% (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">25<\/a>). Quantitative PCR with reverse transcription (RT\u2012qPCR) confirmed higher AhWRI1 expression in Hap2-type accessions (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">21<\/a>). The correlation between promoter haplotype and expression level suggested causal variation in the promoter region. Subcellular localization studies confirmed that AhWRI1 is a transcription factor (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">5g<\/a>). Transgenic lines overexpressing AhWRI1 in rapeseed and soybean showed lipid droplet accumulation in the leaf mesophyll cells, as detected by confocal imaging and Nile red staining (green) (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">5h,i<\/a> and Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">22a<\/a>). We assessed total triacylglycerol content in seeds and found that AhWRI1 overexpression significantly increased fatty acid levels (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">5j<\/a> and Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">22b<\/a>). We also noted that four transcriptional regulators, LEAFY COTYLEDON1 (LEC1), FUSCA3 (FUS3) and ABSCISIC ACID INSENSITIVE3 (ABI3), were crucial for seed development<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Pelletier, J. M. et al. LEC1 sequentially regulates the transcription of genes involved in diverse developmental processes during seed development. Proc. Natl Acad. Sci. USA 114, E6710&#x2013;E6719 (2017).\" href=\"#ref-CR31\" id=\"ref-link-section-d299435376e2256\">31<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Jo, L., Pelletier, J. M. &amp; Harada, J. J. Central role of the LEAFY COTYLEDON1 transcription factor in seed development. J. Integr. Plant Biol. 61, 564&#x2013;580 (2019).\" href=\"#ref-CR32\" id=\"ref-link-section-d299435376e2256_1\">32<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Giraudat, J. et al. Isolation of the Arabidopsis AB13 gene by positional cloning. Plant Cell 4, 1251&#x2013;1261 (1992).\" href=\"#ref-CR33\" id=\"ref-link-section-d299435376e2256_2\">33<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Parcy, F., Valon, C., Kohara, A., Mis&#xE9;ra, S. &amp; Giraudatag, J. The ABSCISIC ACID-LNSENSITIVE3, FUSCA3, and LEAFY COTYLEDONf loci act in concert to control multiple aspects of Arabidopsis seed development. Plant Cell 9, 1265&#x2013;1277 (1997).\" href=\"#ref-CR34\" id=\"ref-link-section-d299435376e2256_3\">34<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 35\" title=\"Luer&#xDF;en, H., Kirik, V., Herrmann, P. &amp; Mis&#xE9;, S. FUSCA3 encodes a protein with a conserved VP1\/ABI3-like B3 domain which is of functional importance for the regulation of seed maturation in Arabidopsis thaliana. Plant J. 15, 755&#x2013;764 (1998).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#ref-CR35\" id=\"ref-link-section-d299435376e2259\" rel=\"nofollow noopener\" target=\"_blank\">35<\/a> and oil production<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Tian, R. et al. Direct and indirect targets of the arabidopsis seed transcription factor ABSCISIC ACID INSENSITIVE3. Plant J. 103, 1679&#x2013;1694 (2020).\" href=\"#ref-CR36\" id=\"ref-link-section-d299435376e2263\">36<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Manan, S. et al. Soybean LEC2 regulates subsets of genes involved in controlling the biosynthesis and catabolism of seed storage substances and seed development. Front. Plant Sci. 8, 1604 (2017).\" href=\"#ref-CR37\" id=\"ref-link-section-d299435376e2263_1\">37<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 38\" title=\"Zhang, M., Cao, X., Jia, Q. &amp; Ohlrogge, J. FUSCA3 activates triacylglycerol accumulation in Arabidopsis seedlings and tobacco BY2 cells. Plant J. 88, 95&#x2013;107 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#ref-CR38\" id=\"ref-link-section-d299435376e2266\" rel=\"nofollow noopener\" target=\"_blank\">38<\/a>. We found that AhLEC1, AhFUS3, AhABI3 and AhWRI1 had higher expression in high-oil varieties compared to low-oil ones (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">21<\/a>). Yeast one-hybrid and dual-luciferase assays showed that AhWRI1 regulated downstream plant fatty acid biosynthesis pathway protein genes AhACP1 and AhKAS1, whereas AhFUS3, AhABI3 and AhLEC1 bound to the AhWRI1 promoter (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">5k,l<\/a>). The E-box motif, identified in the phase promoter of beans with respect to its strict expression during embryogenesis<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 39\" title=\"Chandrasekharan, M. B., Bishop, K. J. &amp; Hall, T. C. Module-specific regulation of the &#x3B2;-phaseolin promoter during embryogenesis. Plant J. 33, 853&#x2013;866 (2003).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#ref-CR39\" id=\"ref-link-section-d299435376e2311\" rel=\"nofollow noopener\" target=\"_blank\">39<\/a>, was also recognized as an induced binding motif in cis-motifs associated with ABI3 binding peaks in Arabidopsis<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 36\" title=\"Tian, R. et al. Direct and indirect targets of the arabidopsis seed transcription factor ABSCISIC ACID INSENSITIVE3. Plant J. 103, 1679&#x2013;1694 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#ref-CR36\" id=\"ref-link-section-d299435376e2321\" rel=\"nofollow noopener\" target=\"_blank\">36<\/a>. An A-to-C mutation was discovered in the 2,040-bp promoter region of AhWRI1, located within an E-box, with a conserved RY motif on the complementary strand recognized by seed-specific transcription factors (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">5m<\/a>). In yeast one-hybrid assays, cloning of the 52-bp sequence containing the RY and E-box motifs revealed that the C-to-A mutation eliminated interaction with FUS3 (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">5n<\/a>). In addition, dual-luciferase assays showed that the promoter activity of the C variant was significantly higher than that of the A variant, regardless of combination with SK (P\u2009=\u20094.14\u2009\u00d7\u200910\u22125, n\u2009=\u20096) or FUS-SK (P\u2009=\u20091.04\u2009\u00d7\u200910\u22126, n\u2009=\u20096) (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig5\" rel=\"nofollow noopener\" target=\"_blank\">5o<\/a>). This mutation affected binding with AhFUS3, leading to variations in transcriptional activity and expression of the AhWRI1 gene that resulted in differences in average oil content among varieties.<\/p>\n<p>Fig. 5: Causal gene and haplotype controlling oil content.<img decoding=\"async\" aria-describedby=\"figure-5-desc ai-alt-disclaimer-figure-5-1\" src=\"https:\/\/www.newsbeep.com\/us\/wp-content\/uploads\/2026\/04\/41588_2026_2577_Fig5_HTML.png\" alt=\"Fig. 5: Causal gene and haplotype controlling oil content.\" loading=\"lazy\" width=\"685\" height=\"991\"\/>The alternative text for this image may have been generated using AI.<\/p>\n<p>a, Genome-wide screening of selective sweep regions using FST analysis (F-statistics). The top 1% threshold line is at 0.57. b, Significant signals from a GWAS of oil-related traits on chr. 8. Horizontal lines represent the significance threshold (P\u2009=\u20096.26\u2009\u00d7\u200910\u22127, Bonferroni correction). c, LD block of a candidate peak with 200-kb upstream and downstream regions. d, Structure of the AhWRI1 gene and two variants in the promoter and the first exon. e, Significant difference in oil content between the Hap1 (range: 44.56\u201352.00%; n\u2009=\u200960) and Hap2 (range: 42.38\u201357.23%; n\u2009=\u2009436) accessions. Center line, median; box lower and upper edges, 25% and 75% quartiles, respectively; whiskers, 1.5\u00d7 interquartile range; colored dots, outliers. f, The distribution of Hap1 and Hap2 across different groups. g, Subcellular localization of AhWRI1. h, Accumulation of lipid droplets in rapeseed leaf mesophyll cells overexpressing AhWRI1. i, Confocal images of Nile red-stained lipid droplets (LDs; green) in rapeseed leaves with the trans-AhWRI1 gene. j, Total triacylglycerol content in rapeseed leaves expressing AhWRI1. Quantitative data are mean\u2009\u00b1\u2009s.e.m. n\u2009=\u20099 biologically independent samples. k, Yeast one-hybrid assays showing binding of AhWRI1 to the promoters of AhACP1 and AhKAS1 and binding of AhFUS3, AhABI3 and AhLEC1 to the AhWRI1 promoter. l, Dual-luciferase assays in N. benthamiana leaves showing that AhWRI1 activates the AhACP1 and AhKAS1 promoters, and that AhFUS3, AhABI3 and AhLEC1 activate the AhWRI1 promoter; an empty reporter (no promoter) served as a negative control. Quantitative data are mean\u2009\u00b1\u2009s.e.m. n\u2009=6 biologically independent samples. m, Mutation site in the AhWRI1 promoter; the E-box motif is highlighted in blue and the RY motif in pink. n, Yeast one-hybrid assay showing that the C-to-A substitution in the AhWRI1 promoter abolishes interaction with AhFUS3. o, Dual-luciferase assay comparing activities of the AhWRI1 promoters carrying the A or C allele. Quantitative data are mean\u2009\u00b1\u2009s.e.m. n\u2009=\u20096 biologically independent samples. P values were calculated by two-tailed Student\u2019s t-tests (e, j, i and o). TAG, triacylglycerol; TSS, transcription start site; UTR, untranslated region; WT, wild type.<\/p>\n<p><a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM6\" rel=\"nofollow noopener\" target=\"_blank\">Source data<\/a><\/p>\n<p>                        AhGSA1 gene for increased peanut seed size and weight<\/p>\n<p>Seed size and weight are critical traits in peanut breeding, yet understanding of the natural causal variation underlying these traits remains limited despite numerous identified QTLs. Through a GWAS, we identified 62 candidate SNPs on chromosome 16, spanning 130 genes (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig6\" rel=\"nofollow noopener\" target=\"_blank\">6a\u2013d<\/a>). These genes included chr. 16.3093 (named AhGSA1), annotated as glycosyltransferase, overexpression of which significantly increased grain size by modulating cell proliferation and expansion in rice<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 40\" title=\"Dong, N. Q. et al. UDP-glucosyltransferase regulates grain size and abiotic stress tolerance associated with metabolic flux redirection in rice. Nat. Commun. 11, 2629 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#ref-CR40\" id=\"ref-link-section-d299435376e2563\" rel=\"nofollow noopener\" target=\"_blank\">40<\/a>. Notably, an insertion\u2013deletion within the promoter region (660\u2009bp from the transcription start site) of AhGSA1 was identified (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig6\" rel=\"nofollow noopener\" target=\"_blank\">6e<\/a>). RT\u2013qPCR analysis of 30 small-seeded and 30 large-seeded accessions further confirmed a significant difference in the expression levels of Hap1 and Hap2 (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig6\" rel=\"nofollow noopener\" target=\"_blank\">6f<\/a>). Forty-two accessions classified as Hap1 (ATT type) had an average thousand-grain weight of 490.82\u2009g, whereas 293 accessions of the Hap2 (AT type) exhibited a significantly higher average thousand-grain weight of 846.34\u2009g (P\u2009=\u20091.00\u2009\u00d7\u200910\u221220) (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig6\" rel=\"nofollow noopener\" target=\"_blank\">6g<\/a>). In addition, expression analysis showed that AhGSA1 had high expression levels in peanut pods at all developmental stages but was not expressed in the seed coat (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">23<\/a>). The correlation between haplotypes in the promoter region and expression levels suggested that AhGSA1 contributes to variation in seed size and weight. Furthermore, Hap1 was predominantly found in the G2 (var. fastigiata) population, indicating differential selection among cultivated groups (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig6\" rel=\"nofollow noopener\" target=\"_blank\">6h<\/a>). We cloned the 1,000-bp promoter sequence harboring the ATT or AT mutation and demonstrated its ability to activate GUS reporter gene expression. The GUS expression driven by the AT promoter was stronger than that driven by the ATT promoter (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig6\" rel=\"nofollow noopener\" target=\"_blank\">6i<\/a>). Subcellular localization studies showed that AhGSA1 was localized to the cytoplasmic membrane (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig6\" rel=\"nofollow noopener\" target=\"_blank\">6j<\/a>). Dual-luciferase analysis confirmed that AT promoter activity was significantly greater than that of ATT (P\u2009=\u20091.02\u2009\u00d7\u200910\u22124, n\u2009=\u20093) (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig6\" rel=\"nofollow noopener\" target=\"_blank\">6k<\/a>), highlighting the importance of the insertion\u2013deletion in the promoter region for AhGSA1 transcription.<\/p>\n<p>Fig. 6: Candidate gene associated with peanut seed size and weight.<img decoding=\"async\" aria-describedby=\"figure-6-desc ai-alt-disclaimer-figure-6-1\" src=\"https:\/\/www.newsbeep.com\/us\/wp-content\/uploads\/2026\/04\/41588_2026_2577_Fig6_HTML.png\" alt=\"Fig. 6: Candidate gene associated with peanut seed size and weight.\" loading=\"lazy\" width=\"685\" height=\"664\"\/>The alternative text for this image may have been generated using AI.<\/p>\n<p>a,b, Genome-wide scans for selective sweeps using XP-CLR test (threshold line: 326.54) (a) and FST (threshold line: 0.58; F-statistics) (b). c, Significant signals from GWAS of oil-related traits on chr. 16. Horizontal lines represent the significance threshold (P\u2009=\u20096.26\u2009\u00d7\u200910\u22127, Bonferroni correction). d, LD block around the candidate peak within a 200-kb interval. e, Gene structure and promoter variant of AhGSA1. f, RT\u2013qPCR analysis of AhGSA1 expression in 30 Hap1 and 30 Hap2 accessions. Quantitative data are mean\u2009\u00b1\u2009s.e.m. n\u2009=\u20093 biologically independent samples. g, Box plot showing significant differences in thousand-grain weight between Hap1 (range: 342\u2013693; n\u2009=\u200942) and Hap2 (range: 342\u20131,563.64; n\u2009=\u2009293) accessions. Center line, median; box lower and upper edges, 25% and 75% quartiles, respectively; whiskers, 1.5\u00d7 interquartile range; colored dots, outliers. h, Distribution of Hap1 and Hap2 across different groups. i, Subcellular localization of AhGSA1. j, Histochemical GUS staining driven by the Hap1 and Hap2 AhGSA1 promoters. k, Dual-luciferase assay comparing Hap1 and Hap2 promoter activities. Quantitative data are mean\u2009\u00b1\u2009s.e.m. n\u2009=\u20093 biologically independent samples. P values were calculated by two-tailed Student\u2019s t-tests (g and k).<\/p>\n<p><a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM7\" rel=\"nofollow noopener\" target=\"_blank\">Source data<\/a><\/p>\n<p>Gene networks for lipids and anthocyanins in seed development<\/p>\n<p>Peanut seeds have a high oil content, accounting for approximately 50\u201380% of their weight. We performed integrated transcriptomic and metabolomic analyses of S83 and S245 at five developmental stages (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig7\" rel=\"nofollow noopener\" target=\"_blank\">7a<\/a><a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig7\" rel=\"nofollow noopener\" target=\"_blank\">,b<\/a> and Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">26<\/a>\u2013<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">30<\/a>). A total of 497 lipid metabolites were identified, including 300 glycerolipids, 153 glycerophospholipids, 24 sphingolipids and 18 fatty acids (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig7\" rel=\"nofollow noopener\" target=\"_blank\">7c<\/a> and Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">27<\/a>). Clustering based on lipid content revealed that S83 consistently had higher lipid levels than S245, especially during later developmental stages (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig7\" rel=\"nofollow noopener\" target=\"_blank\">7d<\/a>). Lipid accumulation, which is critical for seed development, is regulated by complex metabolic pathways (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig7\" rel=\"nofollow noopener\" target=\"_blank\">7e<\/a>). Distinct accumulation patterns for lipid metabolism were noted in both varieties, peaking at the end of stage S5 (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig7\" rel=\"nofollow noopener\" target=\"_blank\">7f<\/a>). To identify key genes related to lipid metabolism, we conducted weighted gene coexpression network analysis (WGCNA); this identified 11 coexpressed gene modules (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">24a,b<\/a>). The module most correlated with lipid metabolite content contained several key genes, including those encoding fatty acid desaturase and ketoacyl-ACP synthase, which are crucial for fatty acid biosynthesis (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">28<\/a>). Phylogenetic analysis of these enzymes revealed evolutionary relationships that could explain their roles in lipid biosynthesis (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">25a,b<\/a>). Their expression patterns matched metabolite level trends, peaking during development (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig7\" rel=\"nofollow noopener\" target=\"_blank\">7g<\/a>).<\/p>\n<p>Fig. 7: Transcriptomic and metabolomic analysis during seed development.<img decoding=\"async\" aria-describedby=\"figure-7-desc ai-alt-disclaimer-figure-7-1\" src=\"https:\/\/www.newsbeep.com\/us\/wp-content\/uploads\/2026\/04\/41588_2026_2577_Fig7_HTML.png\" alt=\"Fig. 7: Transcriptomic and metabolomic analysis during seed development.\" loading=\"lazy\" width=\"685\" height=\"947\"\/>The alternative text for this image may have been generated using AI.<\/p>\n<p>a,b, Seed diversity between two peanut subpopulations across five developmental stages for S83 (a) and S245 (b). c, Lipid metabolites (497) categorized into glycerolipids, glycerophospholipids, sphingolipids and fatty acids. d, Clustered heatmap of the levels of four classes of lipid in S83 and S245 across five developmental stages. e, Schematic of lipid metabolic pathways, with detected metabolites marked in bold black. f, Clustered heatmap displaying the contents of metabolites involved in the lipid metabolism pathway for S83 and S245. g, Heatmap of expression levels of candidate lipid metabolism genes; the correlation between gene expression and metabolite contents is indicated in teal, and significance is denoted by the olive gradient. h, Anthocyanidin contents of 78 metabolites with anthocyanins categorized into eight classes. i, Clustered heatmap of eight classes of anthocyanidins in S83 and S245 across five developmental stages. j, Schematic of anthocyanidin biosynthesis pathways, with detected metabolites marked in bold black. k, Clustered heatmap displaying the contents of candidate metabolites contributing to the difference in color between S83 and S245. l, Heatmap of the expression levels of candidate anthocyanidin biosynthesis genes; the correlation between gene expression and metabolite contents is indicated in teal, and significance is denoted by the olive gradient. CD, cardiolipin; CDP-DAG, cytidine diphosphate diacylglycerol; CDP-EA, cytidine diphosphoethanolamine; Cho, choline; DAG, diacylglycerol; DGDG, digalactosyldiacylglycerol; EA, ethanolamine; ER, endoplasmic reticulum; FAs, fatty acids; FFA, free fatty acids; G3P, glycerol-3-phosphate; GLs, glycerolipids; GPs, glycerophospholipids; LPA, lysophosphatidic acid; MGDG, monogalactosyldiacylglycerol; PA, phosphatidic acid; PC, phosphatidylcholine; P-Cho, phosphocholine; PE, phosphatidylethanolamine; P-EA, phosphoethanolamine; PG, phosphatidylglycerol; PG-P, phosphatidylglycerol phosphate; PI, phosphatidylinositol; PIP2, phosphatidylinositol 4,5-bisphosphate; PS, phosphatidylserine; SPs, sphingolipids; SQDG, sulfoquinovosyldiacylglycerol; TAG, triacylglycerol.<\/p>\n<p>We also investigated anthocyanidin metabolism and identified 78 metabolites across eight classes (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig7\" rel=\"nofollow noopener\" target=\"_blank\">7h<\/a> and Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">29<\/a>). Clustering showed that S83 had higher concentrations of petunidin, malvidin, flavonoids and delphinidin, whereas S245 had more procyanidin, peonidin, pelargonidin and cyanidin (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig7\" rel=\"nofollow noopener\" target=\"_blank\">7i<\/a>). These metabolites could influence differences in seed coat color between S83 and S245 (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig7\" rel=\"nofollow noopener\" target=\"_blank\">7j,k<\/a>). WGCNA identified key genes with expression levels that were correlated with the abundances of differentially accumulated metabolites during seed development (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">24c,d<\/a>), in particular, genes encoding the MYB and bHLH transcription factors, which are known to regulate anthocyanin biosynthesis<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 19\" title=\"Zhao, Y. et al. Whole-genome resequencing-based QTL-seq identified AhTc1 gene encoding a R2R3-MYB transcription factor controlling peanut purple testa colour. Plant Biotechnol. J. 18, 96&#x2013;105 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#ref-CR19\" id=\"ref-link-section-d299435376e2857\" rel=\"nofollow noopener\" target=\"_blank\">19<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 41\" title=\"Xu, W., Dubos, C. &amp; Lepiniec, L. Transcriptional control of flavonoid biosynthesis by MYB-bHLH-WDR complexes. Trends Plant Sci. 20, 176&#x2013;185 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#ref-CR41\" id=\"ref-link-section-d299435376e2860\" rel=\"nofollow noopener\" target=\"_blank\">41<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 42\" title=\"Chen, H. et al. Fine-mapping and gene candidate analysis for AhRt1, a major dominant locus responsible for testa color in cultivated peanut. Theor. Appl. Genet. 134, 3721&#x2013;3730 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#ref-CR42\" id=\"ref-link-section-d299435376e2863\" rel=\"nofollow noopener\" target=\"_blank\">42<\/a> (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">30<\/a>). Phylogenetic analysis of these transcription factors, including those previously reported, provided insights into their evolutionary relationships and potential functional divergence (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">25c,d<\/a>). The expression patterns of these key transcription factors were consistent with the accumulation trends of anthocyanin metabolites, implying roles in the regulation of pigment biosynthesis during seed development (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-026-02577-z#Fig7\" rel=\"nofollow noopener\" target=\"_blank\">7l<\/a>).<\/p>\n<p>In summary, our analyses identified important genes involved in lipid and anthocyanin biosynthesis during peanut seed development. These genes may have central roles in determining oil content and seed coat color and could thus represent targets for future studies and crop improvement.<\/p>\n","protected":false},"excerpt":{"rendered":"Assembly and annotation of six representative peanut genomes To determine the genome structure and genetic diversity of A.&hellip;\n","protected":false},"author":2,"featured_media":607355,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[32],"tags":[2342,13114,258,8869,13113,257,263944,34411,3870,79],"class_list":{"0":"post-607354","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-science","8":"tag-agriculture","9":"tag-animal-genetics-and-genomics","10":"tag-biomedicine","11":"tag-cancer-research","12":"tag-gene-function","13":"tag-general","14":"tag-genome-assembly-algorithms","15":"tag-genome-wide-association-studies","16":"tag-human-genetics","17":"tag-science"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts\/607354","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/comments?post=607354"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts\/607354\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/media\/607355"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/media?parent=607354"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/categories?post=607354"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/tags?post=607354"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}