{"id":159525,"date":"2025-09-25T12:38:08","date_gmt":"2025-09-25T12:38:08","guid":{"rendered":"https:\/\/www.newsbeep.com\/uk\/159525\/"},"modified":"2025-09-25T12:38:08","modified_gmt":"2025-09-25T12:38:08","slug":"a-haplotype-based-evolutionary-history-of-barley-domestication","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/uk\/159525\/","title":{"rendered":"A haplotype-based evolutionary history of barley domestication"},"content":{"rendered":"<p>Sample selection for genome sequencingWild barley<\/p>\n<p>Our wild barley panel (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>) comprised 285 accessions from the Wild Barley Diversity Collection (WBDC)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 41\" title=\"Sallam, A. H. et al. Genome-wide association mapping of stem rust resistance in Hordeum vulgare subsp. spontaneum. G3 7, 3491&#x2013;3507 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR41\" id=\"ref-link-section-d167944803e2000\" rel=\"nofollow noopener\" target=\"_blank\">41<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 42\" title=\"Steffenson, B. J. et al. A walk on the wild side: mining wild wheat and barley collections for rust resistance genes. Aust. J. Agric. Res. 58, 532&#x2013;544 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR42\" id=\"ref-link-section-d167944803e2003\" rel=\"nofollow noopener\" target=\"_blank\">42<\/a>, a collection of ecogeographically diverse accessions. The whole-genome sequencing (WGS) of the WBDC collection has been described in a companion paper<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 17\" title=\"Sallam, A. H. et al. Whole-genome sequencing of the wild barley diversity collection: a resource for identifying and exploiting genetic variation for cultivated barley improvement. Preprint at bioRxiv &#010;                https:\/\/doi.org\/10.1101\/2024.11.18.624148&#010;                &#010;               (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR17\" id=\"ref-link-section-d167944803e2007\" rel=\"nofollow noopener\" target=\"_blank\">17<\/a>. A further 95 diverse barley accessions, mainly from the panel of Russell et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 14\" title=\"Russell, J. et al. Exome sequencing of geographically diverse barley landraces and wild relatives gives insights into environmental adaptation. Nat. Genet. 48, 1024&#x2013;1030 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR14\" id=\"ref-link-section-d167944803e2011\" rel=\"nofollow noopener\" target=\"_blank\">14<\/a>, were also included. The latter set of samples had been sequenced to approximately 3\u00d7 coverage by Jayakodi et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 37\" title=\"Jayakodi, M. et al. The barley pan-genome reveals the hidden legacy of mutation breeding. Nature 588, 284&#x2013;289 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR37\" id=\"ref-link-section-d167944803e2015\" rel=\"nofollow noopener\" target=\"_blank\">37<\/a>. In the present study, we resequenced 32 of these samples to increase their coverage to approximately 10\u00d7.<\/p>\n<p>Domesticated barley<\/p>\n<p>Milner et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 24\" title=\"Milner, S. G. et al. Genebank genomics highlights the diversity of a global barley collection. Nat. Genet. 51, 319&#x2013;326 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR24\" id=\"ref-link-section-d167944803e2027\" rel=\"nofollow noopener\" target=\"_blank\">24<\/a> defined 12 populations using model-based ancestry estimation with ADMIXTURE<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 43\" title=\"Alexander, D. H., Novembre, J. &amp; Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655&#x2013;1664 (2009).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR43\" id=\"ref-link-section-d167944803e2031\" rel=\"nofollow noopener\" target=\"_blank\">43<\/a> in a global diversity panel of 19,778 domesticated barley, which had been subjected to GBS<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 24\" title=\"Milner, S. G. et al. Genebank genomics highlights the diversity of a global barley collection. Nat. Genet. 51, 319&#x2013;326 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR24\" id=\"ref-link-section-d167944803e2035\" rel=\"nofollow noopener\" target=\"_blank\">24<\/a>. We used the ADMIXTURE results and GBS SNP matrix of Milner et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 24\" title=\"Milner, S. G. et al. Genebank genomics highlights the diversity of a global barley collection. Nat. Genet. 51, 319&#x2013;326 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR24\" id=\"ref-link-section-d167944803e2039\" rel=\"nofollow noopener\" target=\"_blank\">24<\/a> for sample selection. Except for the Near-eastern population (coloured orange in figure 1b of Milner et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 24\" title=\"Milner, S. G. et al. Genebank genomics highlights the diversity of a global barley collection. Nat. Genet. 51, 319&#x2013;326 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR24\" id=\"ref-link-section-d167944803e2043\" rel=\"nofollow noopener\" target=\"_blank\">24<\/a>), we selected samples according to the following procedure. First, unadmixed samples, that is, those with an ADMIXTURE ancestry coefficient q\u2009\u2265\u20090.95 were used as input for a PCA with smartpca<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 44\" title=\"Patterson, N., Price, A. L. &amp; Reich, D. Population structure and eigenanalysis. PLoS Genet. 2, e190 (2006).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR44\" id=\"ref-link-section-d167944803e2051\" rel=\"nofollow noopener\" target=\"_blank\">44<\/a> (v7.2.1). Then, samples were selected to cover the PCA space evenly (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">5<\/a>). Owing to its higher genetic diversity and internal substructure, a more sophisticated procedure was followed for the Near-eastern population (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">7<\/a> and Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">6<\/a>). First, ADMIXTURE<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 43\" title=\"Alexander, D. H., Novembre, J. &amp; Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655&#x2013;1664 (2009).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR43\" id=\"ref-link-section-d167944803e2064\" rel=\"nofollow noopener\" target=\"_blank\">43<\/a> (v1.23) was run on 1,078 samples of Milner et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 24\" title=\"Milner, S. G. et al. Genebank genomics highlights the diversity of a global barley collection. Nat. Genet. 51, 319&#x2013;326 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR24\" id=\"ref-link-section-d167944803e2069\" rel=\"nofollow noopener\" target=\"_blank\">24<\/a>, where the Near-eastern ancestry coefficient q was higher than that of all other populations, with q ranging from 0.25 to 0.98. Before running ADMIXTURE, the SNP set was thinned with PLINK<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 45\" title=\"Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559&#x2013;575 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR45\" id=\"ref-link-section-d167944803e2079\" rel=\"nofollow noopener\" target=\"_blank\">45<\/a> (v1.9) using the parameters \u2018&#8211;indep-pairwise 50 10 0.1\u2019. For each value of K (the number of ancestral populations) from 2 to 6, the output of 15 replicate runs of ADMIXTURE with different random seeds was combined with CLUMPP<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 46\" title=\"Jakobsson, M. &amp; Rosenberg, N. A. CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics 23, 1801&#x2013;1806 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR46\" id=\"ref-link-section-d167944803e2086\" rel=\"nofollow noopener\" target=\"_blank\">46<\/a> (v1.1.2) and plotted with Distruct<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 47\" title=\"Rosenberg, N. A. distruct: A program for the graphical display of population structure. Mol. Ecol. Notes 4, 137&#x2013;138 (2004).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR47\" id=\"ref-link-section-d167944803e2091\" rel=\"nofollow noopener\" target=\"_blank\">47<\/a> (v1.1). Individuals with q\u2009\u2265\u200980% for their main ancestry component were considered unadmixed. The results for K\u2009=\u20096 was chosen for further analysis. The genetic separation of the defined populations was confirmed with smartpca<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 44\" title=\"Patterson, N., Price, A. L. &amp; Reich, D. Population structure and eigenanalysis. PLoS Genet. 2, e190 (2006).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR44\" id=\"ref-link-section-d167944803e2101\" rel=\"nofollow noopener\" target=\"_blank\">44<\/a> (v7.2.1). Only those samples of the Near-eastern subpopulations that were actually located in the Near East were selected for sequencing. The selected samples were distributed in an equidistant manner in the PCA diversity space. In total, we selected 302 samples from 15 populations (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">8<\/a> and Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">7<\/a>). The populations were named according to their geographical origins and three key traits closely connected to global population structure<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 24\" title=\"Milner, S. G. et al. Genebank genomics highlights the diversity of a global barley collection. Nat. Genet. 51, 319&#x2013;326 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR24\" id=\"ref-link-section-d167944803e2112\" rel=\"nofollow noopener\" target=\"_blank\">24<\/a> (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">6<\/a>): row type (two-rowed (T), six-rowed (S) and mixed (M)); lemma adherence (hulled (H) and naked (N)); and annual growth habit (winter sown (W), spring sown (S) and mixed (M)). For example, ISR-THS refers to a population whose members are predominantly two-rowed hulled spring barleys from Israel. For each population, we selected about 20 accessions for WGS sequencing. Among these, 7\u201310 samples of each population (total: 116, \u2018high-coverage samples\u2019) were sequenced to approximately tenfold coverage. The remaining samples of each population were sequenced to approximately threefold coverage (total: 186, \u2018low-coverage samples\u2019). Seeds for all selected accessions can be ordered from the German Federal ex situ genebank at IPK Gatersleben.<\/p>\n<p>Plant growth, DNA isolation and Illumina sequencing<\/p>\n<p>Plant cultivation and DNA isolation were essentially as previously described<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 24\" title=\"Milner, S. G. et al. Genebank genomics highlights the diversity of a global barley collection. Nat. Genet. 51, 319&#x2013;326 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR24\" id=\"ref-link-section-d167944803e2128\" rel=\"nofollow noopener\" target=\"_blank\">24<\/a>. Illumina Nextera DNA Flex WGS libraries were prepared and sequenced (paired end: 2\u2009\u00d7\u2009151 cycles) on an Illumina NovaSeq 6000 device at IPK Gatersleben according to the manufacturer\u2019s instructions (Illumina).<\/p>\n<p>Reads mapping and variant calling<\/p>\n<p>The reads of 682 barley genotypes, of which 380 were wild and 302 domesticated, were mapped to the MorexV3 genome sequence assembly<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 15\" title=\"Mascher, M. et al. Long-read sequence assembly: a technical evaluation in barley. Plant Cell 33, 1888&#x2013;1906 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR15\" id=\"ref-link-section-d167944803e2140\" rel=\"nofollow noopener\" target=\"_blank\">15<\/a> using Minimap2 (v2.24)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 48\" title=\"Li, H. New strategies to improve minimap2 alignment accuracy. Bioinformatics 37, 4572&#x2013;4574 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR48\" id=\"ref-link-section-d167944803e2144\" rel=\"nofollow noopener\" target=\"_blank\">48<\/a>. Mapping statistics of all 682 accessions are shown in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">1<\/a>. BAM files were sorted and deduplicated with Novosort (v3.06.05; <a href=\"https:\/\/www.novocraft.com\/products\/novosort\/\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/www.novocraft.com\/products\/novosort\/<\/a>). Variant calling was done with bcftools (v1.15.1)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 49\" title=\"Danecek, P. et al. Twelve years of SAMtools and BCFtools. GigaScience 10, giab008 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR49\" id=\"ref-link-section-d167944803e2158\" rel=\"nofollow noopener\" target=\"_blank\">49<\/a> using the command \u2018mpileup -a DP,AD -q 20 -Q 20 &#8211;ns 3332\u2019. The resultant \u2018raw\u2019 SNP matrix was filtered as follows: (1) only biallelic SNP sites were kept; and (2) genotype calls were deemed successful if their read depth\u2009\u2265\u20092 and read depth\u2009\u2264\u200950; otherwise genotypes were set to missing. SNP sites with fewer than 20% missing calls, and fewer than 20% heterozygous calls were used for ADMIXTURE runs (with K ranging from 2 to 4) as described above. At K\u2009=\u20094, wild individuals with 15% or more ancestry from domesticated barley were considered admixed. A total of 80 wild admixed samples were excluded from subsequent analyses (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a> and Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">21<\/a>). A total of 251 wild barley samples with high coverage (approximately 10\u00d7) without domesticated admixture were used for subsequent population genetic analyses (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">3<\/a>).<\/p>\n<p>We prepared two SNP matrices, SNP1 and SNP2, for downstream analysis. For SNP1, we extracted the data for 367 (251 wild and 116 domesticated) high-coverage samples from the raw SNP matrix. SNP1 was filtered as follows: (1) only biallelic SNP sites were kept; (2) homozygous calls were deemed successful if their read depth\u2009\u22652 and read depth\u2009\u2264\u200950 and set to missing otherwise; and (3) heterozygous calls were deemed successful if the allelic depth of both alleles was 5 or more and set to missing otherwise. The SNP2 matrix contained variants for 302 domesticated samples and was constructed from another bcftools run using the same parameters as above but with a downsampled dataset, in which the read alignments of the high-coverage samples (n\u2009=\u2009116) had been thinned so as to achieve a sequence depth comparable with that of the low-coverage samples (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">22<\/a>) using SAMtools (v1.16.1)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 49\" title=\"Danecek, P. et al. Twelve years of SAMtools and BCFtools. GigaScience 10, giab008 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR49\" id=\"ref-link-section-d167944803e2187\" rel=\"nofollow noopener\" target=\"_blank\">49<\/a> with the command \u2018samtools view -s 0.FRAC\u2019 (FRAC is the sampling rate). The targeted number of uniquely mapped (Q20), deduplicated mapped reads for the downsampled high-coverage data was set to a random number between 35 million and 52 million. Note that the read length was 2\u2009\u00d7\u2009150\u2009bp in all samples. The matrix SNP2 was filtered as follows: (1) only biallelic SNP sites were kept; (2) homozygous calls were considered successful if their read depth\u2009\u2265\u20092 and read depth\u2009\u2264\u200920 and set to missing otherwise; and (3) all heterozygous calls were set to missing. A flow chart describing the construction of the SNP matrices used in this study is shown in Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">23<\/a>. In analyses in which the use of an outgroup was required, we used WGS data of Hordeum pubiflorum<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 50\" title=\"Mascher, M. et al. Barley whole exome capture: a tool for genomic research in the genus Hordeum and beyond. Plant J. 76, 494&#x2013;505 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR50\" id=\"ref-link-section-d167944803e2196\" rel=\"nofollow noopener\" target=\"_blank\">50<\/a>. Read mapping and SNP calling were done as described above with one difference: a VCF file for all sites in the genome, including those identical to the reference genome, was obtained. This VCF file was merged with other VCF files to determine ancestral states. We also prepared a SNP matrix with 367 (251 wild and 116 domesticated) high-coverage samples using B1K-04-02 (FT11) as the genome reference<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 16\" title=\"Jayakodi, M. et al. Structural variation in the pangenome of wild and domesticated barley. Nature 636, 654&#x2013;662 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR16\" id=\"ref-link-section-d167944803e2201\" rel=\"nofollow noopener\" target=\"_blank\">16<\/a> for candidate gene search and SNP age calculation. The reads mapping, SNP calling and filtering procedures were the same as those used for generating the SNP1 matrix.<\/p>\n<p>SNP-based genetic distances<\/p>\n<p>The number of SNPs between any two high-coverage genotypes were calculated as follows. First, pairwise SNP numbers were determined in genomic windows with PLINK2 (v2.00a3.3LM)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 51\" title=\"Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience 4, 7 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR51\" id=\"ref-link-section-d167944803e2213\" rel=\"nofollow noopener\" target=\"_blank\">51<\/a> with the command \u2018plink2 &#8211;from-bp x &#8211;to-bp y &#8211;sample-diff counts-only counts-cols=ibs0,ibs1 ids=s1 s2 \u2026\u2019, where x and y are the start and end coordinates of a window and \u2018s1 s2 \u2026\u2019 is a list of sample IDs. Different window sizes were used: 100\u2009kb (shift of 20\u2009kb), 500\u2009kb (shift of 100\u2009kb), 1\u2009Mb (shift of 200\u2009kb), 2\u2009Mb (shift of 400\u2009kb) and 5\u2009Mb (shift of 5\u2009Mb). Then, in each window, a normalized distance measure was calculated to account for the fact that owing to differences in the mappability of short reads, the effective coverage differs between genomic windows<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 40\" title=\"Zeng, X. et al. Origin and evolution of qingke barley in Tibet. Nat. Commun. 9, 5433 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR40\" id=\"ref-link-section-d167944803e2223\" rel=\"nofollow noopener\" target=\"_blank\">40<\/a> (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">24<\/a>). Per-bp read depth was determined for each sample and each position of the reference genome with the command \u2018samtools view -q 20 -F 3332 | samtools depth\u2019. The effectively covered region of each window was defined as the union of sites with read depths between 2 and 50. For each, pairwise comparison between samples, the effectively covered regions were intersected using a Perl script. The pairwise distance in a genomic window was calculated as (hom\u2009+\u2009het\/2)\/cov, where hom and het are numbers of homozygous and heterozygous differences, respectively, and cov is the size of the intersection of the effectively covered regions of both samples. Genomic windows were considered only if the latter quantity amounted to half the size of the window; otherwise the distance was set to missing.<\/p>\n<p>Validation of SNP number estimation using accurate long reads<\/p>\n<p>To evaluate the accuracy of our SNP number estimates, we used data from the second version of the barley pangenome<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 16\" title=\"Jayakodi, M. et al. Structural variation in the pangenome of wild and domesticated barley. Nature 636, 654&#x2013;662 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR16\" id=\"ref-link-section-d167944803e2239\" rel=\"nofollow noopener\" target=\"_blank\">16<\/a>. Among the 76 accessions included in the barley pangenome, 13 overlapped with our sample set (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">20<\/a>). We downloaded the HiFi reads of these 13 accessions and aligned them to the MorexV3 reference genome<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 15\" title=\"Mascher, M. et al. Long-read sequence assembly: a technical evaluation in barley. Plant Cell 33, 1888&#x2013;1906 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR15\" id=\"ref-link-section-d167944803e2246\" rel=\"nofollow noopener\" target=\"_blank\">15<\/a> using pbmm2 (v1.10.0; <a href=\"https:\/\/github.com\/PacificBiosciences\/pbmm2\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/github.com\/PacificBiosciences\/pbmm2<\/a>). For HiFi reads, the effectively covered region was defined in the same manner as above but with read depths between 10 and 50 considering average HiFi sequencing coverage of approximately 25\u00d7. Variant calling was performed with DeepVariant (v1.6.0)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 52\" title=\"Poplin, R. et al. A universal SNP and small-indel variant caller using deep neural networks. Nat. Biotechnol. 36, 983&#x2013;987 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR52\" id=\"ref-link-section-d167944803e2257\" rel=\"nofollow noopener\" target=\"_blank\">52<\/a> to generate GVCF files for each sample, followed by joint genotyping using GLnexus (v1.3.1)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 53\" title=\"Yun, T. et al. Accurate, scalable cohort variant calls using DeepVariant and GLnexus. Bioinformatics 36, 5582&#x2013;5589 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR53\" id=\"ref-link-section-d167944803e2262\" rel=\"nofollow noopener\" target=\"_blank\">53<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 54\" title=\"Lin, M. F. et al. GLnexus: joint variant calling for large cohort sequencing. Preprint at bioRxiv &#010;                https:\/\/doi.org\/10.1101\/343970&#010;                &#010;               (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR54\" id=\"ref-link-section-d167944803e2265\" rel=\"nofollow noopener\" target=\"_blank\">54<\/a> to obtain a SNP matrix across the 13 samples. We applied the following filtering criteria: (1) only biallelic SNPs were retained; (2) only genotype calls with depth between 10\u00d7 and 50\u00d7 were kept; otherwise, the genotype was set as missing; and (3) for heterozygous calls, we required a minimum allele depth of 10 for each allele. We compared the effective covered region (uniquely mapped regions) of short-read and HiFi-read data across these 13 samples, as well as the intersection of effective covered regions between each pair of samples (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">25<\/a>). The missing rate was calculated for each sample as the number of missing genotype calls divided by its effectively covered region. We then calculated pairwise SNP number between samples using the same method as described above, with a window size of 1\u2009Mb (shift of 200\u2009kb). Only 1-Mb windows in which the intersection of effective covered regions between the two samples exceeds 0.5\u2009Mb were retained for SNP number calculation. Given that SNP number distributions along chromosomes are not always normally distributed \u2014 and may even be bimodal in certain cases \u2014 we applied Kendall rank correlation to evaluate the consistency between SNP numbers calculated from short reads and HiFi reads (Supplementary Figs. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">26<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">27<\/a>). Confidence intervals for Kendall\u2019s tau correlation coefficients were calculated using a percentile bootstrap method with 1,000 resamples.<\/p>\n<p>Linkage disequilibrium decay<\/p>\n<p>The barley genome was split into three compartments (distal, interstitial and proximal) based on recombination rates<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 19\" title=\"Mascher, M. et al. A chromosome conformation capture ordered sequence of the barley genome. Nature 544, 427&#x2013;433 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR19\" id=\"ref-link-section-d167944803e2286\" rel=\"nofollow noopener\" target=\"_blank\">19<\/a> (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">21<\/a> and Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">28<\/a>). Linkage disequilibrium decay was calculated for both wild and domesticated barley in each compartment using PopLDdecay (v3.42)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 55\" title=\"Zhang, C., Dong, S. S., Xu, J. Y., He, W. M. &amp; Yang, T. L. PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics 35, 1786&#x2013;1788 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR55\" id=\"ref-link-section-d167944803e2296\" rel=\"nofollow noopener\" target=\"_blank\">55<\/a> with the command \u2018-Het 0.99 -Miss 0.2 -MAF 0.01 -MaxDist 500\u2019.<\/p>\n<p>Population structure and divergence times in wild barley<\/p>\n<p>Variants calls of 251 high-coverage wild barley samples were extracted from the matrix SNP1 (see above). SNP sites with fewer than 20% missing calls, fewer than 20% heterozygous calls and minor allele frequency (MAF)\u2009\u2265\u20095% were used in population structure analysis. Model-based ancestry estimation was done with ADMIXTURE (v1.23)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 43\" title=\"Alexander, D. H., Novembre, J. &amp; Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655&#x2013;1664 (2009).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR43\" id=\"ref-link-section-d167944803e2308\" rel=\"nofollow noopener\" target=\"_blank\">43<\/a>. The number of ancestral populations K ranged from 2 to 5. At K\u2009=\u20095, individuals with more than 85% of its main ancestry were considered as unadmixed wild barleys. PCA was done with smartpca (v7.2.1)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 44\" title=\"Patterson, N., Price, A. L. &amp; Reich, D. Population structure and eigenanalysis. PLoS Genet. 2, e190 (2006).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR44\" id=\"ref-link-section-d167944803e2318\" rel=\"nofollow noopener\" target=\"_blank\">44<\/a>. Genotype calls of the outgroup sample H. pubiflorum were merged with the SNP matrix, and an IBS-based genetic distance matrix was calculated with PLINK (v1.9)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 45\" title=\"Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559&#x2013;575 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR45\" id=\"ref-link-section-d167944803e2326\" rel=\"nofollow noopener\" target=\"_blank\">45<\/a>. The distance matrix was used to construct a neighbour-joining tree with Fneighbor (<a href=\"https:\/\/emboss.sourceforge.net\/apps\/cvs\/embassy\/phylipnew\/fneighbor.html\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/emboss.sourceforge.net\/apps\/cvs\/embassy\/phylipnew\/fneighbor.html<\/a>), which is part of the EMBOSS package<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 56\" title=\"Rice, P., Longden, I. &amp; Bleasby, A. EMBOSS: the European molecular biology open software suite. Trends Genet. 16, 276&#x2013;277 (2000).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR56\" id=\"ref-link-section-d167944803e2337\" rel=\"nofollow noopener\" target=\"_blank\">56<\/a>. The resultant tree was visualized with Interactive Tree Of Life (iTOL; v7)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 57\" title=\"Letunic, I. &amp; Bork, P. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49, W293&#x2013;W296 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR57\" id=\"ref-link-section-d167944803e2341\" rel=\"nofollow noopener\" target=\"_blank\">57<\/a>. In each of the five wild barley subpopulations, the nucleotide diversity<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 58\" title=\"Tajima, F. Evolutionary relationship of DNA sequences in finite populations. Genetics 105, 437&#x2013;460 (1983).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR58\" id=\"ref-link-section-d167944803e2345\" rel=\"nofollow noopener\" target=\"_blank\">58<\/a> (\u03c0) and Watterson\u2019s estimator<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 59\" title=\"Watterson, G. On the number of segregating sites in genetical models without recombination. Theor. Popul. Biol. 7, 256&#x2013;276 (1975).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR59\" id=\"ref-link-section-d167944803e2349\" rel=\"nofollow noopener\" target=\"_blank\">59<\/a> (\u03b8W) were calculated from the SNP matrix without MAF filtering using a published Perl script<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 40\" title=\"Zeng, X. et al. Origin and evolution of qingke barley in Tibet. Nat. Commun. 9, 5433 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR40\" id=\"ref-link-section-d167944803e2360\" rel=\"nofollow noopener\" target=\"_blank\">40<\/a>. Pairwise fixation indices (FST) between pairs of wild barley populations were calculated in genomic windows (size of 1\u2009Mb, shift of 200\u2009kb) using Hudson\u2019s estimator with the formula given as equation 10 (ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 60\" title=\"Bhatia, G., Patterson, N. J., Sankararaman, S. &amp; Price, A. L. Estimating and interpreting Fst: the impact of rare variants. Genome Res. 23, 1514&#x2013;1521 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR60\" id=\"ref-link-section-d167944803e2370\" rel=\"nofollow noopener\" target=\"_blank\">60<\/a>) using a published Perl script<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 40\" title=\"Zeng, X. et al. Origin and evolution of qingke barley in Tibet. Nat. Commun. 9, 5433 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR40\" id=\"ref-link-section-d167944803e2374\" rel=\"nofollow noopener\" target=\"_blank\">40<\/a>. Coverage-normalized SNP distances were calculated as described above in 1-Mb genomic windows (shift of 200\u2009kb). Distributions of log10-transformed distances in the genomic compartments distal, interstitial and proximal were plotted for each wild barley population in R (v3.5.1)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 61\" title=\"R Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2022); &#010;                http:\/\/www.R-project.org&#010;                &#010;              .\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR61\" id=\"ref-link-section-d167944803e2381\" rel=\"nofollow noopener\" target=\"_blank\">61<\/a>. To infer divergence times, only SNPs in a 50-Mb region flanking the centromeres (\u00b125\u2009Mb) were used. SNP distances were converted into divergence times using the formula g\u2009=\u2009d\/2\u03bc, where g is the number of generations, \u03bc is the mutation rate and d is the number of SNPs per bp. We assumed that the generation time in the annual species H. vulgare is 1 year. We used a random mutation rate of 6.13\u2009\u00d7\u200910\u22129 as had been determined by Wang et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 62\" title=\"Wang, L. et al. The architecture of intra-organism mutation rate variation in plants. PLoS Biol. 17, e3000191 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR62\" id=\"ref-link-section-d167944803e2406\" rel=\"nofollow noopener\" target=\"_blank\">62<\/a> in the Pooideae grass Brachypodium distachyon. The SNP number distribution was visualized by frequency polygons with logarithmic binning (number of bins of 50, range of 101\u2013104.5 (31,622 SNPs)).<\/p>\n<p>Demographic history of wild barley<\/p>\n<p>Demographic inference was done with PSMC<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\" title=\"Li, H. &amp; Durbin, R. Inference of human population history from individual whole-genome sequences. Nature 475, 493&#x2013;496 (2011).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR8\" id=\"ref-link-section-d167944803e2426\" rel=\"nofollow noopener\" target=\"_blank\">8<\/a> (v0.6.5-r67, default parameters) using pseudo-diploid genomes, which were created by combining the BAM files of two homozygous individuals as previously described<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Meyer, R. S. et al. Domestication history and geographical adaptation inferred from a SNP map of African rice. Nat. Genet. 48, 1083&#x2013;1088 (2016).\" href=\"#ref-CR63\" id=\"ref-link-section-d167944803e2430\">63<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Cubry, P. et al. The rise and fall of African rice cultivation revealed by analysis of 246 new genomes. Curr. Biol. 28, 2274&#x2013;2282.e6 (2018).\" href=\"#ref-CR64\" id=\"ref-link-section-d167944803e2430_1\">64<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 65\" title=\"Shah, N. et al. Extreme genetic signatures of local adaptation during Lotus japonicus colonization of Japan. Nat. Commun. 11, 253 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR65\" id=\"ref-link-section-d167944803e2433\" rel=\"nofollow noopener\" target=\"_blank\">65<\/a>. We performed two types of PSMC analyses. The first was conducted separately for five wild barley subpopulations to infer their respective demographic histories (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">2<\/a>). The second treated all wild barley samples as a single population to capture the average demographic history of the species (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#Fig8\" rel=\"nofollow noopener\" target=\"_blank\">3b<\/a>). For the population-specific PSMC analysis, we first calculated the IBS distribution for all pairwise combinations of individuals within each group. On the basis of the distribution, IBS values were divided into two to four bins. Within each bin, we selected either all sample pairs (if the number of combinations was fewer than 50) or 50 pairs (if the number of combinations was more than 50) evenly distributed from low to high IBS values (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">22<\/a>). In selecting sample pairs, we also considered the sequencing coverage of each individual. A pair was retained only if the ratio of coverage, defined as ratio\u2009=\u2009coveragesample2\/(coveragesample1\u2009+\u2009coveragesample2), fell within the range 0.45\u20130.55. For the species-level PSMC analysis, the method was the same, except that each pair of samples was required to come from different subpopulations (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">23<\/a>). PSMC is based on a panmictic model, assuming random mating, in which an individual (for example, a mammal) carries haplotypes derived from different ancestors. For selfing species, the outcome of pseudo-diploid PSMC is highly dependent on IBS. The higher the IBS, the closer the relationship between the pair, and the more likely the haplotypes come from a shared ancestor, which violates the assumption of random mating in PSMC. Conversely, pairs with lower IBS values are more likely to carry haplotypes from different ancestors, making them more consistent with the PSMC model. Therefore, we used the sample pairs from the lowest IBS bin (0.60\u2009&lt;\u2009IBS\u2009&lt;\u20090.67) to represent the average demographic history of wild barley (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#Fig2\" rel=\"nofollow noopener\" target=\"_blank\">2b<\/a> and Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#Fig8\" rel=\"nofollow noopener\" target=\"_blank\">3b<\/a>).<\/p>\n<p>Analysis of deep divergence region on chromosome 5H<\/p>\n<p>We used MUMmer (v4.0.0)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 66\" title=\"Mar&#xE7;ais, G. et al. MUMmer4: a fast and versatile genome alignment system. PLoS Comput. Biol. 14, e1005944 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR66\" id=\"ref-link-section-d167944803e2470\" rel=\"nofollow noopener\" target=\"_blank\">66<\/a> to align eight barley genome assemblies with different haplotypes<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 16\" title=\"Jayakodi, M. et al. Structural variation in the pangenome of wild and domesticated barley. Nature 636, 654&#x2013;662 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR16\" id=\"ref-link-section-d167944803e2474\" rel=\"nofollow noopener\" target=\"_blank\">16<\/a> on chromosome 5H, 100\u2013300\u2009Mb. The minimum alignment identity was 90 and the minimum alignment length was 2,000\u2009bp.<\/p>\n<p>We used cross-population composite likelihood ratio (XP-CLR)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 22\" title=\"Chen, H., Patterson, N. &amp; Reich, D. Population differentiation as a test for selective sweeps. Genome Res. 20, 393&#x2013;402 (2010).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR22\" id=\"ref-link-section-d167944803e2481\" rel=\"nofollow noopener\" target=\"_blank\">22<\/a>, a method for detecting selective sweeps based on allele frequency differentiation, to assess whether a selective sweep signal exists in the deep divergence region on chromosome 5H. First, we performed genotype imputation and phasing of the SNP matrix using Beagle (v5.5)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 67\" title=\"Browning, B. L., Tian, X., Zhou, Y. &amp; Browning, S. R. Fast two-stage phasing of large-scale sequence data. Am. J. Hum. Genet. 108, 1880&#x2013;1890 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR67\" id=\"ref-link-section-d167944803e2485\" rel=\"nofollow noopener\" target=\"_blank\">67<\/a>. We then applied a Python implementation of XP-CLR (<a href=\"https:\/\/github.com\/hardingnj\/xpclr\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/github.com\/hardingnj\/xpclr<\/a>) to calculate XP-CLR scores between the southern Levant population and each of the other four wild barley groups. The analysis was performed using sliding windows of 1\u2009Mb in size (shift of 200\u2009kb). According to our previous definition (Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#Fig9\" rel=\"nofollow noopener\" target=\"_blank\">4b<\/a>), excluding the three intermediate haplotypes, the remaining wild and domesticated barley samples were classified into two haplotype types: haplotype1 and haplotype2. Candidate genes were identified based on the SNP matrix constructed using the wild barley accession B1K-04-02 (FT11) as the reference genome<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 16\" title=\"Jayakodi, M. et al. Structural variation in the pangenome of wild and domesticated barley. Nature 636, 654&#x2013;662 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR16\" id=\"ref-link-section-d167944803e2499\" rel=\"nofollow noopener\" target=\"_blank\">16<\/a> (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">4b<\/a>). The effects of SNPs and indels residing in the genes of those regions were classified with SnpEff (v4.3t)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 68\" title=\"Cingolani, P. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 6, 80&#x2013;92 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR68\" id=\"ref-link-section-d167944803e2507\" rel=\"nofollow noopener\" target=\"_blank\">68<\/a>, and variants with high allele frequency differentiation in haplotype1 and haplotype2 were prioritized (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">5<\/a>).<\/p>\n<p>Definition of ancestral haplotype groups<\/p>\n<p>AHGs were defined with IntroBlocker (v2)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 9\" title=\"Wang, Z. et al. Dispersed emergence and protracted domestication of polyploid wheat uncovered by mosaic ancestral haploblock inference. Nat. Commun. 13, 3891 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR9\" id=\"ref-link-section-d167944803e2522\" rel=\"nofollow noopener\" target=\"_blank\">9<\/a>. To determine an appropriate threshold for separating haplotypes, we computed coverage-normalized SNP-based distances in 1-Mb windows (shift of 200\u2009kb): (1) among wild samples; (2) among domesticated samples; and (3) between wild and domesticated samples. In each of the three cases, all possible pairwise combinations of samples were considered. We selected a threshold of 400 SNPs per Mb to separate AHGs. Coverage normalized SNP\u2013distance matrices computed from 367 high-coverage samples were used as input for IntroBlocker with the \u2018semi-supervised\u2019 model, giving precedence to wild over domesticated samples in the labelling of AHGs. IntroBlocker was run with different window sizes: 100\u2009kb (shift of 20\u2009kb), 500\u2009kb (shift of 100\u2009kb), 1\u2009Mb (shift of 200\u2009kb), 2\u2009Mb (shift of 400\u2009kb) and 5\u2009Mb (shift of 5\u2009Mb). The results of the 5-Mb run are shown in Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#Fig3\" rel=\"nofollow noopener\" target=\"_blank\">3b<\/a> and Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#Fig10\" rel=\"nofollow noopener\" target=\"_blank\">5b<\/a>. After inspection of results (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">9<\/a>), the results from the 100-kb (shift of 20\u2009kb) run were used for downstream analyses.<\/p>\n<p>Analysis of the AHG matrix<\/p>\n<p>The proportions of shared and private AHGs in wild and domesticated barleys were determined with custom Perl scripts. Saturation curves were calculated as follows. We chose sets of k wild barleys (from a universe of 251 samples) at random, with k ranging from 1 to 250. For each k, the selection was repeated 100 times. For each of the samples, we determined the proportion of haplotypes seen in the domesticate that were shared with that set. Mean values and 95% confidence intervals for each k were calculated in R (v3.5.1)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 61\" title=\"R Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2022); &#010;                http:\/\/www.R-project.org&#010;                &#010;              .\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR61\" id=\"ref-link-section-d167944803e2556\" rel=\"nofollow noopener\" target=\"_blank\">61<\/a>based on the\u00a0t-distribution (via the t.test() function). Two-dimensional haplotype frequency spectra were calculated with custom Perl scripts. Genomic windows with more than 20% missing data points were excluded.<\/p>\n<p>To infer the times at which wild haplotypes entered the domesticated gene pool, we ran IntroBlocker with different thresholds for haplotype separation: 400 SNPs (equivalent to an approximate divergent time of 32,000 years ago), 98 SNPs (8,000 years), 73 (6,000 years), 49 SNPs (4,000 years) and 24 SNPs (2,000 years). For each domesticated haplotype, we compared the results from IntroBlocker runs with different thresholds (divergence time brackets). The latest bracket in which haplotype sharing between wild and domesticated samples occurred was considered a terminus post quem for when a wild haplotype type entered the domesticated gene pool. This method is agnostic about the direction of gene flow. To exclude recent introgressions from domesticated to wild barley, we removed windows in which multiple domesticated barley samples and a few wild barleys share haplotypes that diverged within the past 8,000 years. To determine the spatial origin of haplotypes, we averaged the ancestry ADMIXTURE coefficients of all wild individuals in which a given domesticated haplotype occurred (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">29<\/a>). If two wild samples that shared a domesticated haplotype were highly similar (pairwise IBS\u2009\u2265\u20090.95), only one was used for the calculation.<\/p>\n<p>Haplotype-based genetic diversity and selective sweeps<\/p>\n<p>Saturation curves for the average number of haplotypes in a genomic window as a function of sample size were obtained by randomly selecting k individuals with k ranging from 1 to 115 for domesticated samples and from 1 to 250 for wild samples. For each k, the selection was repeated 100 times. Average haplotype numbers were determined for each subsample. Mean values and 95% confidence intervals were calculated in R (v3.5.1)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 61\" title=\"R Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2022); &#010;                http:\/\/www.R-project.org&#010;                &#010;              .\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR61\" id=\"ref-link-section-d167944803e2590\" rel=\"nofollow noopener\" target=\"_blank\">61<\/a>\u00a0based\u00a0on the t-distribution (via the t.test() function). \u03b8W<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 59\" title=\"Watterson, G. On the number of segregating sites in genetical models without recombination. Theor. Popul. Biol. 7, 256&#x2013;276 (1975).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR59\" id=\"ref-link-section-d167944803e2603\" rel=\"nofollow noopener\" target=\"_blank\">59<\/a> and the Shannon diversity index<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 69\" title=\"Spellerberg, I. F. &amp; Fedor, P. J. A tribute to Claude Shannon (1916&#x2013;2001) and a plea for more rigorous use of species richness, species diversity and the &#x2018;Shannon&#x2013;Wiener&#x2019; index. Global Ecol. Biogeogr. 12, 177&#x2013;179 (2003).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR69\" id=\"ref-link-section-d167944803e2607\" rel=\"nofollow noopener\" target=\"_blank\">69<\/a> were calculated with a custom Perl script on haplotype matrices including only genomic windows with less than 20% missing data. The \u03b8W and Shannon index in seven barley chromosomes were plotted with Gnuplot using \u2018smooth bezier\u2019.<\/p>\n<p>We looked for regions of reduced diversity in domesticated relative to wild barley and therein searched for genes that might have been potential targets of selection. To not bias the analysis by the use of a domesticated reference genome (that of cultivar Morex), IntroBlocker was re-run using the SNP matrix based on the wild barley accession B1K-04-02 (FT11)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 16\" title=\"Jayakodi, M. et al. Structural variation in the pangenome of wild and domesticated barley. Nature 636, 654&#x2013;662 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR16\" id=\"ref-link-section-d167944803e2620\" rel=\"nofollow noopener\" target=\"_blank\">16<\/a>. Regions with a Shannon index\u2009\u2264\u20091 were considered selective sweeps. The effects of SNPs and indels residing in the genes of those regions were classified with SnpEff (v4.3t)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 68\" title=\"Cingolani, P. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 6, 80&#x2013;92 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR68\" id=\"ref-link-section-d167944803e2624\" rel=\"nofollow noopener\" target=\"_blank\">68<\/a>, and variants with high allele frequency differentiation were prioritized.<\/p>\n<p>The differentiation between populations of domesticated barley was assessed by computing the absolute allele frequency difference<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 70\" title=\"Berner, D. Allele frequency difference AFD &#x2014; an intuitive alternative to FST for quantifying genetic population differentiation. Genes 10, 308 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR70\" id=\"ref-link-section-d167944803e2631\" rel=\"nofollow noopener\" target=\"_blank\">70<\/a>. The following comparisons were done: NE\u2009+\u2009EU versus ETH, NE\u2009+\u2009EU versus Asia, ETH versus Asia, NE versus EUT, NE versus EUS, and EUT versus EUS. In addition, we calculated FST in genomic windows (size of 100\u2009kb, shift of 20\u2009kb) using the same method as in wild barley. Allele frequency difference was used for haplotypes derived from high-coverage samples (SNP1); FST calculations were performed for all samples, including low-coverage samples (SNP2).<\/p>\n<p>Demographic history of domesticated barley<\/p>\n<p>Trajectories of effective population size across time were inferred with PSMC<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\" title=\"Li, H. &amp; Durbin, R. Inference of human population history from individual whole-genome sequences. Nature 475, 493&#x2013;496 (2011).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR8\" id=\"ref-link-section-d167944803e2655\" rel=\"nofollow noopener\" target=\"_blank\">8<\/a> (v0.6.5-r67, default parameters) using pseudo-diploid genome sequence from two homozygous barley individuals. A generation time of 1 year and a mutation rate of 6.13\u2009\u00d7\u200910\u22129 were used. We ran PSMC on 341 pseudo-haploid genomes obtained from all possible permutation of sample pairs from within 15 domesticated populations to reflect the population history of each subpopulation of domesticated barley. Given that domesticated barley originates from a mosaic genome composed of diverse wild barley lineages, we used the average demographic history of wild barleys (sample pairs from the lowest IBS bin between 0.60 and 0.67 in Extended Data Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#Fig8\" rel=\"nofollow noopener\" target=\"_blank\">3b<\/a>) as a reference background to compare temporal changes in effective population size (Ne) between 15 cultivated barley groups and wild barley.<\/p>\n<p>Split times between pairs of domesticated barley populations were determined by inspecting the distributions of SNP numbers between pairs of samples in those windows (size of 1\u2009Mb, shift of 200\u2009kb) where a given pair of samples differed by fewer than 300 SNPs (corresponding to a divergence of 24,470 years). Only 1-Mb windows in which the intersection of effective covered regions between the two samples exceeds 0.9\u2009Mb were retained for SNP number calculation.<\/p>\n<p>The SNP number distribution was visualized by frequency polygons (linear binning; number of bins of 50; range of 0\u2013300). SNP numbers were converted to divergence time using the following formula: time\u2009=\u2009(SNP number per Mb\/106)\/(2\u2009\u00d7\u20096.13\u2009\u00d7\u200910\u22129), where the 6.13\u2009\u00d7\u200910\u22129 was the random mutation rate (\u03bc) of B. distachyon<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 62\" title=\"Wang, L. et al. The architecture of intra-organism mutation rate variation in plants. PLoS Biol. 17, e3000191 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR62\" id=\"ref-link-section-d167944803e2684\" rel=\"nofollow noopener\" target=\"_blank\">62<\/a>.<\/p>\n<p>Validation of inferred split times<\/p>\n<p>We used a previously published two-rowed ancient barley sample, JK3014<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 71\" title=\"Mascher, M. et al. Genomic analysis of 6,000-year-old cultivated grain illuminates the domestication history of barley. Nat. Genet. 48, 1089&#x2013;1093 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR71\" id=\"ref-link-section-d167944803e2696\" rel=\"nofollow noopener\" target=\"_blank\">71<\/a> (approximately 6,000 years old, from Israel), to assess the accuracy of our method (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">30<\/a>). JK3014 was chosen because it is a high-depth sequenced sample (102\u00d7) and underwent uracil\u2013DNA\u2013glycosylase (UDG)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 72\" title=\"Rohland, N., Harney, E., Mallick, S., Nordenfelt, S. &amp; Reich, D. Partial uracil-DNA-glycosylase treatment for screening of ancient DNA. Phil. Trans. R Soc. B 370, 20130624 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR72\" id=\"ref-link-section-d167944803e2703\" rel=\"nofollow noopener\" target=\"_blank\">72<\/a> treatment, which reduces post-mortem DNA damage. JK3014 was jointly analysed with 116 high-depth modern barley samples for SNP calling. SNPs were filtered using the same preprocessing criteria that we applied in our SNP number calculation. We then calculated the SNP number between JK3014 and each of the 116 samples, without excluding C\u2192T and G\u2192A substitutions. The analysis used a 1-Mb sliding window (shift of 200\u2009kb). To convert the SNP number to time, we used two models:<\/p>\n<p>Model 1 assumes JK3014 is a direct ancestor of modern two-row Israel barley (ISR-THS). In this case, time\u2009=\u2009d\/\u03bc, where d equals the SNP number in 1-Mb windows\/106 and \u03bc is the mutation rate.<\/p>\n<p>Model 2 assumes JK3014 and ISR-THS share a common ancestor, and their divergence time slightly predates 6,000 years ago. In this case, time\u2009=\u2009d\/(coefficient\u2009\u00d7\u2009\u03bc). If JK3014 was a modern barley sample, the coefficient would be 2. Therefore, a reasonable estimate for this coefficient lies between 1 and 2. We used 1.2 to approximate a divergence time slightly earlier than 6,000 years ago. In addition, as UDG treatment cannot entirely eliminate ancient DNA damage, we assumed 10% of the C\u2192T and G\u2192A SNPs might be false positives. Thus, the final equation for model 2 becomes: time\u2009=\u2009(d\/1.1)\/(1.2\u2009\u00d7\u2009\u03bc).<\/p>\n<p>Estimation of haplotype age for domestication genes<\/p>\n<p>We used GEVA (v1)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 30\" title=\"Albers, P. K. &amp; McVean, G. Dating genomic variants and shared ancestry in population-scale sequencing data. PLoS Biol. 18, e3000586 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR30\" id=\"ref-link-section-d167944803e2747\" rel=\"nofollow noopener\" target=\"_blank\">30<\/a> to estimate the age of haplotypes associated with three domestication genes in barley. For GEVA, the alternative allele is assumed to be the derived allele. As the domesticated haplotypes of these genes in domesticated barley are all recessive mutations compared with wild barley, we used the SNP matrix based on the wild barley reference genome B1K-04-12 (FT11)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 16\" title=\"Jayakodi, M. et al. Structural variation in the pangenome of wild and domesticated barley. Nature 636, 654&#x2013;662 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR16\" id=\"ref-link-section-d167944803e2751\" rel=\"nofollow noopener\" target=\"_blank\">16<\/a>. This setup ensures that the causal variant of the domesticated haplotype is treated as the derived allele. Phasing of the SNP matrix was performed using Beagle (v5.5)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 67\" title=\"Browning, B. L., Tian, X., Zhou, Y. &amp; Browning, S. R. Fast two-stage phasing of large-scale sequence data. Am. J. Hum. Genet. 108, 1880&#x2013;1890 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR67\" id=\"ref-link-section-d167944803e2755\" rel=\"nofollow noopener\" target=\"_blank\">67<\/a>. For genes with known causal variants, we applied the following strategies to estimate haplotype age: if the causal variant is a SNP (for example, vrs1.a3 and ppd-H1), we directly used GEVA to estimate the age of that SNP. If the causal variant is a short indel (for example, btr1, btr2, vrs1.a1 and vrs1.a2), we constructed pseudo-SNPs at the indel position (for example, for the 1-bp deletion at position 41,130,358 in btr1, C\/\u2212), such as C\u2192A, C\u2192T and C\u2192G, and estimated their ages using GEVA.<\/p>\n<p>In both SNP and indel cases, we also identified haplotype-specific private SNPs that are in complete linkage with the causal variant and used these SNPs to estimate haplotype age. The defining feature of a causal variant is that it is private to the focal population and has a genotype frequency of 100%. \u2018Private\u2019 refers to those found exclusively in the focal haplotype relative to all other barley samples, including both wild and domesticated barley. The SNPs that we selected as being in \u2018complete linkage with the causal variant\u2019 share these same characteristics: they are private to the population and occur at a genotype frequency of 100%. Therefore, these SNPs probably originated either before or concurrently with the causal variant and can be used alongside it to estimate the age of the haplotype. The actual age of the haplotype is thus equal to or later than the age estimated by this method. For each haplotype, we randomly selected approximately 40 private SNPs, as well as the causal SNP or pseudo-causal SNPs for the calculation (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">14<\/a>). For large deletions (for example, Nud), haplotypes with unknown causal variants (for example, vrs1.a4) and functional (dominant) haplotypes in cultivated barley (Vrs1.b2, Vrs1.b3 and Nud), we estimated haplotype age using approximately 40 private SNPs specific to the domesticated haplotypes. To avoid confounding effects from recombination, we excluded all domesticated samples showing evidence of recombinant haplotypes in the regions of interest (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">13<\/a>). GEVA analyses were performed using default parameters, and downstream filtering was conducted using the \u2018estimate.R\u2019 script provided in the GEVA package. The mutation rate that we used is 6.13\u2009\u00d7\u200910\u22129 from B. distachyon<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 62\" title=\"Wang, L. et al. The architecture of intra-organism mutation rate variation in plants. PLoS Biol. 17, e3000191 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR62\" id=\"ref-link-section-d167944803e2810\" rel=\"nofollow noopener\" target=\"_blank\">62<\/a>. For each SNP, ten replicate runs were performed with different random seeds. Because recombinant haplotypes were excluded from the domesticated haplotype analyses, we reported haplotype ages based on the mutation clock model. Finally, given that barley is a highly selfing species with negligible heterozygosity (that is, nearly haploid in effect), and GEVA was originally developed under a diploid model (for human data), we multiplied all age estimates by 2 to account for ploidy differences and to report the final haplotype age.<\/p>\n<p>As a control group, for each gene locus, we randomly selected approximately 40 SNPs (0.2\u2009&lt;\u2009allele frequency\u2009&lt;\u20090.5) from wild barley within the same genomic region and estimated their ages (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">15<\/a>). Given their uncertain origin \u2014 either recent or ancient in the absence of selection \u2014 low-frequency SNPs are less suitable as reliable controls. By contrast, high-frequency SNPs (for example, those with frequencies above 20%) are likely to have arisen in the past and become fixed or nearly fixed in the population, and thus are expected to exhibit older ages. For wild barley SNP, the joint mutation and recombination clock model were used. In addition, the recessive ppd-H1 allele, which may predate domestication<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 34\" title=\"Jones, H. et al. Population-based resequencing reveals that the flowering time adaptation of cultivated barley originated east of the Fertile Crescent. Mol. Biol. Evol. 25, 2211&#x2013;2219 (2008).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR34\" id=\"ref-link-section-d167944803e2823\" rel=\"nofollow noopener\" target=\"_blank\">34<\/a>, was also included as a control group.<\/p>\n<p>To infer the most likely spatial origins of three genes, a neighbour-joining tree for each gene was constructed with SNPs from an interval within their sweep region. For the btr1\/2, vrs1 and nud loci, the interval extended from 39.4 to 39.7\u2009Mb on chromosome 3H, from 570.5 to 517.2\u2009Mb on chromosome 2H and 525.3\u2013525.7\u2009Mb on chromosome 7H, respectively. The neighbour-joining tree was constructed using SNPs based on the MorexV3 reference (SNP1).<\/p>\n<p>Archaeological excavations<\/p>\n<p>We analysed ancient DNA sequences of 23 barley grains excavated at three archaeological sites in Israel (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM4\" rel=\"nofollow noopener\" target=\"_blank\">16<\/a>). This number included published data of five barley grains from Yoram Cave<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 71\" title=\"Mascher, M. et al. Genomic analysis of 6,000-year-old cultivated grain illuminates the domestication history of barley. Nat. Genet. 48, 1089&#x2013;1093 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR71\" id=\"ref-link-section-d167944803e2850\" rel=\"nofollow noopener\" target=\"_blank\">71<\/a>. Archaeobotanical procedures were performed as described by Lev-Marom et al.\u00a0(manuscript in preparation). The sites Yoram Cave and Timna 34 have been described by Mascher et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 71\" title=\"Mascher, M. et al. Genomic analysis of 6,000-year-old cultivated grain illuminates the domestication history of barley. Nat. Genet. 48, 1089&#x2013;1093 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR71\" id=\"ref-link-section-d167944803e2854\" rel=\"nofollow noopener\" target=\"_blank\">71<\/a> and Lev-Marom et al. Abi\u2019or Cave is a medium-sized cave located on the eastern slopes of the Judean Desert, above Jericho, approximately 50\u2009m below sea level, across from the Karantal Monastery. The excavations at the cave were directed by the late H. Eshel in 1986. It is situated above a larger cave known as \u2018The Spies Cave\u2019 and has three openings above it. The cave contains a main long tunnel, approximately 50\u2009m long, and has revealed archaeological material dating from the Chalcolithic period to the time of the Bar Kochba Revolt (2nd century ce). The cave was found to be heavily disturbed by animals, antiquities robbers and monks who lived in it during the Islamic and more recent periods.<\/p>\n<p>Ancient DNA sequencing and analysis<\/p>\n<p>All laboratory procedures for sampling, DNA extraction, library preparation and library indexing were conducted in facilities dedicated to ancient DNA work at the University of T\u00fcbingen. Before DNA extraction, all seeds were cut into two parts: one part of each seed (36-6.5\u2009mg) was used for DNA extraction and further processing, the other part (26-3.4\u2009mg) was used for radiocarbon dating at the Klaus-Tschira-Arch\u00e4ometrie-Zentrum, Curt-Engelhorn-Zentrum Arch\u00e4ometrie gGmbH. DNA extraction was then performed according to a well-established extraction protocol for ancient plant material<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 71\" title=\"Mascher, M. et al. Genomic analysis of 6,000-year-old cultivated grain illuminates the domestication history of barley. Nat. Genet. 48, 1089&#x2013;1093 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR71\" id=\"ref-link-section-d167944803e2870\" rel=\"nofollow noopener\" target=\"_blank\">71<\/a> and double-stranded dual-indexed DNA libraries were produced<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 73\" title=\"Meyer, M. &amp; Kircher, M. Illumina sequencing library preparation for highly multiplexed target capture and sequencing. Cold Spring Harb. Protoc. &#010;                https:\/\/doi.org\/10.1101\/pdb.prot5448&#010;                &#010;               (2010).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR73\" id=\"ref-link-section-d167944803e2874\" rel=\"nofollow noopener\" target=\"_blank\">73<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 74\" title=\"Kircher, M., Sawyer, S. &amp; Meyer, M. Double indexing overcomes inaccuracies in multiplex sequencing on the Illumina platform. Nucleic Acids Res. 40, e3 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR74\" id=\"ref-link-section-d167944803e2877\" rel=\"nofollow noopener\" target=\"_blank\">74<\/a>. Six ancient DNA samples (TU697 and JK2281-JK3014) were treated with UDG<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 72\" title=\"Rohland, N., Harney, E., Mallick, S., Nordenfelt, S. &amp; Reich, D. Partial uracil-DNA-glycosylase treatment for screening of ancient DNA. Phil. Trans. R Soc. B 370, 20130624 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR72\" id=\"ref-link-section-d167944803e2881\" rel=\"nofollow noopener\" target=\"_blank\">72<\/a> before sequencing. Sequencing was done on Illumina devices at IPK Gatersleben, the University of T\u00fcbingen and the Max-Planck Institute or the Science of Human History Jena.<\/p>\n<p>Paired-end Illumina reads of each sample were merged with leeHom (v1.2.17)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 75\" title=\"Renaud, G., Stenzel, U. &amp; Kelso, J. leeHom: adaptor trimming and merging for Illumina sequencing reads. Nucleic Acids Res. 42, e141 (2014).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR75\" id=\"ref-link-section-d167944803e2888\" rel=\"nofollow noopener\" target=\"_blank\">75<\/a> and mapped to the MorexV3 genome sequence assembly<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 15\" title=\"Mascher, M. et al. Long-read sequence assembly: a technical evaluation in barley. Plant Cell 33, 1888&#x2013;1906 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR15\" id=\"ref-link-section-d167944803e2892\" rel=\"nofollow noopener\" target=\"_blank\">15<\/a> using Minimap2 (v2.24)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 48\" title=\"Li, H. New strategies to improve minimap2 alignment accuracy. Bioinformatics 37, 4572&#x2013;4574 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR48\" id=\"ref-link-section-d167944803e2896\" rel=\"nofollow noopener\" target=\"_blank\">48<\/a>. BAM files were sorted and duplicates were marked with Novosort (v3.06.05; <a href=\"https:\/\/www.novocraft.com\/products\/novosort\/\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/www.novocraft.com\/products\/novosort\/<\/a>). Nucleotide misincorporation profiles were generated with mapDamage (v2.0.8)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 76\" title=\"J&#xF3;nsson, H., Ginolhac, A., Schubert, M., Johnson, P. L. F. &amp; Orlando, L. mapDamage2.0: fast approximate Bayesian estimates of ancient DNA damage parameters. Bioinformatics 29, 1682&#x2013;1684 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR76\" id=\"ref-link-section-d167944803e2907\" rel=\"nofollow noopener\" target=\"_blank\">76<\/a>. Variant calling was done with bcftools (v1.15.1)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 49\" title=\"Danecek, P. et al. Twelve years of SAMtools and BCFtools. GigaScience 10, giab008 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR49\" id=\"ref-link-section-d167944803e2912\" rel=\"nofollow noopener\" target=\"_blank\">49<\/a> using the command \u2018mpileup -a DP,AD -q 20 -Q 20 &#8211;ns 3332\u2019. We omitted the parameter \u2018&#8211;variants-only\u2019 in \u2018bcftools call\u2019 to output genotype in all sites. C\u2192T and G\u2192A were excluded, where the C and G are the alleles in the reference genomes and T and A are the alternative alleles called from the short-read data. The resultant SNP matrix was merged with the three different SNP matrices: SNP1 (367 high-coverage samples), SNP2 (302 domesticated barleys) and a published SNP matrix constructed from GBS data of 19,778 domesticated barleys<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 24\" title=\"Milner, S. G. et al. Genebank genomics highlights the diversity of a global barley collection. Nat. Genet. 51, 319&#x2013;326 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR24\" id=\"ref-link-section-d167944803e2916\" rel=\"nofollow noopener\" target=\"_blank\">24<\/a>. The GBS matrices had been filtered for site-level missing rate (less than 20%) before merging. The merged SNP1 matrix was used for PCA with smartPCA (v7.2.1)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 44\" title=\"Patterson, N., Price, A. L. &amp; Reich, D. Population structure and eigenanalysis. PLoS Genet. 2, e190 (2006).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR44\" id=\"ref-link-section-d167944803e2920\" rel=\"nofollow noopener\" target=\"_blank\">44<\/a> using the parameter \u2018lsqproject: YES\u2019. Neighbour-joining trees were constructed using only SNPs in a 50-Mb region flanking the centromeres (\u00b125\u2009Mb) on each of the seven chromosomes and including only six high-coverage ancient DNA samples, to determine the proximal haplotypes of ancient barley. The merged GBS matrix was used to compute an IBS matrix with PLINK (v1.9)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 45\" title=\"Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559&#x2013;575 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR45\" id=\"ref-link-section-d167944803e2924\" rel=\"nofollow noopener\" target=\"_blank\">45<\/a>. To examine the phylogenetic relationships between ancient DNA and modern domesticated barley, we constructed genome-wide phylogenetic trees using two merged SNP datasets: SNP1 and SNP2, each incorporating ancient DNA samples.<\/p>\n<p>To compare genetic diversity between individual ancient and modern barley samples without relying on population-level statistics, we leveraged rare alleles identified in a comprehensive wild barley panel as proxies for ancestral diversity (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM1\" rel=\"nofollow noopener\" target=\"_blank\">18<\/a>). Wild barley has the most extensive reservoir of allelic variation; alleles with very low frequency in this panel (for example, 0\u2009&lt;\u2009MAF\u2009\u2264\u20090.01) are unlikely to persist through strong bottlenecks or selective sweeps, and thus serve as sensitive markers of lost diversity. For each sample pair, we counted the number of these wild-derived rare alleles present in the ancient genome (A) and in the modern genome (M), and defined the \u2018relative diversity change\u2019 as (M\u2009\u2013\u2009A)\/A. A positive value indicates retention or gain of ancestral diversity in the modern sample, whereas a negative value signifies diversity loss relative to the ancient sample. This approach allows us to quantify diversity change at the single-sample level in a straightforwards, interpretable manner, without requiring large cohort sizes or population-based diversity estimators. We calculated the relative change in genetic diversity between six high-coverage ancient samples and modern domesticated barley individuals from 15 populations.<\/p>\n<p>The merged SNP1 and SNP2 were also used for the calculation of D statistics with the qpDstat program of ADMIXTOOLS (v3.0)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 77\" title=\"Patterson, N. et al. Ancient admixture in human history. Genetics 192, 1065&#x2013;1093 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#ref-CR77\" id=\"ref-link-section-d167944803e2940\" rel=\"nofollow noopener\" target=\"_blank\">77<\/a>. On the basis of previous phylogenetic analyses, we identified ISR-THS as the closest modern barley population to both Yoram Cave and Timna 34, and ME-SHS as the closest to Abi\u2019or Cave. To test for potential gene flow between ancient and modern barley, we performed the following three D statistics analyses: D (ISR-THS, Yoram Cave; P3, H. pubiflorum), D (ISR-THS, Timna 34; P3, H. pubiflorum) and D (ME-SHS, Abi\u2019or Cave; P3, H. pubiflorum). Here P3 refers to any of the 14 modern barley populations other than ISR-THS or ME-SHS, and H. pubiflorum is the outgroup.<\/p>\n<p>Reporting summary<\/p>\n<p>Further information on research design is available in the\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-09533-7#MOESM2\" rel=\"nofollow noopener\" target=\"_blank\">Nature Portfolio Reporting Summary<\/a> linked to this article.<\/p>\n","protected":false},"excerpt":{"rendered":"Sample selection for genome sequencingWild barley Our wild barley panel (Supplementary Table 1) comprised 285 accessions from the&hellip;\n","protected":false},"author":2,"featured_media":159526,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[7],"tags":[59,5459,4230,4231,72635,90,56,54,55],"class_list":{"0":"post-159525","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-science","8":"tag-gb","9":"tag-genomics","10":"tag-humanities-and-social-sciences","11":"tag-multidisciplinary","12":"tag-plant-evolution","13":"tag-science","14":"tag-uk","15":"tag-united-kingdom","16":"tag-unitedkingdom"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/159525","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/comments?post=159525"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/159525\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media\/159526"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media?parent=159525"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/categories?post=159525"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/tags?post=159525"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}