Analysis of ancient human mitochondrial DNA from the Xiaohe cemetery: insights into prehistoric population movements in the Tarim Basin, China



The Tarim Basin in western China, known for its amazingly well-preserved mummies, has been for thousands of years an important crossroad between the eastern and western parts of Eurasia. Despite its key position in communications and migration, and highly diverse peoples, languages and cultures, its prehistory is poorly understood. To shed light on the origin of the populations of the Tarim Basin, we analysed mitochondrial DNA polymorphisms in human skeletal remains excavated from the Xiaohe cemetery, used by the local community between 4000 and 3500 years before present, and possibly representing some of the earliest settlers.


Xiaohe people carried a wide variety of maternal lineages, including West Eurasian lineages H, K, U5, U7, U2e, T, R*, East Eurasian lineages B, C4, C5, D, G2a and Indian lineage M5.


Our results indicate that the people of the Tarim Basin had a diverse maternal ancestry, with origins in Europe, central/eastern Siberia and southern/western Asia. These findings, together with information on the cultural context of the Xiaohe cemetery, can be used to test contrasting hypotheses of route of settlement into the Tarim Basin.


The Tarim Basin in the Xinjiang region of China is situated on the Silk Road, the collection of ancient trade routes that for several millennia linked China to the Mediterranean (Fig. 1). The present-day inhabitants of the Tarim Basin are highly diverse both culturally and biologically as a result of extensive movements of peoples and cultural exchanges between east and west Eurasia [1–3]. Archaeological and anthropological investigations have helped to formulate two main theories to account for the origin of the populations in the Tarim Basin [4–12]. The first, the so-called “steppe hypothesis”, maintains that the Tarim region experienced at least two population influxes from the Russo-Kazakh steppe. The earliest settlers may have been nomadic herders of the Afanasievo culture (ca. 3300–2000 B.C.), a primarily pastoralist culture derived from the Yamna culture of the Pontic-Caspian region and distributed in the eastern Kazakhstan, Altai, and Minusinsk regions of the steppe north of the Tarim Basin (Fig. 1) [9, 12–15]. This view is based on the numerous similarities between the material culture, burial rituals and skeletal traits of the Afanasievo culture and those of the earliest Bronze Age sites in the Tarim Basin, such as Gumugou (ca. 3800 BP), one of the oldest sites with human burials in Xinjiang [8, 9, 11, 12, 16]. These first settlers were followed by people of the Late Bronze Age Andronovo cultural complex (ca. 2100–900 B.C.), another pastoralist culture derived from the Yamna culture, primarily distributed in the Pamirs, the Ferghana Valley, Kazakhstan, and the Minusinsk/Altai region (Fig. 1) [8, 9, 11, 12, 15, 16]. Their arrival is signaled by the introduction of new material culture, clothing styles and burial customs around 1200 B.C.
The second model, known as the “Bactrian oasis hypothesis”, also postulates a two-step settlement of the Tarim Basin in the Bronze Age, but maintains that the first settlers were farmers of the Bactria–Margiana Archaeological Complex (or BMAC, also known as the Oxus civilization) (ca. 2200–1500 B.C.) west of Xinjiang in Uzbekistan (north Bactria), Afghanistan (south Bactria), and Turkmenistan [17], followed later by the Andronovo people from the northwest (Fig. 1) [5, 7]. This model emphasises the environmental similarities between the Xinjiang and Central Asian desert basins, and suggests that certain features found in Xinjiang, including the irrigation systems, wheat remains, woolen textiles, bones of sheep and goats, and traces of the medicinal plant Ephedra, could be evidence of links with the Oxus civilization [5, 7, 16]. These contrasting models can be tested using DNA recovered from archaeological bones. Previous genetic evidence on the origin of the earliest settlers was based on the analysis of mtDNA from burials at the Gumugou cemetery on the eastern edge of the Tarim Basin. In that study, researchers sequenced the first mtDNA hypervariable region (HVRI), but the results were inconclusive [18]. The discovery of another Bronze Age site of a similar age to Gumugou, with many well-preserved mummies, including individuals with European facial features, provided a unique opportunity to obtain genetic evidence about the first settlers of the Tarim Basin [19–21].

Fig. 1
Map of Eurasia showing the location of the Xiaohe cemetery, the Tarim Basin, the ancient Silk Road routes and the areas occupied by cultures associated with the settlement of the Tarim Basin. The figure is drawn according to the published literature

We describe here the analysis of mtDNA from human remains recovered from the Xiaohe tomb complex, an important Bronze Age site on the eastern edge of the Tarim Basin (40°20′11″N, 88°40′20.3″E) (Fig. 1). Discovered originally in 1934 by the Swedish archaeologist Folke Bergman, it was subsequently lost, but rediscovered in 2000 by a team from the Xinjiang Archaeological Institute using global positioning equipment. The cemetery was excavated between 2002 and 2005, and consisted of five strata with radiocarbon dates ranging from 4000 to 3500 years before present (14C yBP) [19, 22]. The site has many notable features, including numerous large phallus and vulva posts made of poplar, striking wooden human figures and masks, well-preserved boat coffins, leather hides, wheat and millet grains, and many artifacts (Fig. 2). Importantly, it contains the oldest and best-preserved mummies so far discovered in the Tarim Basin, possibly those of the earliest people to settle the region. Genetic analysis of these mummies can provide data to elucidate the affinities of the earliest inhabitants, and help understand later patterns of human migration in the Eurasian continent.

Fig. 2
a Fourth layer of the Xiaohe cemetery showing a large number of large phallus and vulva posts; b A well-preserved boat coffin; c Female mummy with European features; d Double-layered coffin excavated from the Xiaohe cemetery

The necropolis consisted of five layers of burials spanning half a millennium, offering the opportunity to determine the extent of interactions between the people of Xiaohe and other populations after the original settlement of the Tarim Basin. Did the people remain comparatively isolated or did they intermarry with newcomers? In an earlier study, we analysed DNA recovered from the deepest and oldest layer of burials of the Xiaohe site, the fifth layer, corresponding to the earliest inhabitants. Our results revealed that the first settlers carried both European and central Siberian maternal lineages. These findings agreed with the archaeological evidence for a close connection to the Afanasievo culture of the steppe north of the Tarim Basin, in other words with the “steppe hypothesis” [23]. We describe here the analysis of the maternal lineages of individuals recovered from the remaining four burial layers, and discuss the results in the context of the contrasting views on the settlement and migration patterns of the Tarim Basin.


Bone samples

The human remains excavated from the Xiaohe burial complex were exceptionally well preserved by virtue of the dry, sandy, well-drained soil, which is both alkaline and high in salt. The cemetery, consisting of 167 graves, was excavated by the Xinjiang Provincial Institute of Cultural Relics and Archaeology, with permission from the State Administration of Cultural Heritage, which controls archaeological excavations in China. After recording and photographing, the skeletal remains of 92 well-preserved individuals were placed in cardboard boxes, together with the surrounding sandy soil, and sent to the ancient DNA laboratory of Jilin University, where they were stored in a cool and dry environment. Bone and tooth samples were collected by two skilled staff members wearing disposable gloves and face masks. Thirty individuals, representing the oldest layer, were analysed in a previous study [23]. The present study included 28 individuals from the fourth layer, seven from the third layer, and 27 from layers 1–2. Of the latter, 22 were found scattered on the surface of the sand, because the burials of the uppermost two layers had been damaged by looters and weathering. Teeth and bone were taken from each individual whenever possible. Details of the samples are included in the electronic supplement (Additional file 1: Table S1).

Bones were processed and DNA extracted as described previously [23], with the inclusion of an extraction blank for every three ancient samples.

DNA authentication and prevention of contamination

Strict precautions were taken to avoid contamination by modern DNA. Ancient DNA degradation and potential contamination were monitored as described by Gilbert et al. [24]. In brief, DNA extractions, and steps performed before polymerase chain reaction (PCR) amplification, were performed in a building remote from the post-PCR laboratory, in a laboratory dedicated exclusively to ancient DNA research. The laboratory was equipped with positive air pressure, and rooms were irradiated overnight with UV light (254 nm). Surfaces were cleaned frequently with DNA Off. Extraction and amplification blanks were included in every PCR assay in order to detect any potential contamination from sample processing or reagents. Multiple extractions and amplifications from the same individual were undertaken at different times and from two different parts of the skeleton, such as bone and tooth, to detect artefactual sequences due to cross-contamination, pre-lab contamination, DNA damage or jumping PCR events. A randomly chosen subset of samples was independently re-analysed in our new laboratory by a different laboratory member, in order to detect contamination arising from the laboratory environment. PCR amplicons of six of the ancient DNA extracts were cloned to check for potential heterogeneity in the amplification products due to contamination, DNA damage, or jumping PCR. MtDNA amplicons of different sizes were analysed to investigate the inverse correlation between amplicon size and amplification efficiency. Ancient DNA from cattle remains, found at the same site, was isolated using the same procedure as for the human ancient DNA, providing an additional control for contamination. Lastly, the DNA types of the archaeologists and laboratory personnel were compared to the experimental results to check for potential contamination, as described in a previous study [23].

DNA quantification and PCR amplification

Three ancient extracts were chosen at random to quantify amplifiable mtDNA of four different fragment sizes, namely 138, 209, 235 and 393 base pairs (bp), using a GeneAmp 5700 Sequence Detector (Applied Biosystems, USA). qPCR amplification was performed in 25 μL reactions containing 1X SYBR Green PCR Master Mix (Applied Biosystems, USA), 0.5 μM each primer, 2 mM BSA (Takara, Japan) and 5 μL DNA extract. The specificity of the primers was validated using modern DNA, and a single peak was observed in the post-PCR melt curve for all fragments, indicating specific binding. The mitochondrial HVRI sequence polymorphisms were analysed by amplifying a segment spanning nucleotide positions 16035–16409, using two overlapping primer pairs. In addition, several mtDNA coding region polymorphisms diagnostic for major branches of the human mtDNA tree were typed, as follows: Haplogroups (Hgs) R (12705C), UK (12308G), HV (14766C), H (7028C), R1 (4917G), R11 (10031C), M5 (1888A), M25 (15928A), C4 (11969A) and G (4833G) were identified by direct sequencing. Hgs M (10400T), C (14318C), T (15607G) and D (5178A) were analysed by the PCR product-length polymorphism method. Haplogroup (Hg) B was identified on the basis of the 9-bp deletion at position 8280 [25–27]. A table of the primers is included in the electronic supplement to this paper (Additional file 2: Table S2). The sex of the Xiaohe individuals was determined by PCR of the sexually dimorphic amelogenin gene [28, 29]. PCR amplifications were performed in 20 μL reactions, as described previously [23].
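The coding-region typing scheme in this section can be read as a simple lookup from diagnostic variant to haplogroup label. The following Python sketch is illustrative only: the table copies the variant/haplogroup pairs listed above, while the function name `haplogroups_supported` and the idea of passing variants as strings such as "10400T" are assumptions made for this example, not part of the published workflow.

```python
# Diagnostic coding-region variants as listed in the text of this section.
# Keys are "position + observed base"; values are the haplogroup labels used there.
DIAGNOSTIC_VARIANTS = {
    "12705C": "R",
    "12308G": "UK",
    "14766C": "HV",
    "7028C": "H",
    "4917G": "R1",
    "10031C": "R11",
    "1888A": "M5",
    "15928A": "M25",
    "11969A": "C4",
    "4833G": "G",
    "10400T": "M",
    "14318C": "C",
    "15607G": "T",
    "5178A": "D",
}


def haplogroups_supported(observed_variants):
    """Return the sorted haplogroup labels whose diagnostic variant was observed.

    Hypothetical helper: variants not in the table are ignored, and no attempt
    is made to resolve nested haplogroups (e.g. C4 within C within M).
    """
    return sorted(
        {DIAGNOSTIC_VARIANTS[v] for v in observed_variants if v in DIAGNOSTIC_VARIANTS}
    )


# Example: an individual typed with 10400T, 14318C and 11969A
print(haplogroups_supported(["10400T", "14318C", "11969A"]))
```

In practice the coding-region result would be cross-checked against the HVRI motif, as the authentication criteria later in the paper describe.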

DNA cloning and sequencing

To investigate potential contamination of the PCR amplicons, DNA amplified from six individuals chosen at random was cloned using the pGEM-T Easy Vector System I (Promega, USA). Eight white clones of each PCR fragment were sequenced using M13 primers. Cycle sequencing was performed as described previously [23], and the sequences analysed using an ABI310 Genetic Analyzer (Applied Biosystems, USA), following the instructions of the manufacturer.

Data analysis

Sequence alignments were performed using ClustalX 1.8 software, followed by manual editing. Published literature and the GenBank database were searched to identify shared sequences. The sequences were subjected to statistical analysis, together with 20 additional sequences previously obtained from the fifth and lowest layer of the Xiaohe cemetery. Haplotype diversity was investigated using DnaSP v5. The results for layers 1–3 were pooled, as the sample was small and the layers had been commingled by grave looters. Networks of four mtDNA haplogroups were constructed with the Network software, using the median-joining method. Multidimensional scaling (MDS) was conducted using Arlequin 3.5 and SPSS 16.0 (USA). Principal Component Analysis (PCA) was performed with SPSS 16.0 (USA), using a haplogroup frequency database of ancient and present-day populations, with 17 different haplogroup categories (Additional file 3: Table S3). Fifteen of these were Hgs A, B, C, D, Z, F, G, N9, HV, U, K, W, X, R and TJ, while a further seven east Eurasian Hgs (E, M7, M8, M9, M10, M11 and M13) were pooled into one group, and an additional four west Eurasian Hgs (I, N1a, N1b and N*) were pooled into a final group.
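For readers unfamiliar with the haplotype diversity statistic (Hd) reported in the Results, the sketch below shows the estimator commonly attributed to Nei, Hd = n/(n − 1) × (1 − Σ p_i²), where n is the number of sequences and p_i the frequency of haplotype i. We assume this is the quantity computed by DnaSP; the function name and the toy input are hypothetical and not taken from the study data.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei-style haplotype diversity for a list of haplotype labels.

    Hd = n / (n - 1) * (1 - sum(p_i ** 2)), with p_i the sample frequency
    of each distinct haplotype. Requires at least two sequences.
    """
    n = len(haplotypes)
    if n < 2:
        raise ValueError("haplotype diversity needs at least two sequences")
    counts = Counter(haplotypes)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1 - sum_sq)


# Toy example (made-up labels): four sequences, three distinct haplotypes
print(round(haplotype_diversity(["h1", "h1", "h2", "h3"]), 4))
```

Values close to 1 mean that almost every sampled individual carries a different haplotype, which is why small, mixed samples can give very high Hd.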


Authentication of results

A total of 42 reproducible mtDNA sequences (345 bp) were obtained from 62 individual sets of human remains, after discarding 20 samples due to failed amplification or lack of reproducibility. Six of the 42 sequences, which matched those of two archaeologists and one laboratory member, were also removed from the study, even though they yielded consistent results through multiple independent extractions. The remaining 36 sequences were judged to be unambiguous and authentic. The following criteria supported the authenticity of the results: (i) an inverse correlation between the size of the PCR amplicons and amplification efficiency (Additional file 4: Table S4); (ii) consistent consensus cloned sequences, although a small number of sites differed from the directly sequenced PCR products, possibly due to random Taq mis-incorporation or DNA damage. Miscoding lesions in clones of PCR products showed that cytosine → thymine changes, characteristic of damaged ancient DNA, were the most frequent changes in the Xiaohe individuals (Additional file 5: Figure S1); (iii) sex determination by molecular and morphological methods gave consistent results (Table 1); (iv) the mtDNA HVRI sequences corresponded to the key coding region SNPs defined by the mtDNA phylogenetic tree [26]; (v) analysis of cattle bones from the Xiaohe site using the human-specific primers did not reveal human DNA, implying that the bones were free of human DNA and that the extractions were done cleanly; (vi) the mtDNA sequences from multiple independent DNA extractions and from different samples (tooth, femur) were consistent (Additional file 6: Table S5). The 36 sequences accepted as genuine bone sequences have been submitted to GenBank, with accession numbers KF436896-KF436931.
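The screening logic described above (discard unreproducible samples, then discard any sequence identical to that of an archaeologist or laboratory member) can be sketched as follows. This is a minimal illustration under stated assumptions: the function name, the sample identifiers and the toy haplotype motifs are invented for the example, and only the three counts in the final check (62, 20 and 6) come from the text.

```python
def screen_sequences(samples, staff_haplotypes):
    """Split samples into accepted, failed and staff-matching groups.

    samples: iterable of (sample_id, haplotype) pairs, where haplotype is None
             when amplification failed or the result was not reproducible.
    staff_haplotypes: set of haplotypes carried by excavators or lab staff.
    """
    accepted, failed, staff_matches = [], [], []
    for sample_id, haplotype in samples:
        if haplotype is None:
            failed.append(sample_id)
        elif haplotype in staff_haplotypes:
            # Conservative rule: drop even reproducible sequences that match staff.
            staff_matches.append(sample_id)
        else:
            accepted.append(sample_id)
    return accepted, failed, staff_matches


# Toy data (hypothetical IDs and motifs, not the study's results)
toy_samples = [
    ("toy-1", "16298-16327"),
    ("toy-2", None),
    ("toy-3", "16223-16362"),
]
print(screen_sequences(toy_samples, {"16223-16362"}))

# Bookkeeping check using the counts reported in the text
assert 62 - 20 - 6 == 36
```

Removing staff-matching sequences is deliberately conservative: some of the discarded sequences may be genuine, but retaining them would leave contamination as an open alternative explanation.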

Table 1 Result for mitochondrial DNA typing

Mitochondrial DNA profiles and haplogroups

The 36 successfully typed individuals yielded 21 distinct mtDNA haplotypes, of which 18 could be assigned to 12 previously defined haplogroups [30–32] by means of HVRI and coding region polymorphisms (Table 1). The haplogroups were the west Eurasian H, K, T, U7, U5a, U2e, the east Eurasian B, C4, C5, D, G2a, and the Indian M5.

The west Eurasian haplogroups of the Xiaohe people were more diverse (Hd = 0.9722 versus Hd = 0.8585), but less abundant (9 individuals versus 26 individuals) than the East Eurasian haplogroups. The predominant lineage was UK, of which four different subhaplogroups were observed: one K, two U7, two U5a, and one U2e. One individual with Hg T and one individual with Hg H were detected. The latter carried the HVRI Cambridge Reference Sequence (CRS), very common in living Europeans [31, 33, 34]. This sequence has also been observed in ancient human remains of Neolithic Europe [35, 36], the Bronze Age in central Asia [37], as well as the Mongolian Altai Mountains [38], and the Iron Age in southern Siberia [39]. Among modern people, the T haplotype observed in Xiaohe is found exclusively in Europeans, with the exception of Iran, and occurs mostly as T2. It has also been observed in human remains of Neolithic Europe [36], the Eneolithic/Bronze Age in the Pontic-Caspian steppe [40], and the Bronze Age in Kazakhstan [37]. No exact match was found for the Xiaohe K haplotype in our database. The network shows that it clusters into one subclade with the 16093 mutation, which is mainly distributed in Europe and Iran (Fig. 3a). Therefore, the K haplotype sequenced in Xiaohe is currently uninformative about population affinity. Two U5a haplotypes were observed in Xiaohe: the basal U5a* (16192T-16256T-16270T) is found broadly in Europe and central Asia, while the derived U5a haplotype (16192T-16256T-16270T-291T) is found exclusively in Europe among modern people. These two sequences have also been found in Neolithic Europe [35, 41, 42]. U5a is a very ancient and important European haplogroup and is thought to have expanded eastward into central Siberia. It has been observed in human remains of the Neolithic in the Baikal region and of the Bronze Age in the Altai and Xinjiang [39, 43, 44].
The U2e sequence observed in Xiaohe did not match any sequence in our database; the closest sequences (differing at one or two nucleotide positions) were mainly found in Europe. Like U5, U2e was an ancient European lineage, and it had spread into Central Eurasia in the Bronze Age [31, 39, 44]. The presence of individuals of Hgs H, T, U5a and U2e in Xiaohe indicates maternal lineages with an ultimate origin in Europe. Hg U7 is absent in many parts of Europe, but its frequency increases to >4 % in the Near East and up to 5 % in Pakistan, reaching almost 10 % in Iranians, and its highest frequency in Gujarat. Haplogroup U7 probably originated in the region between Iran and Indian Gujarat [45–47]. The U7 variant observed in Xiaohe is currently found mostly in Iran, Europe and the Tibetan plateau. In addition, we found one individual with the Indian lineage M5 [48]. Nowadays, the M5 variant observed in this study is found mainly in south and southwest Asia. The presence of hgs U7 and M5 in the Xiaohe people suggests that populations of west/south Asia contributed to the gene pool of the Tarim Basin during the Bronze Age.

Fig. 3
Median-joining networks for mtDNA haplogroups K, C, D and G2a, based on HVS-I sequences spanning np 16050–16391. Circle areas are proportional to haplotype frequency. The length of the lines between nodes is proportional to the number of mutation steps. The diagnostic mutations used to classify the major branches are labeled on the lines. In this and the following panels, the number sign (#) indicates the assumed root of each haplogroup

The most dominant east Eurasian haplogroup in the Xiaohe people was C, found in 18 of the 36 individuals (47 %) and associated with five distinct mtDNA C4 haplotypes and one C5 haplotype. Nine Xiaohe individuals carried the variant 16223-16298-16309-16327 and five carried the variant 16298-16327. The first of these variants, 16223-16298-16309-16327, has to our knowledge not been previously observed in ancient or living populations, while the variant 16298-16327 has only been observed in present-day Siberia, and at low frequencies [49–51]. A variant characterised by substitutions 16223-16298-16327, observed in one Xiaohe individual, is found widely in present-day Eurasia, with the highest frequency in central/eastern Siberia. It has also been detected in a number of ancient individuals: three from Neolithic central Siberia [43], one from northeast Siberia (3600 yBP) [52], six from northeast Europe (3500 yBP) [37], twelve from the Bronze Age West Siberian Plain [53], one from southern Xinjiang (2800–2011 yBP) [54] and four from late Neolithic northwest China [55]. Haplotype 16129-16223-16298-16327 is currently found mainly in northeast, central and south Siberian populations, in Mongolia and in central Asia. It was also found in one ancient Mongolian (2000 yBP) [56]. Haplotype 16093-16129-16223-16298-16311-16327 is probably rare, since it has only been detected previously in four present-day individuals, one in south Siberia, one in Tibet, one in Southeast Asia, and one in China. One Xiaohe individual carried Hg C5 (16223-16288-16298-327), of a variant only observed previously in one individual from southern Siberia and in one from the Tibetan Plateau (Fig. 3b).

The second most frequent east Eurasian haplogroup in the Xiaohe people was D, found in four individuals, with four different variants. The first, 16051-16223-16362, is found mainly in Southeast Asia. The second, 16223-16234-16316-16362, is found throughout the Eurasian continent, including China, Japan, Siberia, and Eastern Europe. The remaining two D haplotypes had no exact match in any of the available databases. Interestingly, hg D has been observed at high frequency in the Hami people, a Bronze Age population of northeast Xinjiang [44]. It has also been observed in Neolithic Chinese and Siberians [43, 55]. In the network tree, some Xiaohe D haplotypes cluster into the East Asian subclade, while the others cluster into the Siberian subclade (Fig. 3c). Therefore, the D haplotype sequenced in Xiaohe is currently uninformative about population affinity. One individual carried G2a, but no matching sequence was found in the database. G2a is relatively abundant in northern China and central Asia, reaching significant levels in Southern Siberia [50]. However, the Xiaohe G2a haplotype clusters into one of the East Asian clades in the network tree (Fig. 3d), indicating close affinities to East Asians. A single individual carried hg B, an important East Asian haplogroup, of a particular variant not previously observed. The presence of haplogroups C4, C5, D, G2a and B in the Xiaohe people indicates close affinities to Siberians and East Asians.

Comparison of the Xiaohe population with ancient and extant populations of Eurasia

In order to characterise the genetic relationship between the Xiaohe population and other ancient and extant Eurasian populations, we conducted a PCA based on mtDNA haplogroup frequencies and an MDS analysis based on genetic distances between sequences. However, as many individuals had identical C4 haplotypes, indicating potential maternal relationships within the population, the frequency of C4 was likely to be overestimated. To account for this, we assumed a scenario of extreme maternal kinship, where identical haplotypes in several individuals of the same layer were only counted once. The PCA plot of the first two components showed that present-day populations largely segregate into three main clusters: Europeans, Siberians, and Central/East Asians (Fig. 4). Europeans and Central/East Asians were separated along the first component axis (23.34 % of the variance), reflecting their longitude. Europeans and Siberians were separated along the second component axis (23.04 % of the variance). Xiaohe maternal lineages were closest to the Xinjiang populations (modern Xinjiang population and ancient Hami people), and second-closest to the central Siberians (Tuvinians). An MDS plot confirmed the genetic affinity with Siberians inferred from the PCA, but showed a large distance from Central/East Asians (Additional file 7: Figure S2).
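The "extreme maternal kinship" correction described above amounts to keeping one record per distinct haplotype within each burial layer before computing haplogroup frequencies. A minimal sketch of that idea follows; the function names, the tuple layout (layer, haplogroup, haplotype) and the toy records are assumptions for illustration, not the study's actual data or code.

```python
from collections import Counter


def collapse_kin(individuals):
    """Keep only the first individual for each (layer, haplotype) pair.

    individuals: iterable of (layer, haplogroup, haplotype) tuples.
    Identical haplotypes in different layers are still counted separately.
    """
    seen = set()
    kept = []
    for layer, haplogroup, haplotype in individuals:
        key = (layer, haplotype)
        if key not in seen:
            seen.add(key)
            kept.append((layer, haplogroup, haplotype))
    return kept


def haplogroup_frequencies(records):
    """Return the relative frequency of each haplogroup among the records."""
    counts = Counter(haplogroup for _, haplogroup, _ in records)
    total = sum(counts.values())
    return {hg: c / total for hg, c in counts.items()}


# Toy records (hypothetical): three identical C4 carriers in one layer
toy = [
    (4, "C4", "16223-16298-16309-16327"),
    (4, "C4", "16223-16298-16309-16327"),
    (4, "C4", "16223-16298-16309-16327"),
    (4, "D", "16051-16223-16362"),
    (3, "C4", "16223-16298-16309-16327"),
]
print(haplogroup_frequencies(toy))                # C4 dominates the raw counts
print(haplogroup_frequencies(collapse_kin(toy)))  # C4 share drops after collapsing
```

This is a deliberately conservative bound: unrelated individuals can share an HVRI haplotype, so the true frequency of a common lineage probably lies between the raw and the collapsed estimate.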

Fig. 4
Principal Component Analysis of mitochondrial haplogroup frequencies. The first two dimensions account for 46.38 % of the total variance. Grey arrows represent haplogroup loading vectors, i.e., the contribution of each haplogroup. Ancient populations included in this study: aXH: Xiaohe cemetery; aCA: Nomads from Kazakhstan (2,100–3,400 yBP); aKur: Siberian Kurgans (1,600–3,800 yBP); aPWC: Scandinavian Pitted-Ware Culture foragers (4,500–5,300 yBP); aLBK: German early Neolithic Linear Pottery Culture population (6,900–7,500 yBP); aNEE: North East European ancient people (3,500–7,500 yBP); aLB: Neolithic Lake Baikal population (6,130–7,140 yBP); aHM: Xinjiang Hami people (4000 yBP); aHB: Chinese Shanxi Hengbei people (3000 yBP); aMG and aLJ: late Neolithic Qijia Culture populations in the Ganqing region of China (4000 yBP); aXN: nomads from Mongolia (2500 yBP). Detailed information on the ancient and modern populations is provided in Additional file 3: Table S3


Our previous analysis of DNA from the deepest layer of burials of the Xiaohe site revealed that the first settlers had European paternal lineages, and maternal lineages of European and central Siberian origin, consistent with the “steppe hypothesis” of the origins of the first inhabitants of the Tarim Basin [23]. In the present study, analysis of the remaining four, more recent burial layers confirmed that the origin of the mitochondrial lineages is more widespread, and we detected west Eurasian lineages H, K, U5, U7, U2e, T, east Eurasian lineages B, C4, C5, D, G2a, and Indian lineage M5. Haplotypes H, K, U5 and T are found mostly in Europe, suggesting genetic affinities with Europe. While the Xiaohe U2e haplotype has not been observed in living populations, hg U2e is thought to have originated in Europe, from where it spread into central Siberia in the Bronze Age [39]. The distribution of these haplogroups overlaps with the regions of the Afanasievo, Andronovo or Yamna cultures, but is remote from the Oxus civilization. These west Eurasian genetic components in the Xiaohe people corroborate the “steppe hypothesis”.

However, layers 1–4 also had individuals with hgs U7 and M5, common in west/south Asian populations today, but rare in Europeans and Siberians. Although the genetic structure of the oasis people in the Bronze Age is unclear, archaeological evidence indicates that settled populations of the oasis civilization in central Asia descended from farmers from the southwest [17]. These ancient central Asians had been in contact with south Asians and likely received a genetic contribution from them. Considering the archaeological materials and the environmental similarities between central Asia and the Tarim Basin, the hgs U7 and M5 observed in the Xiaohe people more likely originated from the oasis peoples than directly from west/south Asians. This suggests that populations from the oasis may have made a later contribution to the gene pool of the Xiaohe people, giving some credence to the “oasis hypothesis”. The later Xiaohe people (layers 1–4) carried diverse east Asian maternal lineages, including the predominant C4, as well as C5, which has a similar geographical distribution to C4, suggesting links with Siberia, especially central/south Siberian populations. Although hgs B, D and G2a are common in East Asians and Mongolians as well as Siberians, there was no archaeological or anthropological evidence in the Xiaohe cemetery for links with East Asia, apart from broomcorn millet (P. miliaceum). However, hgs C and D have also been observed in Bronze Age human remains from North Xinjiang (Hami), a place where culture and human features appear to indicate a blend of both east and west. DNA analysis showed that the Hami people had close affinities with Neolithic people in the Ganqing region of China [44]. A recent archaeobotanical analysis suggested that East Asian domesticated broomcorn millet was likely introduced into Central Eurasia from the Ganqing region via North Xinjiang in the middle of the third millennium BC.
Therefore, some eastern components in the later Xiaohe people may have derived from North Xinjiang and have an ultimate origin in East Asia rather than central/southern Siberia, which is still consistent with the “steppe hypothesis”. This is indicated by the close relationship of the Xiaohe population with the populations of Xinjiang in the PCA graph (Fig. 4).

Haplotype diversity in the Xiaohe people increased from the earlier to the later layers (fifth layer Hd = 0.7381, fourth layer Hd = 0.9004, layers 1–3 Hd = 0.9890), suggesting multiple population incursions into the Tarim Basin after its initial settlement. People carrying European maternal lineages may have spread east into south Siberia, where they mingled with local populations and eventually spread south into Xinjiang via the Ertix River. However, ancient DNA analyses indicate that the west Eurasian lineages observed in ancient south Siberia were associated with the eastward spread of Europeans of the Afanasievo culture [39]. This suggests that the European components could have reached north Xinjiang later, via the Kazakh steppe northwest of the Tarim Basin. Interestingly, the cattle excavated from the Xiaohe cemetery carried mainly lineage T3, typical of European cattle [57]. These diverse lines of evidence support the “steppe hypothesis”. In contrast, people bearing the south/west Asian components could have reached the Tarim Basin through the Pamirs, moving eastward along the south or north edges of the Tarim Basin. A recent study showed that agricultural populations had contact with nearby mobile pastoralists at the beginning of the second millennium BC in Central Asia [58], indicating that genetic components of agriculturalists might also have introgressed into pastoralist populations. This is supported by the finding of one Indian haplogroup in ancient Kazakhstan [37]. Therefore, people bearing the south/west Asian components could have first married into pastoralist populations, and reached North Xinjiang through the Kazakh steppe following the movement of pastoralist populations, then spread from north Xinjiang southward into the Tarim Basin across the Tianshan Mountains, and intermarried with the earlier inhabitants of the region, giving rise to the later, admixed Xiaohe community.
Given that the south/west Asian components are relatively minor in the Xiaohe population, it is likely that nomadic herders from the northern steppe had a greater impact on the eastern Tarim Basin than the Central Asian oasis farmers.

The archaeological evidence for woolen textiles and the medicinal plant Ephedra in the earliest Xiaohe layer and at the Gumugou site indicates that the oasis culture had reached the Tarim Basin in the early Bronze Age. It is well known that Ephedra was used by oasis farmers, whereas it does not grow in the Russo-Kazakh steppe, nor is it associated with the Afanasievo or Andronovo cultures [5, 7]. Furthermore, the wheat excavated from Xiaohe was hexaploid bread wheat, a cereal grain cultivated originally in the Near East [59]. Therefore, it is possible that the oasis route was significant in the peopling of Xinjiang in the early Bronze Age, at least in northern or western Xinjiang. This is supported by the observation of Indian haplogroup M25 in one ancient individual from the late Neolithic Ganqing region (data unpublished). The groups reaching the Tarim Basin through the oasis route may have interacted culturally with earlier populations from the steppe, with limited gene flow, resulting in a small genetic signal of the oasis agriculturalists in the Xiaohe community.


Our data indicate multiple population influences in the Tarim Basin during 4000–3500 yBP, consistent mainly with the “steppe hypothesis”, but with elements of the “oasis hypothesis”. At the same time, we cannot exclude the possibility that East Asians had an indirect impact on the Tarim Basin in the Bronze Age.



PCR: Polymerase chain reaction

mtDNA: Mitochondrial DNA

SNP: Single nucleotide polymorphism

CRS: Cambridge reference sequence

MDS: Multidimensional scaling

PCA: Principal component analysis


  1. Yao YG, Kong QP, Wang CY, Zhu CL, Zhang YP. Different matrilineal contributions to genetic structure of ethnic groups in the Silk Road region in China. Mol Biol Evol. 2004;21:2265–80. doi:10.1093/molbev/msh238.

  2. Comas D, Calafell F, Mateu E, Perez-Lezaun A, Bosch E, Martinez-Arias R, et al. Trading genes along the Silk Road: mtDNA sequences and the origin of Central Asian populations. Am J Hum Genet. 1998;63:1824–38. doi:10.1086/302133.

  3. Cui Y, Li C, Gao S, Xie C, Zhou H. Early Eurasian migration traces in the Tarim Basin revealed by mtDNA polymorphisms. Am J Phys Anthropol. 2010;142:558–64. doi:10.1002/ajpa.21257.

  4. Mair VH. Genes, Geography, and Glottochronology: The Tarim Basin during Late Prehistory and History. Washington, D.C: Institute for the Study of Man; 2005.

  5. Hemphill BE, Mallory JP. Horse-mounted invaders from the Russo-Kazakh steppe or agricultural colonists from western Central Asia? A craniometric investigation of the Bronze Age settlement of Xinjiang. Am J Phys Anthropol. 2004;124:199–222. doi:10.1002/ajpa.10354.

  6. Romgard J. Questions of Ancient Human Settlements in Xinjiang and the Early Silk Road Trade. In: Mair VH, editor. Sino-Platonic Papers. Philadelphia, PA: University of Pennsylvania; 2008.

  7. Barber EJW. Bronze Age Cloth and Clothing of the Tarim Basin: The Kroran(Loulan) and Qumul(Hami) Evidence, The Bronze Age and Early Iron Age Peoples of Eastern Central Asia. Washington, D.C: Institute for the Study of Man in collaboration with University of Pennsylvania Museum Publications; 1998. p. 647–55.

  8. Han KX. The Physical Anthropology of the Ancient Populations of the Tarim Basin and Surrounding Areas, The Bronze Age and Early Iron Age Peoples of Eastern Central Asia. Washington D.C: Institute for the Study of Man in collaboration with University of Pennsylvania Museum Publications; 1998. p. 558–70.

  9. Mallory JP, Mair VH. The Tarim Mummies: Ancient China and the Mystery of the Earliest Peoples from the West. London: Thames and Hudson; 2000.

  10. Cui YQ, Gao SZ, Xie CZ, Zhang QC, Wang HJ, Zhu H, et al. Analysis of the matrilineal genetic structure of population in the early Iron Age from Tarim Basin, Xinjiang, China. Chinese Sci Bull. 2009;54:3916–23. doi:10.1007/s11434-009-0647-8.

  11. Han KX. Physical Anthropological Studies on the Racial Affinities of the Inhabitants of Ancient Xinjiang, The Ancient Corpses of Xinjiang: the Peoples of Ancient Xinjiang and their Culture. Urumchi: Xinjiang People’s Publishing House Wang BH; 2001. p. 224–41.

  12. Kuzmina EE. Cultural Connections of the Tarim Basin People and Pastoralists of the Asian Steppes in the Bronze Age, The Bronze Age and Early Iron Age Peoples of Eastern Central Asia. Washington D.C: The Institute for the Study of Man in collaboration with University of Pennsylvania Museum Publications; 1998. p. 63–93.

  13. Svyatko SV, Mallory J, Murphy E, Polyakov AV, Reimer P, Schulting R. New radiocarbon dates and a review of the chronology of prehistoric populations from the Minusinsk Basin, Southern Siberia, Russia. Radiocarbon. 2009;51:243–74.

  14. Anthony DW. The Horse, the Wheel, and Language: How Bronze-Age Riders from the Eurasian Steppes Shaped the Modern World. Princeton: Princeton University Press; 2007.

  15. Thornton CP, Schurr TG. Gene, language, and culture: an example from the Tarim Basin. Oxford J Archeol. 2004;23:83–106. doi:10.1111/j.1468-0092.2004.00203.x.

  16. Chen KT, Hiebert FT. The late prehistory of Xinjiang in relation to its neighbors. J World Prehistory. 1995;9:243–300. doi:10.1007/BF02221840.

  17. Hiebert FT. Origins of the Bronze Age Oasis Civilization in Central Asia. MA: Peabody Museum of Archaeology and Ethnology, Harvard University; 1994.

  18. Cui YQ, Xu Y, Yang YD, Xie CZ, Zhu H, Zhou H. Mitochondrial DNA polymorphism analysis of district of Lubunour at the Bronze Age in Xinjiang. Journal of Jilin University (in Chinese). 2004;30:650–2. doi:10.3969/j.issn.1671-587X.2004.04.055.

  19. Mair VH. The Rediscovery and Complete Excavation of Ördek’s Necropolis. Washington, D.C: University of Pennsylvania, ETATS-UNIS; 2006.

  20. Abuduresule I, Li WY, Hu XJ. A brief excavation report on Xiaohe graveyard located in Luobupo, Xinjiang Autonomous Region. Cultural Relics. 2007;10:4–42.

  21. Li WY, Abuduresule I, Liu YS. Big discovery of Xiaohe cemetery. Natl Geogr. 2007;8:152–63.

  22. Flad R, Li SC, Wu XH, Zhao ZJ. Early wheat in China: results from new studies at Donghuishan in the Hexi Corridor. The Holocene. 2010;20:955–65. doi:10.1177/0959683609358914.

  23. Li C, Li H, Cui Y, Xie C, Cai D, Li W, et al. Evidence that a West–east admixed population lived in the Tarim Basin as early as the early Bronze Age. BMC Biol. 2010;8:15. doi:10.1186/1741-7007-8-15.

  24. Gilbert MT, Bandelt HJ, Hofreiter M, Barnes I. Assessing ancient DNA studies. Trends Ecol Evol. 2005;20:541–4. doi:10.1016/j.tree.2005.07.005.

  25. Malyarchuk B, Grzybowski T, Derenko M, Perkova M, Vanecek T, Lazur J, et al. Mitochondrial DNA phylogeny in Eastern and Western Slavs. Mol Biol Evol. 2008;25:1651–8. doi:10.1093/molbev/msn114.

  26. Van Oven M, Kayser M. Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation. Hum Mutat. 2009;30:386–94. doi:10.1002/humu.20921.

  27. Behar DM, Van Oven M, Rosset S, Metspalu M, Loogvali EL, Silva NM, et al. A “Copernican” reassessment of the human mitochondrial DNA tree from its root. Am J Hum Genet. 2012;90:675–84. doi:10.1016/j.ajhg.2012.03.002.

  28. Stone AC, Milner GR, Paabo S, Stoneking M. Sex determination of ancient human skeletons using DNA. Am J Phys Anthropol. 1996;99:231–8. doi:10.1002/(SICI)1096-8644(199602)99:2<231::AID-AJPA1>3.0.CO;2-1.

  29. Haas-Rochholz H, Weiler G. Additional primer sets for an amelogenin gene PCR-based DNA-sex test. Int J Legal Med. 1997;110:312–5. doi:10.1007/s004140050094.

  30. Kivisild T, Tolk HV, Parik J, Wang Y, Papiha SS, Bandelt HJ, et al. The emerging limbs and twigs of the East Asian mtDNA tree. Mol Biol Evol. 2002;19:1737–51. doi:10.1093/oxfordjournals.molbev.a004232.

  31. Richards M, Macaulay V, Hickey E, Vega E, Sykes B, Guida V, et al. Tracing European founder lineages in the Near Eastern mtDNA pool. Am J Hum Genet. 2000;67:1251–76. doi:10.1016/S0002-9297(07)62954-1.

  32. Derenko M, Malyarchuk B, Grzybowski T, Denisova G, Dambueva I, Perkova M, et al. Phylogeographic analysis of mitochondrial DNA in northern Asian populations. Am J Hum Genet. 2007;81:1025–41. doi:10.1086/522933.

  33. Torroni A, Richards M, Macaulay V, Forster P, Villems R, Norby S, et al. mtDNA haplogroups and frequency patterns in Europe. Am J Hum Genet. 2000;66:1173–7. doi:10.1086/302789.

  34. Dubut V, Chollet L, Murail P, Cartault F, Beraud-Colomb E, Serre M, et al. mtDNA polymorphisms in five French groups: importance of regional sampling. Eur J Hum Genet. 2004;12:293–300. doi:10.1038/sj.ejhg.5201145.

  35. Sarkissian CD, Balanovsky O, Brandt G, Khartanovich V, Buzhilova A, Koshel S, et al. Ancient DNA reveals prehistoric gene-flow from Siberia in the complex human population history of North East Europe. PLoS Genet. 2013;9, e1003296. doi:10.1371/journal.pgen.1003296.

  36. Haak W, Balanovsky O, Sanchez JJ, Koshel S, Zaporozhchenko V, Adler CJ, et al. Ancient DNA from European early neolithic farmers reveals their near eastern affinities. PLoS Biol. 2010;8(11), e1000536. doi:10.1371/journal.pbio.1000536.

  37. Lalueza-Fox C, Sampietro ML, Gilbert MT, Castri L, Facchini F, Pettener D, et al. Unravelling migrations in the steppe: mitochondrial DNA sequences from ancient Central Asians. Proc Biol Sci. 2004;271:941–7. doi:10.1098/rspb.2004.2698.

  38. Hollard C, Keyser C, Giscard PH, Tsagaan T, Bayarkhuu N, Bemmann J, et al. Strong genetic admixture in the Altai at the Middle Bronze Age revealed by uniparental and ancestry informative markers. Forensic Sci Int Genet. 2014;12:199–207. doi:10.1016/j.fsigen.

  39. Keyser C, Bouakaze C, Crubezy E, Nikolaev VG, Montagnon D, Reis T, et al. Ancient DNA provides new insights into the history of south Siberian Kurgan people. Hum Genet. 2009;126:395–410. doi:10.1007/s00439-009-0683-0.

  40. Wilde S, Timpson A, Kirsanow K, Kaiser E, Kayser M, Unterländer M, et al. Direct evidence for positive selection of skin, hair, and eye pigmentation in Europeans during the last 5,000 y. Proc Natl Acad Sci. 2014;111:4832–7. doi:10.1073/pnas.1316513111.

  41. Brandt G, Haak W, Adler CJ, Roth C, Szécsényi-Nagy A, Karimnia S, et al. Ancient DNA reveals key stages in the formation of central European mitochondrial genetic diversity. Science. 2013;342(6155):257–61. doi:10.1126/science.1241844.

  42. Bramanti B, Thomas MG, Haak W, Unterlaender M, Jores P, Tambets K, et al. Genetic discontinuity between local hunter-gatherers and central Europe’s first farmers. Science. 2009;326:137–40. doi:10.1126/science.1176869.

  43. Mooder KP, Schurr TG, Bamforth FJ, Bazaliiski VI, Savel’ev NA. Population affinities of Neolithic Siberians: a snapshot from prehistoric Lake Baikal. Am J Phys Anthropol. 2006;129:349–61. doi:10.1002/ajpa.20247.

  44. Gao SZ, Zhang Y, Wei D, Li HJ, Zhao YB, Cui YQ, et al. Ancient DNA reveals a migration of the ancient Di-qiang populations into Xinjiang as early as the early Bronze Age. Am J Phys Anthropol. 2015;157:71–80. doi:10.1002/ajpa.22690.

  45. Metspalu M, Kivisild T, Metspalu E, Parik J, Hudjashov G, Kaldma K, et al. Most of the extant mtDNA boundaries in South and Southwest Asia were likely shaped during the initial settlement of Eurasia by anatomically modern humans. BMC Genet. 2004;5:26. doi:10.1186/1471-2156-5-26.

  46. Kivisild T, Rootsi S, Metspalu M, Mastana S, Kaldma K, Parik J, et al. The genetic heritage of the earliest settlers persists both in Indian tribal and caste populations. Am J Hum Genet. 2003;72:313–32. doi:10.1086/346068.

  47. Abu-Amero KK, Larruga JM, Cabrera VM, Gonzalez AM. Mitochondrial DNA structure in the Arabian Peninsula. BMC Evol Biol. 2008;8:45. doi:10.1186/1471-2148-8-45.

  48. Thangaraj K, Chaubey G, Singh VK, Vanniarajan A, Thanseem I, Reddy AG, et al. In situ origin of deep rooting lineages of mitochondrial Macrohaplogroup ‘M’ in India. BMC Genomics. 2006;7:151. doi:10.1186/1471-2164-7-151.

  49. Derenko M, Malyarchuk B, Grzybowski T, Denisova G, Rogalla U, Perkova M, et al. Origin and post-glacial dispersal of mitochondrial DNA haplogroups C and D in northern Asia. PLoS ONE. 2010;5, e15214. doi:10.1371/journal.pone.0015214.

  50. Starikovskaya EB, Sukernik RI, Derbeneva OA, Volodko NV, Ruiz-Pesini E, Torroni A, et al. Mitochondrial DNA diversity in indigenous populations of the southern extent of Siberia, and the origins of Native American haplogroups. Ann Hum Genet. 2005;69:67–89. doi:10.1046/j.1529-8817.2003.00127.x.

  51. Pimenoff VN, Comas D, Palo JU, Vershubsky G, Kozlov A, Sajantila A. Northwest Siberian Khanty and Mansi in the junction of West and East Eurasian gene pools as revealed by uniparental markers. Eur J Hum Genet. 2008;16:1254–64. doi:10.1038/ejhg.2008.101.

  52. Ricaut FX, Fedoseeva A, Keyser-Tracqui C, Crubezy E, Ludes B. Ancient DNA analysis of human Neolithic remains found in northeastern Siberia. Am J Phys Anthropol. 2005;126:458–62. doi:10.1002/ajpa.20257.

  53. Molodin VI, Pilipenko AS, Romaschenko AG, Zhuravlev AA, Trapezov RO, Chikisheva TA, et al. Human Migrations in the Southern Region of the West Siberian Plain during the Bronze Age: Archaeological, Palaeogenetic and Anthropological Data. In: Kaiser E, Burger J, Schier W, editors. Population Dynamics in Prehistory and Early History. 2012. p. 93–112.

  54. Zhang F, Xu Z, Tan J, Sun Y, Xu B, Li S, et al. Prehistorical East–west admixture of maternal lineages in a 2,500-year-old population in Xinjiang. Am J Phys Anthropol. 2010;142:314–20. doi:10.1002/ajpa.21237.

  55. Gao SZ, Yang YD, Xu Y, Zhang QC, Zhu H, Zhou H. Tracing the genetic history of the Chinese people: mitochondrial DNA analysis of a Neolithic population from the Lajia site. Am J Phys Anthropol. 2007;133:1128–36. doi:10.1002/ajpa.20623.

  56. Keyser-Tracqui C, Crubézy E, Ludes B. Nuclear and mitochondrial DNA analysis of a 2,000-year-old necropolis in the Egyin Gol Valley of Mongolia. Am J Hum Genet. 2003;73:247–60.

  57. Cai D, Sun Y, Tang Z, Hu S, Li W, Zhao X, et al. The origins of Chinese domestic cattle as revealed by Ancient DNA analysis. J Archaeol Sci. 2014;41:423–34. doi:10.1016/j.jas.2013.09.003.

  58. Spengler RN, Cerasetti B, Tengberg M, Cattani M, Rouse LM. Agriculturalists and pastoralists: bronze age economy of the Murghab alluvial fan, southern Central Asia. Veget Hist Archaeobot. 2014;23:805–20. doi:10.1007/s00334-014-0448-0.

  59. Li C, Lister DL, Li H, Xu Y, Cui Y, Bower MA, et al. Ancient DNA analysis of desiccated wheat grains excavated from a bronze age cemetery in Xinjiang. J Archaeol Sci. 2011;38:115–8. doi:10.1016/j.jas.2010.08.016.



This work was supported by the National Natural Science Foundation of China, grant numbers 31371266, 31200935 and J1210007. We thank Xinjiang Cultural Relics and the Archaeology Institute for providing the human remains. We certify that all financial and material support for this research and work is clearly identified in the manuscript. The data set supporting the results of this article is available in the GenBank repository, with accession numbers KF436896-KF436931.

Author information


Corresponding author

Correspondence to Hui Zhou.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

CXL and CN contributed equally to this work; they performed the molecular genetic studies and data analysis and wrote the manuscript. EK helped to draft the manuscript. LHJ participated in performing experiments. YBZ participated in the statistical analysis. WYL and IA provided materials and background documents. Zhu H participated in conceiving and designing the study. Zhou H designed the study and wrote the manuscript. All authors read and approved the final manuscript.

Additional files

Additional file 1: Table S1.

Archaeological information for 92 Xiaohe individuals.

Additional file 2: Table S2.

Primers used in this study.

Additional file 3: Table S3.

Ancient and present-day populations used in the principal component analysis.

Additional file 4: Table S4.

The mtDNA yield of three Xiaohe individuals.

Additional file 5: Figure S1.

Alignment of cloned mtDNA sequences from six samples. The primer sequences are shadowed.

Additional file 6: Table S5.

Results of mtDNA HVR-1 multiplex sequencing and the SNP typing.

Additional file 7: Figure S2.

Multidimensional scaling plot of genetic distances calculated for mtDNA sequences (16050–16391). Population abbreviations are consistent with Fig. 4.

Rights and permissions

Open Access  This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

The Creative Commons Public Domain Dedication waiver applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article

Li, C., Ning, C., Hagelberg, E. et al. Analysis of ancient human mitochondrial DNA from the Xiaohe cemetery: insights into prehistoric population movements in the Tarim Basin, China. BMC Genet 16, 78 (2015).
