Supplementary Text
Contents
Part 1, Filters for variants and selection of calculation
points
Part 2, Discussion about history of modern
populations
To obtain a confident dataset for age estimation, we
applied a seriesof strict filters on the original variants file,
including: 1, restrict to confident region (~8.439M
bp) that are widely used by Poznik et
al.1, Karmin et al.2 and Adamov et
al.3; 2, restrict to variants that are single nucleotide
polymorphisms (Y-SNP); 3, remove all position with call rate
<90% on all samples; 4, remove position with
heterozygosis call rate >5% on all samples; 5, base
coverage ≥3,
base quality >20, and distance between SNPs
>10 bp; 6, remove all recurrent or triadic mutations even
though they are confident on the currently known phylogenetic
tree. Back imputation was also done on Y-SNP markers on the
known phylogenetic tree, so as to infer missing genotype on some
samples.
After applying these filters, we found that the
accumulated mutations of samples varied significantly among
different haplogroup (see the Maximum Parsimony tree in Figure S1).
This means that the mutation rates on each haplogroup are different
from each other after they split from the common ancestor. The
haplogroup D-M174, C-M130 and NO-M214 had been separated from each
other for more than 73k years. Therefore, we concluded that a
unified mutation rate is unsuitable for the age calculation of a
dataset that contain samples of all three haplogroups. We used the
age for haplogroup CT-M168 (71,760 years, 95% confidence interval
[CI] = 69,777–73,799)2 as the calibration point for
estimating the coalescent times of all studied samples. All of four
major paternal haplogroups in East Eurasian (D-M174, C-M130,
NO-M214, and Q-M242) are sub-branches of CT-M168. Using the age of
CT-M168 as the calibration point will resolve the issue caused by
different mutation rate on its sub-branches.
On the other hand, as indicated by the study of
Icelandic males, the genealogical mutation rate of Y-chromosome
(0.871x10-10 mutations per position per year)4 is slightly higher than the
evolutionary mutation rate (0.74 x10-10 mutations per position per
year) calculated by ancient DNA of Ust’Ishim
male.2 This means that more mutations
will be accumulated in the recent historical age. Thus, we also use
the splitting time (470±20 years ago) of samples from Aisin Gioro
family as a recent calibration point.5
In this section, we discussed
about the founding history of Manghit, Keneges tribes in moder
Nogay, Uzbeks and Karakalpak populations, Dulat, Uysun, and Kerey
in modern Kazakhs, and Hazara population is Afghanistan. Also, we
discussed their connection with ancient Mongol Niru’un clan or
ordinary ancient Mongols.
According
to Rashid Al-din,6 all the Mongol people were divided
into two sub-tribes. One was the Darlakin Mongol tribe and the
other was the Niru’un Mongol tribe. The Darlakin Mongol referred to
the commonality of the Mongol people, while the Niru’un Mongol
(means “The pure Mongol”) was believed to be the descendants of
Alan Quo’a. Genghis Khan belonged to Kiyan clan, one clan of
Niru’un tribe. Combining the data from published researches, C3*-ST
were found in several modern populations who can traced back to
Mongol Niru’un clan.
Manghit
Manghit
was also called Manghuds and Mangqut. According to history records,
Manghit was derived from the Niru’un Mongols.6,7,8 They moved westward together with Jochi’s armies and
lived in the Golden Horde ( the Mongol empire in Central Asia and
Northeast of Europe, 1219 A.D. - 1502 A.D.). They extensively involved in the
political activities of the Khanates.6,8 At the beginning, they followed
Nogai Khan (d.1299) to establish their own semi-independent horde
from the Golden Horde in Sarai. Then, they supported their own
commander Edigu (1352-1419) to found Nogai Horde or Manghit Horde
in the 14th-15th century.8 Their descendants called
themselves Nogay and lived in the plains of northern Caucasus and
the Crimea since 17th century. Another group of
them moved
southward as a part of Uzbeks and finally
established the Manghit Dynasty to rule
the Emirate of
Bukhara in Transoxania in
1785.9 Descendants of them became parts
of modern Uzbeks
and Karakalpakpopulation.10
Most of the C3 samples in
Nogay belong to C3*-ST.11,12 In addition, the only one sample of Manghit tribe in
Uzbekstan reported by
Sabitov et.al13 also belongs to the above lineage.
Among Qarasyraq clan of Manghit tribe reported by Chaix et.al
2007,14 11 out of 14 haplotypes process
the same Y-STR profile as C3*-ST. These data indicated that they
shared a common ancestor. Nogay population was list as Nogaits,
population No. 26 in Supplementary Table S1. Manghit tribe was
included in Karakalpaks, population No. 28 in Supplementary Table
S1.
Keneges
According
to history records, Keneges was also
derived from the Niru’un
Mongol. As one of four thousand “original” Mongol troops that
Genghis Khan bequeathed as auxiliary of
Jochi, Keneges (Leaded by Кутаном
Кенегесом) came in Jochi Ulus (The Golden Horde) in the early 13th
century.6,7,8 After
the collapse of The Golden Horde,
Manghit and Keneges moved southward as a part of Uzbeks and then
became a part of modern Uzbeks and Karakalpak
population.10,15
All the three C3
samples in Keneges reported by
Sabitov et.al13 belong to C3*-ST. Another sample was predicted as G2a-P15
according to its special Y-STR profile. G2a is now predominant in
Caucasian populations16and hence can be considered as indigenous mixture
after Keneges’ settlement in the Golden Horde. Keneges tribe was
included in Karakalpaks (No. 28) in Supplementary Table
S1.
Dulat
Dulat, aka
Duqlat or Dughlat, were also derived from the Niru’un
Mongols.6,7,15 Dulat was one of four thousand
“original” Mongol troops that Genghis Khan bequeathed as auxiliary
of Chagatai and eventually settled in
the area comprising the Ulus
of Chagatai Khan.6 During the last age of
Moghulistan, which also called the Eastern Chagatai Khanate, many
Mongol-origin tribes joined
the new-born Kazakh Khanate, like Husin, Jalair and Duqlat. Later,
they became Uysun,
Zhalair and Dulat tribe as a part of the Great Jüz (aka
Senior Juz) in modern Kazakh populations.15,17
Within four C3 samples in Dulat tribe, three of them
belong to C3*-ST.18 High frequency of C3*-ST were also
found in other three sampling places within the area of the Great
Jüz in southeast part of Kazakhstan (Population
No.32~33, Supplementary Table S1). This lineage was also dominant
in our Kazakh samples from Alban tribe at Zhaosu County, one tribe
of the Great
Jüz (Population No. 223,
Supplementary Table S1).
Uysun
and Kerey in Kazakhs
Highest
frequencies of C3*-ST were observed in both Southeast Kazakhstan
(the Great Jüz) and Kazakh-Kerey (see all Kazakh
in Supplementary Table S1). Kazakh Khanate was established by a
group of population that separated from Uzbek Horde (late period of
the Golden Horde). We found that C3*-ST is also dominant part of C3
in Uzbeks (Population No.146, Supplementary Table S1). After the
joining of several tribes from Eastern Chagatai Khanate, they were
called the Great Jüz (also the Uysun
Jüz).15,17 Regarding the origin of Kerey
tribe in Kazakh, several genealogical legends about conflict with
each other and give no well-resolved conclusion.19,20One
legend said that that Kerey-Ashmaily is
the descendant of forefather Kerey and Kerey-Abakh is the
descendant of the Great Jüz. The other legend said Kerey-Ashmaily originated
from one of the three sons of the
mythical forefather of Kazakhs. And Kerey-Abakh
was derived from Kerey-Ashmaily. In this study, the most frequent
haplotype in Kerey-Abkh (C3*-ST-Kazakh clade), has only one step
mutation from the central haplotype of C3*-ST. So it is possible
that C3*-ST-Kazakh clade was derived
from the C3*-
ST in either Kerey-Ashmaily or the Great
Jüz. More evidences are needed to clarify these
two hypotheses about the origin of Kerey-Abakh.
Hazara
According
to history records, the Hazara are the descendants of Genghis
Khan’s soldiers, not male-line descendants of Genghis Khan himself.
In Zerjal et al21, Hazara people were considered as direct
descendants of Genghis Khan and hence became a strong evidence for
their conclusion. However, the original material cited by
Zerjal et al21 described that the Hazara were
derived from ten military
detachments sent by Genghis
Khan.22According to the available history records, those
military detachments, 20,000 soldiers totally, were only ordinary
people of Mongol tribe. There is no evidence that they were direct
descendants of Genghis Khan, whose descendants had
been well-documented at that
era.6,7 Four sets of samples of Hazara
population were list in Supplementary Table S1.
Heritage of Niru’un Mongol
According
to history records, Manghit,
Keneges and Dulat can be traced back clearly
to Niru’un Mongol. At the time of Genghis Khan, the population of
Niru’un Mongol (the Pure Mongols) had multiplied
and they divided into more than twenty
tribes (Figure S1). Before and after the foundation of the Mongol
Empire, people of Niru’un Mongol were the core components of the
Khanate’s upper classes and provided most of soldiers and generals
of army. Participating in the conquest of Genghis Khan, they
scattered and settled in all regions comprising the empire from
East Asia to Northeast Europe. As discussed above, Manghit,
Keneges, Dulat, and Hazara are all descendants of the armies sent
to different regions of the Mongol Empire by Genghis
Khan.
References
1. Poznik
GD, Xue Y, Mendez FL et al: Punctuated
bursts in human male demography inferred from 1,244 worldwide
Y-chromosome sequences. Nature
genetics 2016; 48: 593-599.
2. Karmin
M, Saag L, Vicente M et al: A recent
bottleneck of Y chromosome diversity coincides with a global change
in culture. Genome
research 2015; 25: 459-466.
3. Adamov
D, Guryanov V, Karzhavin S, Tagankin V, Urasin V: Defining a New
Rate Constant for Y-Chromosome SNPs based on Full Sequencing
Data. The Russian Journal of Genetic
Genealogy 2015; 7: 68-89.
4. Helgason
A, Einarsson AW, Guethmundsdottir VB et al:
The Y-chromosome point mutation rate in
humans. Nature
genetics 2015; 47: 453-457.
5. Wei
LH, Yan S, Yu G et al: Genetic trail for
the early migrations of Aisin Gioro, the imperial house of the Qing
dynasty. Journal of human
genetics 2016.
6. Al-din.
R, Yu. D, Zhou J: Jámi'u-t
Tawáríkh(Shiji). Beijing:
Commercial Press, 1997.
7. Onon
U: The Secret History of the Mongols: The Life
and Times of Chinggis Khan, Ill edn. London and New York:
Routledge, 2001.
8. Греков.
БД, Якубовский. АЮ, DajunYu: The rise and fall
of the Golden Horde. Beijing: Commercial Press,
1985.
9. Soucek
S: A History of Inner Asia. Cambridge:
Cambridge University Press, 2000.
10. Adle.
C, Habib. I, Baĭpakov
KM: Development in contrast: from the sixteenth
to the mid-nineteenth century. Paris: UNESCO,
2003.
11. Zakharov
IA: A search for a "Genghis Khan
chromosome". Genetika 2010; 46: 1276-1277.
12. Marchani
EE, Watkins WS, Bulayeva K, Harpending HC, Jorde LB: Culture
creates genetic structure in the Caucasus: autosomal,
mitochondrial, and Y-chromosomal variation in
Daghestan. BMC
Genet 2008; 9: 47.
13. Жаксылык
Сабитов БД: Гаплогруппы и гаплоттипы 50-ти
каракалпаков. The Russian Journal of Genetic
Genealogy 2012; 4: 83-90.
14. Chaix
R, Quintana-Murci L, Hegay T et al: From
social to genetic structures in central
Asia. Curr
Biol 2007; 17: 43-48.
15. Bennigsen.
A, Wimbush SE: Muslims of the Soviet Empire: A
Guide. London: C. Hurst & Co. Publishers,
1985.
16. Nasidze
I, Ling EY, Quinque D et al: Mitochondrial
DNA and Y-chromosome variation in the
caucasus. Annals of human
genetics 2004; 68: 205-221.
17. Olcott
MB: The Kazakhs. Stanford: Hoover Press,
1995.
18. Yerlan
Turuspekov ZS, B. Daulet, M. Sadykov, O. Khalidullin: The
Kazakhstan DNA project hits first hundred Y-profiles for ethnic
Kazakhs. The Russian Journal of Genetic
Genealogy 2011; 2: 69-84.
19. Janabul
H: Abakh Kerey. Ili: People's press of Ili
1993.
20. Yancheng
Zhou LG: Female tribal leader in history of
Kazakhs. Research of Northwest
Minorities 1998; 1: 100-105.
21. Zerjal
T, Xue Y, Bertorelle G et al: The genetic
legacy of the Mongols. American journal of
human
genetics 2003; 72: 717-721.
22. Bellew
H: The races of Afghanistan. New Delhi:
Asian Educational Services, 1880.
Supplementary Table S5, The summary of depth and
coverage of sequenced samples on the confident region (8.439M bp)
of Y-chromosome.
ID
|
Mean
depth
|
Region with reads (M bp)
|
coverage
|
FD-DX15
|
8.89477
|
7836236
|
0.928581044
|
HLB-024
|
4.40302
|
7473128
|
0.885553345
|
HLB-035
|
3.63212
|
7302213
|
0.865300199
|
HLB-116
|
8.85246
|
7475167
|
0.885794963
|
HLB-048
|
24.1957
|
7656492
|
0.907281676
|
HLB-049
|
3.87101
|
7335101
|
0.869197373
|
HLB-172
|
5.23753
|
7595582
|
0.900063942
|
HLB-160
|
15.243
|
7685332
|
0.910699169
|
HLB-095
|
3.99257
|
7417313
|
0.878939359
|
HLB-179
|
10.5409
|
7256946
|
0.859936134
|
HLB-192
|
16.3225
|
7720845
|
0.914907401
|
FD-Kaz34
|
13.979
|
8092268
|
0.958920414
|
FD-Kaz69
|
9.41822
|
7935665
|
0.940363216
|
FD-Kaz03
|
11.5085
|
7920125
|
0.938521752
|
FD-KIR29
|
9.75178
|
7799220
|
0.924194709
|
FD-KIR80
|
17.8018
|
7945923
|
0.941578772
|
YCH509
|
10.6394
|
7890542
|
0.935016215
|
YCH1981
|
15.4821
|
7628674
|
0.903985289
|
YCH508
|
24.5437
|
7876689
|
0.933374658
|
HLB-065
|
7.54805
|
7706917
|
0.913256956
|
FD-DX15
|
8.89477
|
7836236
|
0.928581044
|