The Y-STR analyses of 17 males give evidence on a surprisingly homogeneous Y chromosomal
138 composition (Table S1). Y chromosomal STR profiles of 14 males could be assigned to
139 haplogroup N-Tat (also N1a1-M46, see Methods and Table S1). N-Tat haplotype I was found
140 in four males from Kunpeszér with identical alleles on at least nine loci. The full Y-STR
141 haplotype I, reconstructed from AC17 with 17 detected STRs, is rare in our days. Only nine
142 matches were found among 205,059 haplotypes in YHRD database, such as samples from the
143 Ural Region, Northern Europe (Estonia, Finland), and Western Alaska (Yupiks). We performed
144 Median Joining (MJ) network analysis using 162 N-Tat haplotypes with ten shared STR loci
145 (Fig. 3, Table S9). All modern N-Tat samples included in the network had derived allele of
146 L708 as well. Haplotype I (Cluster 1 in Fig. 3) is shared by eight populations on the MJ network
147 among the 24 identical haplotypes. Cluster 1 represents the founding lineage, as it is described
in Siberian populations17 148 , because this haplotype is shared by the most populations and it is
149 more diverse than Cluster 2.
150 Nine males share N-Tat haplotype II (on a minimum of eight detected alleles), all of them buried
151 in the Danube-Tisza Interfluve (Table S1). We found 30 direct matches of this N-Tat haplotype
152 II in the YHRD database, using the complete 17 STR Y-filer profile of AC1, AC12, AC14,
153 AC15, AC19 samples. Most hits came from Mongolia (seven Buryats and one Khalkh) and
154 from Russia (six Yakuts), but identical haplotypes also occur in China (five in Xinjiang and
155 four in Inner Mongolia provinces). On the MJ network, this haplotype II is represented by
156 Cluster 2 and is composed of 45 samples (including 32 Buryats) from six populations (Fig. 3).
157 A third N-Tat lineage (type III) was represented only once in the Avar dataset (AC8), and has
158 no direct modern parallels from the YHRD database. This haplotype on the MJ network (see
159 red arrow in Fig. 3) seems to be a descendent from other haplotype cluster that is shared by
160 three populations (two Buryat from Mongolia, three Khanty and one Northern Mansi samples).
161 This haplotype cluster also differs one molecular step (locus DYS393) from haplotype II.
162 We classified the Avar samples to downstream subgroup N-F4205 within the N-Tat
haplogroup, based on the results of ours and Ilumäe et al.18 163 and constructed a second network
164 (Fig. S4). The N-F4205 network results support the assumption that the N-Tat Avar samples
165 belong to N-F4205 subgroup (see SI chapter 1d for more details).
166 Based on our calculation, the age of accumulated STR variance (TMRCA) within N-Tat lineage
167 for all samples is 7.0 kya (95% CI: 4.9 - 9.2 kya), considering the core haplotype (Cluster 1) to
168 be the founding lineage. (See detailed results on the N-Tat and N-F4205 haplotypes in the SI
169 chapter 1f.) Y haplogroup N-Tat was not detected by large scale Eurasian ancient DNA
studies9,19 but it occurs in late Bronze Age Inner Mongolia20 and late medieval Yakuts21 170 , among
them N-Tat has still the highest frequency22 171 .
172
173 Two males (AC4 and AC7) from the Transtisza group belong to two different haplotypes of Y174 haplogroup Q1. Both Q1a-F1096 and Q1b-M346 haplotypes have neither direct nor one step
175 neighbour matches in the worldwide YHRD database. A network of the Q1b-M346 haplotype
176 shows that this male had a probable Altaian or South Siberian paternal genetic origin (Fig. S5).
Bookmarks