G25 Bronze Age model for West Asians



Halgurd
08-11-2020, 01:06 AM
I am sure there is room for improvement here, but it seems to be working well for West Asian ethnic groups. Use scaled coordinates.

Main components:

Steppe: Peaks in Iranic- and Turkic-speaking groups.
Zagros: Peaks in Iranic-speaking groups.
Siberian: Peaks in Turkic-speaking groups.
Levant: Peaks in Arabic-speaking groups.
Anatolian: Peaks in Armenians and Anatolian Turks.
South Central Asia: Peaks in Iranic-speaking groups.

The samples I've used are visible in the code below; let me know if any should be changed or added.


Steppe:RUS_Sintashta_MLBA:I0939,0.132035,0.112724,0.058454,0.081073,0.016926,0.03765,0.00611,-0.000231,-0.022498,-0.040274,-0.006333,0.009292,-0.01115,-0.017891,0.02728,0.01127,-0.005346,-0.00228,0.003645,0.008254,-0.00861,0.002597,0.001602,0.015785,-0.00479
Zagros:IRN_Hajji_Firuz_C,0.0896355,0.1327805,-0.0780642,-0.0667802,-0.0437002,-0.0196618,0.0098702,-0.0065768,-0.035178,-0.0030977,0.0051965,-0.0013113,-0.0006317,-0.0019613,-0.0046145,0.0036128,0.00088,0.0022802,0.0032995,-0.0063468,0.0045232,-0.0033388,-0.00835,-0.0118992,0.0006588
NW_Africa:MAR_Taforalt:TAF009,-0.162767,0.07718,-0.024513,-0.079135,0.009848,-0.047969,-0.052877,0.020538,0.13294,0.002005,0.025333,-0.026676,0.057978,-0.052985,0.072882,-0.034076,0.002347,-0.05625,-0.130475,0.04152,-0.026204,-0.120314,0.067909,-0.016508,0.012334
Siberian:RUS_Ust_Ishim:Ust_Ishim,-0.050082,-0.11577,-0.090886,0.073644,0.027082,-0.018128,-0.00376,-0.004384,0.0452,0.010387,0.006008,-0.001798,0.000149,-0.003991,0.004614,-0.001724,-0.004955,0.004687,-0.005154,0.015382,0.006613,0.008532,-0.007641,-0.014942,0.007784
East_Africa:ETH_4500BP:mota,-0.511066,0.043668,0.000754,0.000969,-0.00277,-0.011435,0.050997,-0.045229,0.089172,-0.087838,-0.012991,-0.002997,-0.031219,0.000688,0.02158,-0.029965,0.027772,0.039273,0.00176,-0.009004,0.000374,0.006183,-0.003451,-0.00241,-0.000838
Levant:TUR_Alalakh_MLBA,0.0970122,0.1472908,-0.0604988,-0.090117,-0.0156477,-0.0341748,0.0015728,-0.0050412,-0.00306,0.0125111,0.006408,-0.0047555,0.0086109,0.004912,-0.0113588,0.0070272,-0.0005767,0.0018711,0.0047668,-0.002535,0.0021548,0.0038048,-0.0040007,-0.0040968,0.0005988
Levant:Levant_JOR_EBA:I1705,0.081953,0.146236,-0.056568,-0.113697,-0.008001,-0.046296,-0.010575,-0.007384,0.030883,0.004556,0.016401,-0.020532,0.040287,-0.001651,-0.003393,0.030496,0.009127,0.007095,-0.004022,0.017884,0.003369,0.009769,0.000616,0.006507,-0.013891
South_Central_Asia:IRN_Shahr_I_Sokhta_BA2:I11456,0.039838,-0.042652,-0.180641,0.100453,-0.107405,0.049364,0.00423,-0.001385,0.00225,0.001458,-0.004709,0.001948,0.003419,-0.002615,0.007057,0.016043,-0.002999,-0.001014,0.006662,-0.02051,-0.00025,-0.014591,-0.007888,-0.018316,0.00455
Anatolian:TUR_Isparta_EBA:I2495,0.113823,0.1635,-0.035072,-0.081073,0.002462,-0.024263,0.00188,-0.007615,-0.009817,0.032802,0.003248,0.004796,-0.019475,0.003853,-0.024973,0.006099,0.022296,-0.002534,0.008422,0.006378,-0.007612,0.000618,0.002095,-0.009037,0.000958
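
Each source line above is a name followed by 25 comma-separated values. Here is a minimal Python sketch for reading such a line. It assumes that the text before the first colon is the label the results are grouped under, which would explain why the two `Levant:` rows appear as a single Levant percentage. `parse_g25_row` is a hypothetical helper name, not part of any tool. It also drops stray spaces inside values, since forum software sometimes breaks long strings.

```python
def parse_g25_row(line):
    """Split 'Label:Pop:ID,c1,...,c25' into (component, full_name, coords)."""
    name, *values = line.strip().split(",")
    # Remove spaces inside values such as "0.006 507" before converting.
    coords = [float(v.replace(" ", "")) for v in values]
    if len(coords) != 25:
        raise ValueError(f"{name}: expected 25 coordinates, got {len(coords)}")
    # Assumption: the grouping label is everything before the first colon.
    component = name.split(":", 1)[0]
    return component, name, coords


# Made-up demo row (not a real sample), just to show the call.
demo = "Levant:Demo_Pop:Demo01," + ",".join(["0.01"] * 25)
component, name, coords = parse_g25_row(demo)
print(component, name, len(coords))
```

A row that fails the 25-value check is usually one that was damaged by copy and paste, so raising an error early is safer than feeding it to the calculator.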

Target: Halgurd_scaled
Distance: 3.4225% / 0.03422476
68.2 Zagros
16.8 Steppe
7.6 South_Central_Asia
7.4 Levant

Target: Kurdish
Distance: 1.7400% / 0.01740034
73.0 Zagros
15.4 Steppe
7.2 South_Central_Asia
4.4 Levant

Target: Azeri
Distance: 2.8263% / 0.02826333
70.8 Zagros
16.6 Steppe
8.2 Siberian
2.4 South_Central_Asia
2.0 Levant
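
For anyone wondering what the Distance line means: as far as I understand it, the calculator looks for non-negative source weights summing to 100% and reports the Euclidean distance between the target and the weighted blend of sources, once as a raw number and once multiplied by 100. The Python sketch below shows that idea only. It is not the actual algorithm of Vahaduo or nMonte, just a simple slot-swapping search. The 500 slots are an assumption based on the percentages in this thread all falling on 0.2% steps, and the function names and the toy 2-dimensional data are made up for illustration.

```python
import math


def blend(sources, weights):
    """Weighted average of the source coordinate vectors."""
    dims = len(sources[0])
    return [sum(w * s[d] for w, s in zip(weights, sources)) for d in range(dims)]


def distance(target, sources, weights):
    """Euclidean distance between the target and the blended sources."""
    mixed = blend(sources, weights)
    return math.sqrt(sum((t - m) ** 2 for t, m in zip(target, mixed)))


def fit(target, sources, slots=500):
    """Greedy search: move one slot (1/slots of ancestry) between sources
    whenever that lowers the distance, until no single move helps."""
    n = len(sources)
    counts = [slots // n] * n
    counts[0] += slots - sum(counts)  # make the counts add up to `slots`
    best = distance(target, sources, [c / slots for c in counts])
    improved = True
    while improved:
        improved = False
        for i in range(n):
            for j in range(n):
                if i == j or counts[i] == 0:
                    continue
                counts[i] -= 1
                counts[j] += 1
                d = distance(target, sources, [c / slots for c in counts])
                if d < best - 1e-15:
                    best = d
                    improved = True
                else:  # undo the move
                    counts[i] += 1
                    counts[j] -= 1
    return [c / slots for c in counts], best


# Toy data: two sources in 2 dimensions and a target that is an exact mix.
toy_sources = [[1.0, 0.0], [0.0, 1.0]]
toy_target = [0.3, 0.7]
weights, dist = fit(toy_target, toy_sources)
print(f"Distance: {dist * 100:.4f}% / {dist:.8f}")
for w, label in sorted(zip(weights, ["Source_A", "Source_B"]), reverse=True):
    print(f"{w * 100:.1f} {label}")
```

With real data each source and the target would be a 25-value scaled G25 row, and rows sharing a label would have their weights summed before printing.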

Aren
08-11-2020, 01:26 AM
That’s a bad model because Hajji Firuz Chl has too little Iran_N. Try using Seh Gabi Chl instead.

Zoro
08-11-2020, 01:48 AM
Bira, although I don't have much faith in G25, you should try to keep your samples close to the same time period. For example, you have the 40,000-year-old Ust Ishim, which is basal to some of the Eurasian samples you're using, alongside samples that are only 3,000 years old.

Also, some of your samples are too admixed, such as Iran-Chl and Sintashta.

I would suggest using the following instead:

Anatolia-N
Iran-N
Israel-Chl
Shamanka-EN (Siberian)
Devils Gate (E Asian)
EHG (steppe)
WHG
Taforalt


Also, don't use Davidski's averages, because some of the samples he has included are badly damaged, low-coverage samples. Below is a list of the best-quality samples for each of those groups. I left out the low-quality ones with more than 400,000 missing SNPs, which, by the way, Davidski has used to calculate his averages. You can average these yourself manually.


SAMPLE                       ID                        MISSING SNPs   TOTAL SNPs
Anatolia_N                   Bar8.SG                   6031           727443
Anatolia_N                   Bar31.SG                  59882          728459
Anatolia_N                   I0707                     114781         727443
Anatolia_N                   I0746                     122086         728459
Anatolia_N                   I0745                     127413         728459
Anatolia_N                   I1583_published           132760         728459
Anatolia_N                   I0709                     133888         728459
Anatolia_N                   I0708                     137209         728459
Anatolia_N                   I1580_published           166381         727443
Anatolia_N                   I0744                     195852         728459
Anatolia_N                   I1581_published           214034         727443
Anatolia_N                   I1585_published           215166         727443
Anatolia_N                   I1579_published           218031         727443
Anatolia_N                   I1098                     241362         727443
Anatolia_N                   I0736                     242071         727443
Anatolia_N                   ZHAG_BON004.A0101_Luk10   244315         727443
Anatolia_N                   I1096                     267488         728459
Anatolia_N                   I1097                     268586         728459
Iran_GanjDareh_N             I1954                     156984         728459
Iran_GanjDareh_N             I1947                     158182         728459
Iran_GanjDareh_N             I1290                     204000         727443
Russia_Shamanka_Eneolithic   DA249.SG                  23588          728459
Russia_Shamanka_Eneolithic   DA246.SG                  82497          727443
Russia_Shamanka_Eneolithic   DA253.SG                  97313          728459
Russia_Shamanka_Eneolithic   DA252.SG                  116769         728459
Russia_Shamanka_Eneolithic   DA247.SG                  117288         727443
Russia_Shamanka_Eneolithic   DA248.SG                  125827         728459
Russia_Shamanka_Eneolithic   DA245.SG                  142111         727443
Israel_C                     I1169                     101883         727443
Israel_C                     I1178                     291565         727443
Israel_C                     I0644                     356202         727443
Israel_C                     I1152                     357749         727443
Morocco_Iberomaurusian       TAF011                    97611          727443
Morocco_Iberomaurusian       TAF013                    123991         728459
Morocco_Iberomaurusian       TAF014                    131891         728459
Morocco_Iberomaurusian       TAF010                    219833         728459
Russia_HG_Karelia            I0061                     105785         728459
DevilsCave_N                 NEO240.SG                 2079           727443
DevilsCave_N                 NEO236.SG                 60229          727443
DevilsCave_N_GmomNEO240      NEO235.SG                 188658         727443
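
Averaging the hand-picked samples yourself, as suggested in this post, could look roughly like the Python sketch below. It assumes you already have each sample's coordinates as a list of numbers plus a lookup of missing-SNP counts. `average_by_population` is a hypothetical helper name, and the coordinates in the demo are made up and shortened to two values for readability. Only the sample IDs and missing-SNP counts for the two Ganj Dareh samples come from the list; `LowCov01` is invented.

```python
from collections import defaultdict


def average_by_population(rows, missing_snps, max_missing=400_000):
    """Average coordinates per population, skipping low-coverage samples.

    rows: iterable of (population, sample_id, coords)
    missing_snps: dict mapping sample_id -> number of missing SNPs
    Samples with no entry in missing_snps are skipped as well.
    """
    grouped = defaultdict(list)
    for population, sample_id, coords in rows:
        if missing_snps.get(sample_id, max_missing + 1) > max_missing:
            continue
        grouped[population].append(coords)
    # zip(*members) walks the samples dimension by dimension.
    return {
        population: [sum(column) / len(column) for column in zip(*members)]
        for population, members in grouped.items()
    }


# Demo with made-up 2-value coordinates (real G25 rows have 25 values).
rows = [
    ("Iran_GanjDareh_N", "I1954", [0.01, 0.02]),
    ("Iran_GanjDareh_N", "I1947", [0.03, 0.04]),
    ("Iran_GanjDareh_N", "LowCov01", [0.90, 0.90]),  # invented low-quality sample
]
missing = {"I1954": 156984, "I1947": 158182, "LowCov01": 450000}
print(average_by_population(rows, missing))
```

The 400,000 cutoff is the one mentioned in the post. Lowering `max_missing` gives a stricter filter at the cost of fewer samples per average.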

SUPREEEEEME
08-11-2020, 06:22 AM
Target: SUPREEEEEME_scaled
Distance: 3.9128% / 0.03912777
35.2 Levant
32.0 Steppe
26.2 Anatolian
6.6 NW_Africa

When swapping out JOR_EBA for Ashkelon_LBA:
Target: SUPREEEEEME_scaled
Distance: 3.6789% / 0.03678912
34.8 Levant
32.2 Steppe
27.4 Anatolian
5.6 NW_Africa

Luso
08-11-2020, 06:29 AM
Target: Luso_scaled
Distance: 7.6151% / 0.07615074
46.4 Steppe
43.4 Anatolian
10.2 NW_Africa

happycow
08-11-2020, 07:08 AM
Distance: 1.7240% / 0.01724006
81.8 Levant
7.0 Steppe
5.6 NW_Africa
4.2 East_Africa
0.8 South_Central_Asia
0.6 Siberian

Pedro Ruben
08-11-2020, 10:55 AM
Target: Pedro_scaled
Distance: 6.6769% / 0.06676859
48.2 Steppe
40.4 Anatolian
10.6 NW_Africa
0.8 East_Africa

gixajo
08-11-2020, 11:03 AM
Target: gixajo_scaled
Distance: 7.9019% / 0.07901898
50.6 Steppe
44.6 Anatolian
4.8 NW_Africa

Target: gixajo_dad_scaled
Distance: 7.6943% / 0.07694274
49.8 Anatolian
43.8 Steppe
6.4 NW_Africa

Target: gixajo_mom_scaled
Distance: 8.6302% / 0.08630231
55.0 Steppe
40.0 Anatolian
5.0 NW_Africa

Gallop
08-11-2020, 11:14 AM
I have taken out Taforalt because I suspect its coordinates come essentially pre-loaded with Iberian values, which was also hiding East Africa from my results. East Africa is more in line with the origin of my Y-DNA haplogroup.

Target: Gallop_scaled
Distance: 7.9457% / 0.07945700
49.2 Steppe
48.2 Anatolian
2.6 East_Africa


Target: Gallop_scaled
Distance: 7.9724% / 0.07972350 | ADC: 0.25x
49.6 Steppe
48.8 Anatolian
1.6 East_Africa