7
This paper is from 2022 but I can't see it posted anywhere. I thought it had some interesting information.
This is a new genetic plot with all of Europe including Turkey.Significance
Recent haplotype sharing analyses in specific European populations have revealed fine-scale genetic differentiation that echoes history. An equivalent understanding across the whole European continent would place these insights into a wider context and extend understanding to underdescribed regions. Here, we leverage haplotype data from 5,500 European-ancestry individuals from the UK Biobank (UKBB) in a methodological approach to update and expand the European genetic landscape, extending coverage to regions in the southeast of Europe, identifying communities of high haplotype sharing that may be of interest to genetic mapping, such as Malta. Together, our results highlight European communities with diverse ancestries sampled within the UKBB and demonstrate the potential for insights to be made with other non-European ancestry communities using this dataset.
A sample of European structure in the UKBB. (A) The number of individuals included from each European country analyzed. Countries are grouped by geographic region; these regions are chosen as a means of group representation and do not necessarily imply historical links. Sample sizes from each region are also shown. Abbreviations are as follows: SE Europe (southeastern Europe), S Europe (southern Europe), E Europe (eastern Europe), C Europe (central Europe), N Europe (northern Europe), W Europe (western Europe), Brit. & Ire. (Britain and Ireland). (B) The sample counts for each European region. (C) The first two PCs calculated by PLINK of 5,500 European individuals. Individual genotypes are shown by letters that encode the alpha-2 ISO 3166 international standard codes and are color coded according to geographic region. The median PC for each country/region of birth is shown as a label. Plots were generated using the ggplot2 package (65) in the R statistical computing language (59).
Leiden clustering of 5,500 Europeans from the UKBB. (A) The dendrogram of Leiden clusters, grouping them according to their hierarchical relationships. The three main branches are color coded, with additional subdivisions shown as vertical lines. Each of the 41 cluster labels are shown alongside their associated color and shape coding. (B) The membership of each of the 41 Leiden clusters. Along the x axis shows country/region of birth, and along the y axis cluster membership. The heat map shows the proportion of individuals from each country of birth in each cluster (Freq), and the absolute number. (C) The first two PCs of the pbwt paint chunkcounts coancestry matrix. Each point represents the phased genotype of an individual, color and shape coded according to Leiden cluster membership, using the convention shown in A. Additional labels are shown to show the broad European region that individuals were born from. Plots were generated using the ggplot2 package (65) in the R statistical computing language (59).
https://www.pnas.org/doi/10.1073/pnas.2119281119These groups correspond to individuals predominantly born within the northwest of Europe (NW Europe), center and east of Europe (CE Europe), and the south of Europe (S Europe). We highlight interesting information or findings from these regions below; for a full discussion of the clustering and ancestry of these clusters see SI Appendix, Supplementary Data 5.
Bookmarks