Quote:
Here is a concise list in English of each sample from your model, with a very short description of its typical/represented origin (based on common G25 usage, datasets like Eurogenes/Global25, HGDP, and community references):
Romanian:G421 — Romanian from Gorj county (Tismana area), Oltenia region, southern Romania; often used as a "core" or typical modern Romanian reference.
Spanish:CAN924 — Spanish from the Canary Islands (insular Atlantic Iberian with Guanche/North African admixture component).
Estonian:Est391 — Estonian (modern ethnic Estonian, representative of the general Baltic Estonian cluster; no specific sub-region detailed, but typical northern Baltic).
Swedish:GSM1884792 — Swedish (modern Swede, generic representative; likely central or southern Sweden based on common GSM series usage in datasets).
German:GSM1031514 — German from northern Germany (often associated with Hamburg or nearby northern/central areas in sample sets; sometimes shows minor eastern/Slavic influence in averages).
Sardinian:HGDP00674 — Sardinian from the Gennargentu massif (central mountainous interior of Sardinia, e.g., Ogliastra province villages like Tortoĺ/Lanusei area); classic high-isolation Neolithic-like Sardinian reference.
These are modern reference samples commonly included in G25 tools for PCA and modeling, capturing broad regional variation in Europe.
NOTE: The AI of Google says "Romanian:G421" is from Northeast of Romania, while Grok says it is from Wallachia.