Hi,
NE Italy can be easily done by grouping Italian_Veneto + Italian_Trentino-Alto-Adige + Italian_Northeast (Triveneto region).
For C. Italy, I'd go for Italian_Lazio + Italian_Tuscany + Italian_Umbria + Italian_Marche.
(Abruzzo is "ethnically" Southern Italian)
S. Italy: Italian_Abruzzo + Italian_Apulia + Italian_Basilicata + Italian_Calabria* + Italian_Campania + Italian_Molise* + Sicilian_East
I didn't include Sicilian_West because it plots abnormally close to Central Italians (lack of samples?).
*Some here mentioned that the Calabrese avg isn't well represented because there are only three samples, but honestly I've been using it and IMO they're not too outlying; there are some individual Campanian samples very similar to them. If you think it's better to remove the Italian_Calabria avg, consider also removing Italian_Molise, since it's composed of only two samples and is quite Central Italian-shifted; otherwise the "northern" part of South Italy would be overrepresented. To summarize, I'd either include both Calabria and Molise or remove both, but I'd certainly remove Sicilian_West.
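In case it helps, here's a rough Python sketch of the groupings above. Two things are my own assumptions and not something anyone here has settled: the regional avg is taken as a plain equal-weight mean of the population avgs, and `coords` is a hypothetical dict mapping each label to its list of coordinates (you'd fill it from your own datasheet). The function names are made up for illustration.

```python
from statistics import fmean

# Groupings as proposed above; Sicilian_West is deliberately left out.
GROUPS = {
    "NE_Italy": ["Italian_Veneto", "Italian_Trentino-Alto-Adige", "Italian_Northeast"],
    "C_Italy": ["Italian_Lazio", "Italian_Tuscany", "Italian_Umbria", "Italian_Marche"],
    "S_Italy": ["Italian_Abruzzo", "Italian_Apulia", "Italian_Basilicata",
                "Italian_Calabria", "Italian_Campania", "Italian_Molise",
                "Sicilian_East"],
}

# The two low-sample avgs that should be kept or dropped together.
LOW_SAMPLE = {"Italian_Calabria", "Italian_Molise"}


def group_members(group, keep_low_sample=True):
    """Return the labels in a group, optionally dropping Calabria and Molise as a pair."""
    members = GROUPS[group]
    if keep_low_sample:
        return list(members)
    return [m for m in members if m not in LOW_SAMPLE]


def group_average(coords, group, keep_low_sample=True):
    """Equal-weight mean of the member avgs, dimension by dimension (assumption).

    coords: hypothetical dict {label: [float, ...]}, all lists the same length.
    """
    rows = [coords[m] for m in group_members(group, keep_low_sample)]
    return [fmean(column) for column in zip(*rows)]
```

Calling `group_average(coords, "S_Italy", keep_low_sample=False)` gives the variant with both Calabria and Molise removed, so you can compare the two versions directly.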
NW Italy is a problem. Swiss_Italian is acting almost as a Tuscan avg for some reason and Italian_Piedmont is pretty shaky.
I've read people saying the Italian_Piedmont samples are from Val Borbera, which is "ethnically" Ligurian. I'm not sure if this is the case; could someone here confirm? :confused:
Anyway, a NW Italy group would have a huge hole because it's missing the piece that would practically be its protagonist: a consistent Italian_Piedmont avg.
I'm not sure how I would group them together :icon_ask:

