PDA

View Full Version : Is there any distortion?



Repent
11-28-2025, 01:16 PM
So, I've got Genotek (Russian DNA test company) RAW data file *.vcf, I've converted it via Genotek official service to 23andme V5 *.txt file and I had given that file to Davidski for real G25 coordinates.

What's your take on possible data distortion on the stage when I did convert my *.vcf file?

Can I compare those files between each other somehow?

ScandinavianCelt
11-28-2025, 06:02 PM
Not sure.

Peterski
11-28-2025, 06:49 PM
If you converted it via Genotek official service then there should be no distortion, I think.

BTW how large is your VCF file?

Repent
11-28-2025, 07:07 PM
If you converted it via Genotek official service then there should be no distortion, I think.

BTW how large is your VCF file?

Thank you. My *.vcf file has this size: 26.97 mb.

Luke35
11-28-2025, 08:07 PM
I think that it's possible that the 23andMe v5 format will impact your results in a small way. 23andMe v5 does not have great overlap/snp coverage with GEDmatch. In other words, it has about 2/3 less overlapping snps (when used on GEDmatch) compared to AncestryDNA data. It's not that it's worse data, it's just less compatible with GEDmatch as 23andMe designed the v5 chip to be propriety, to focus on specific snps.

Now, I remember a long time ago that Davidski recommended AncestryDNA data over 23andMe data for use with G25. But later I saw that he said that it doesn't really matter. So whether or not he changed something about how he processes the data, or just decided that he thinks the usually minor difference in results is no big deal, I'm just not sure.

Luke35
11-28-2025, 09:15 PM
I want to add that I believe Davidski analyzes the same set of snps that GEDmatch does, hence my connecting the two (GEDmatch and G25) in my previous comment.

Repent
12-01-2025, 06:10 PM
If you converted it via Genotek official service then there should be no distortion, I think.

BTW how large is your VCF file?

Look, bro:

https://i.postimg.cc/qRzryB5F/20251201-210605.jpg

https://i.postimg.cc/TPKNcyTs/20251201-210531.jpg

https://i.postimg.cc/wT9WG46H/20251201-210603.jpg

Peterski
12-01-2025, 06:16 PM
^^^
LOL, there must be some bug with PLINK file.
What about Eurogenes K36 results for your PLINK file, are they also bugged?

Anyway, Merged should be the most accurate.

Repent
12-01-2025, 06:38 PM
^^^
LOL, there must be some bug with PLINK file.
What about Eurogenes K36 results for your PLINK file, are they also bugged?

Anyway, Merged should be the most accurate.

Here you go:

https://i.postimg.cc/3xqjZnS2/20251201-213514.jpg

https://i.postimg.cc/nL7mGZ9W/20251201-213517.jpg

https://i.postimg.cc/FzPkHHwc/20251201-213520.jpg

Can you tell me, why merged file is the most accurate?

Peterski
12-01-2025, 06:40 PM
Can you tell me, why merged file is the most accurate?

Because it includes all SNPs from 23andMe V5 as well as all other SNPs which could be omitted in V5 but are in your VCF file.

Repent
12-01-2025, 06:41 PM
Because it includes all SNPs from 23andMe V5 as well as all other SNPs which could be omitted in V5 but are in your VCF file.

Sounds cool, thank you. :thumb001:

Repent
12-01-2025, 07:00 PM
Distance to: Repent_K13
1.18275103 Repent_K13_Merged

This is big.

Repent_K13_Merged,24.52,42.15,10.13,8.83,7.16,0.91 ,0.68,1.00,3.00,0.86,0.74,0,0

Repent_K13,25.14,41.98,9.95,8.89,7.27,0.72,0,1.52, 2.65,1.05,0.84,0,0

Update: Or not. I've forgot how to interpret K13 distance already.

BabtistJohn
12-01-2025, 09:19 PM
Hey Trevor. There is distortion on any of these things since the populace you are comparing yourself also has distortion. Don't take it as a matter of fact, but a rough estimate instead.

Repent
12-02-2025, 06:01 AM
Hey Trevor. There is distortion on any of these things since the populace you are comparing yourself also has distortion. Don't take it as a matter of fact, but a rough estimate instead.

Trevor from GTA5?