GEDmatch.com Adds Phasing Tool

Today (or perhaps yesterday?) popular DIY genomics website GEDmatch.com released a new tool for phasing DNA data.  Listed under a link entitled “Generate phased data file,” the tool allows users of the GEDmatch.com site to phase their chromosomes if they have their parent’s raw data.

(A similar tool was previously created by David Pike at http://www.math.mun.ca/~dapike/FF23utils/; with David’s tool, users receive their results directly and do not need to upload their DNA test results; accordingly, users have a variety of options depending on their privacy tolerance).

What the Heck is “Phasing”?

Currently, SNP chip testing performed by 23andMe or Family Tree DNA is unable to attribute a test result to either one of your parents.  For example, if your results for SNP rs00000 are “AG,” the test alone cannot determine whether the “A” came from your mother or father.

“Phasing” refers to the process of separating the mixed DNA results (the “AG”) into the DNA obtained from your mother (the “A”) and the DNA obtained from your father (the “G”).  This is typically done by comparing your results to your parents’ results and determining which parent could have and/or must have contributed each SNP.

For example, if mother’s results are “AA” at rs00000, and father’s results are “GA” at rs0000, then the data can be phased into “A” from mom and “G” from dad (since only dad could have contributed the “G”).  Every once in a while, the data can’t be phased, however (say you’re “AG,” mom is “AG,” and dad is “AG”).

What Good is Phasing?

Many genealogists are using phased data to identify which DNA came from individual grandparents, great-grandparents, and beyond.  I won’t get into that in detail now, but hope to at some point in the future (and eventually in the book I’m working on!).

As another example of using phased data, I used the new GEDmatch.com tool to phase my data.  Both my parents and I had previously uploaded our 23andMe and FTDNA data into GEDmatch.  I then performed some admixture analysis to compare unphased v. phased data.

Here is my unphased chromosome painting (Dodecad World9):

For comparison, here is my chromosome painting using the DNA I obtained from my father (same settings):

And here is my chromosome painting using the DNA I obtained from my mother (same settings):

Note that since this is so early I can’t say for certain whether using phased data creates some unwanted effects on the analysis (I’d love some input on that).  It is interesting, however, to compare the results of phased v. unphased data.

What uses will you put your phased data to?

Blaine Bettinger

Intellectual property attorney, genealogist, and author of The Genetic Genealogist since 2007


  1. I was in hopes I could use the phased report to clarify mom from dad since I am adopted. My birth mom took the dna test for me after I did mine. Not sure how accurate the reports I got would be. I keep holding out hope of someday being able to discern what matches could be from my unk fathers side.

  2. My brother recommended I might like this website.
    He was once totally right. This publish actually made my day.
    You can not consider simply how a lot time I had spent for this information! Thanks!

  3. Call me dense but I’m trying to figure out what the use of the phasing tool is if I already have both parents, or if I only have my father what is the point in creating a phased account for him?

  4. If a man does not know who his biological father is, there are some other methods that can help in FamilyFinder. If the man has a sibling and they know for sure who the father is then testing this sibling should help sort out FamilyFinder matches. Both individuals have the same mother but different fathers. If in using FamilyFinder you sort matches so those that only match the sibling are likely from the maternal side (both testers have the same mother). Now we are left with other matches that only show on the man’s matches and therefore are likely connected to his paternal side. Of course we often find we get many matches so that is just the start of digging deeper. Very distant cousins, 5th or higher are difficult to find a common ancestor since they go back so many generations. If we find someone who is a 1st or 2nd cousin we have a better chance of narrow down a common ancestor. All of this means you still need to have a good genealogy database of ancestors to possibly find the link.

Leave a Reply

Your email address will not be published. Required fields are marked *