The Genetic Genealogist

Adding DNA to the Genealogist's Toolbox

Archive for the "Genealogy" Category


Exploring New Scientific Research With My Genotype In Hand

This morning, a single tweet sent me on a 2-hour tour (more, if you count drafting this post!) of my genome.

In the tweet, Mary Carmichael expressed interest in a potential book regarding the orchid/dandelion theory recently described in a December 2009 article in The Atlantic “The Science of Success.”  Before this morning, I was not familiar with either the article or the theory.

The introduction to the article, reproduced below, does a good job of summarizing the main thrust of the very long (but extremely interested and worthwhile) report:

“Most of us have genes that make us as hardy as dandelions: able to take root and survive almost anywhere.  A few of us, however, are more like the orchid: fragile and fickle, but capable of blooming spectacularly if given greenhouse care.  So holds a provocative new theory of genetics, which asserts that the very genes that give us the most trouble as a species, causing behaviors that are self-destructive and antisocial, also underlie humankind’s phenomenal adaptability and evolutionary success.  With a bad environment and poor parenting, orchid children can end up depressed, drug-addicted, or in jail—but with the right environment and good parenting, they can grow up to be society’s most creative, successful, and happy people.”

As the introduction suggests, the article examines the complicated interaction between environment and genetics and suggests that while genetics can present hurdles in life, environmental factors can increase or perhaps even eradicate those hurdles.

Nature v. Nurture

The article begins with a discussion of complex behavioral science experiments using humans or monkeys before bringing in recent studies of genetics that tie into these experiments.  For example, the author mentions the 5-HHTLR gene, which is involved in serotonin processing:

“As I researched this story, I thought about such questions a lot, including how they pertained to my own temperament and genetic makeup. Having felt the black dog’s teeth a few times over the years, I’d considered many times having one of my own genes assayed—specifically, the serotonin-transporter gene, also called the SERT gene, or 5-HTTLPR. This gene helps regulate the processing of serotonin, a chemical messenger crucial to mood, among other things. The two shorter, less efficient versions of the gene’s three forms, known as short/short and short/long (or S/S and S/L), greatly magnify your risk of serious depression—if you hit enough rough road. The gene’s long/long form, on the other hand, appears to be protective.”

From SNPedia:

“5-HTTLPR (serotonin-transporter-linked polymorphic region) is a degenerate repeat polymorphic region in SLC6A4, the gene that codes for the serotonin transporter. It has been extensively investigated in connection with the behavioral, psychiatric, pharmacogenetic aspects of neuropsychiatric disorders.  In contrast to earlier reports, a June 2009 article in JAMA showed no association between 5-HTTLPR genotype and depression.”

My 5-HTTLPR Status

Perhaps not surprisingly for anyone who has read The Genetic Genealogist, I was immediately interested in determining my own 5-HTTLPR status.  Based solely on my personal history (for example, I’ve never been overly prone to depression) and family history, I quickly predicted that my status would be S/L.

The author of The Atlantic article was also interested in his 5-HTTLPR status and sent away a saliva sample to a researcher she knew for analysis.  You can read the article to learn his status in the last few terrific paragraphs.

However, being one of the most extensively genotyped people in the world (which still doesn’t require much genotyping; I’ve had whole-genome scans performed by two different companies, along with Y-DNA and mtDNA testing), I turned to the results I already had in hand.

Unfortunately, the main SNPs used to examine the S or L version of 5-HTTLPR are not examined by 23andMe.  However, there has been extensive discussion of the gene in the 23andMe forums, and one member pointed out (here) that a 2009 study associated the CA haplotype of SNPs rs4251417 and rs2020934 is coupled with the short allele of 5-HTTLPR (although not perfectly, with r(2) = .72).

Of these “surrogate SNPs,” 23andMe only tests rs4251417.  A quick glance at my results revealed that I am C/C homozygous at rs4251417, suggesting that I might be 5-HTTLPR S/S, not S/L as I had predicted.  (I should note here that with just the rs4251417 allele and with a combined r(2) of 0.72, it is not clear how well the rs4251417 allele alone predicts 5-HTTLPR status despite the discussions found in the 23andMe forums).

There are a myriad of articles examining the S/L alleles, including research regarding their effect on stress (“We found that the s allele of 5-HTTLPR was associated with depression and perceived stress in patients with coronary disease.”); aggressive behavior in alcoholics (“Data suggests that the presence of s allele may confer a genetic vulnerability factor to the development of aggressive behaviour in alcohol dependent subjects, specially, in interaction with acute alcohol consumption stage”); and my favorite, financial risk (“We find that the 5-HTTLPR s/s allele carriers take 28% less [financial] risk than those carrying the s/l or l/l alleles of the gene.”).  Interestingly, it appears that none of this research considered the environmental factors that appear to be so influential on the 5-HHTLPR genotype, something that is undoubtedly endemic to genotype/phenotype studies.

The Future

Now that my wife has had her genome analyzed, I can do something that I couldn’t do with my results alone; I can predict the possible 5-HTTLR genotypes of our offspring.

This is, of course, tricky business.  I’m still not sure how I feel about purchasing genetic testing for my children, but this is a far cry from buying them a test.  I’m simply using basic genetic techniques to predict possible genotype outcomes, something that high school biology students have been doing for decades (determining the % of blue vs. brown eyed-children using various parental genotypes, for example).

Although an interesting exercise (and one that I’ve been performing often), given the state of the 5-HHTLR science I don’t believe that I’ve gained any useful or actionable information from an estimate of my children’s genotype.  Of course, I’m not even sure exactly how strong the research would have to be to make almost any genotype actionable!

Caveats

This discussion and analysis is for my personal interest only.  Specifically, I’m intrigued by the (as-of-yet unregulated) ability to check my own genotype against the results of new research.  I do not plan to make any lifestyle or parenting changes based on the results discussed in this post, and I do not suggest that you should do so either.  I simply examined my genetic code to determine my allele status and then examined the primary research to review the discussion of that allele status in the literature.  And I certainly hope I will be able to continue to do this in the future.

Edit Before Posting:

I was finally able to obtain a copy of the 2009 study that associated the CA haplotype of SNPs rs4251417 and rs2020934 is coupled with the short allele of 5-HTTLPR.  The authors include the following in their analysis, revealing that the rs4251417 SNP is not a useful proxy for determining your 5-HTTLPR status:

“Unfortunately, rs2020934 has not been genotyped as part of the HapMap project and has not been included on any of the genome-wide SNP platforms.  SNP rs4251417 is included on the Illumina 610K and 1M chips, but on its own, it is not a useful proxy for 5HTTLPR (r2 = .06).”

While this means that the above analysis was not fruitful, it emphasizes three very important points regarding personal genomics: (1) people will increasingly turn to their personal genotype as they read new research; (2) be sure to confirm everything for yourself; and (3) at this stage of the game, you should be prepared for everything you’ve discovered and/or concluded to be turned on its head with new research.

Family Tree DNA’s 6th International Conference on Genetic Genealogy Announced

From a Press Release issued by Family Tree DNA on August 11, 2010:

FAMILY TREE DNA’S 6th INTERNATIONAL CONFERENCE ON GENETIC GENEALOGY FOR GROUP ADMINISTRATORS TO BE HELD OCTOBER 30 & 31, 2010 IN HOUSTON

HOUSTON, (August 11, 2010) — Family Tree DNA, the world leader in genetic genealogy, will host its 6th International Conference on Genetic Genealogy on October 30-31, 2010, at the Sheraton North Houston in Houston, Texas. Each year, world renowned experts in genetics and science present cutting-edge developments and exciting new applications at this two-day educational forum which draws attendees from Family Tree DNA’s Group Administrators from around the world. This year’s conference will focus on the new Family Finder test which allows customers to find relatives across all ancestral lines.

Founded in April 2000, Family Tree DNA was the first company to develop the commercial application of DNA testing for genealogical purposes. Previously, this type of testing had only been available for academic and scientific research. Almost a decade later, the Houston-based company continues to establish standards and create new milestones in the increasingly popular and rapidly growing field of genetic genealogy.

Today – with over 300,000 individual records – Family Tree DNA has the largest DNA databases in genetic genealogy, a number that makes it the prime source for anyone researching recent and distant family ties. Family Tree DNA’s database also encompasses over 95,000 unique surnames and nearly 6,000 lineage and geographic projects.

In 2005, Family Tree DNA was selected by National Geographic and IBM as the designated DNA testing company for their Genographic Project, a history-making study of the migrations of mankind. To date, the company has processed more than 300,000 Genographic Project DNA tests. Family Tree DNA’s own laboratory-the Genomics Research Center-participated in the Genographic Project’s first published paper and other scientific papers.

Offering the most popular and wide-ranging DNA-testing service in the field of genetic genealogy, Family Tree DNA prides itself on its commitment to the practice of solid, ethical science. Since its beginnings, the company has associated itself with leading researchers and scientists in the field, many of whom will be speaking at this year’s conference. Among these prominent names are Dr. Michael Hammer, Dr. Doron Behar, and Thomas Krahn. Family Tree DNA has also been involved with several scientific papers and has provided assistance in updating the YCC Y-Chromosome Phylogenetic Tree.

* * * * *

Online information and registration for the 2010 conference is available at: http://www.familytreedna.com/conference/

For registration information, please contact Jane Buck-tel: 713-868-1438; e-mail: info@familytreedna.com

Media contact for Family Tree DNA: Sharon Weisz, W3 Public Relations-tel: 323-934-2700; e-mail: Sharon@familytreedna.com

For media information on The Genographic Project, please contact Glynnis Breen at National Geographic-tel: 202-857-7481; e-mail: gbreen@ngs.org

A Review of Family Tree DNA’s Family Finder – Part II

Last week I wrote about the results of my Family Finder autosomal DNA test by Family Tree DNA (see “A Review of Family Tree DNA’s Family Finder – Part I“).  The Family Finder test uses a whole-genome SNP scan to find stretches of DNA shared by two individuals, thus identifying your genetic cousins (and will soon include the Population Finder analysis of admixture percentages).  I currently have over 33 genetic cousins in Family Finder, and I’m working with them to identify our common ancestor(s).

The Affymetrix microarray chip used by FTDNA includes over 500,000 pairs of SNPs located on the X chromosome and the autosomes (no Y chromosome SNPs).  Via SNPedia:

FamilyTreeDNA uses an Affymetrix Axiom CEU microarray chip with 3,269 SNPs removed (563,800 SNPs reported) for autosomal and X (but not Y or mitochondrial) ancestry testing for $289. Other sources have cited 548011 snps. This platform tests 1871 of the 12442 snps in SNPedia.

FTDNA states that the Family Finder test is not intended to be medical.  From the FTDNA FAQ:

Question: Is the Family Finder test medical?

Answer: No, it is not.

This is entirely accurate of course; FTDNA does not analyze the test results for health, traits, or other medically-relevant information, and does not provide the user with any medical information or analysis tools that might reveal medical information.

However, when DNA is involved there is almost never any such thing as a completely non-medical test.  It’s often impossible, at any given point in time, to know which of an individual’s SNPs might be affiliated – remotely or closely – with a medical state or condition.  Ann Turner recently wrote the following at the Rootsweb GENEALOGY-DNA mailing list in response to another individual’s question:

Question:  “I am wondering if FTDNA really left out the genes and just lists the intergenic areas?”  Answer:  “No, the claim was that they scrubbed medically significant SNPs.  They still include over 1600 SNPs with entries in SNPedia, which would have some phenotype implications, according to an analysis posted at DNA-Forums: http://tinyurl.com/27slbj8.”

Indeed, as of August 3rd, 2010, there are 12,442 SNPs in SNPedia, of which a total of 1,871 are tested by Family Tree DNA’s Family Finder test.

Promethease Analysis

I was curious as to what information my Family Finder results might contain, so I ran my results through Promethease, a free software tool used to analyze whole-genome SNP scan results.  From the Promethease website:

“Promethease is a tool to build a report based on SNPedia [an impressive database of annotated SNPs] and a file of genotypes [i.e., your Family Finder results]. Customers of testing services (23andMe, deCODEme, Navigenics, …) can use it to learn more about their DNA. It can also pool the data from multiple testing services. The program runs for approximately 3 hours. An optional $2 payment per run unlocks extra features and reduces runtime to approximately 5 minutes.”

Similar to several of the other autosomal SNP scan testing companies, Family Tree DNA allows the customer to download their own DNA testing results.  Autosomal results and X-chromosome results are separately downloaded as compressed files which can then be extracted for analysis.  After downloading and installing Promethease, I ran the program using just my Family Finder results (after paying the $2 for a faster runtime.  I’m impatient.).

Promethease was  indeed able to analyze my Family Finder results and returned a report that included 1881 annotated genotypes. Here, for example, is a screenshot from my results (click to embiggen):

In addition to the “most interesting snps” category, there are categories for “medicines”, “medical conditions” (below), and others.  After clicking on “more” for each category, I receive more information about those annotated SNPs.  To get an idea of what the full results look like, there are a number of people who have shared their real promethease reports.

Promethease also lets you upload your results from different companies, so I also analyzed my Family Finder results together with the results of my 23andMe test.  Since there isn’t much overlap between the SNPs in the FTDNA test and the SNPs in the 23andMe test (see this ISOGG Wiki page for more information about FTDNA’s testing versus 23andMe’s testing, for example), I was able to extract information about 7691 of my personal genotypes using the SNPedia database (compared to 1881 genotypes with my Family Finder results alone).  Thus it appears that the 23andMe results are more likely to contain SNPs that are annotated in SNPedia.  This isn’t surprising considering that, according to reports, FTDNA designed their chip to contain fewer annotated SNPs.

My Results

Since I have taken whole-genome tests before and was familiar with both testing and the interpretation of results, my report was not surprising.  Indeed, I was already aware of my increased risk of type-2 diabetes (see Personalized Genomics: A Very Personal Post ), as well as the fact that I’m “probably light-skinned” (see e.g., my bathroom mirror).  However, it might not be clear to those taking these tests that the results contain a large amount of medically-relevant information.  This can be problematic when considering the fact that Family Finder test-takers might share or reveal their data with other people.  Indeed, even knowledge that you share a region of DNA with another person can reveal medically-relevant information that the two people share in that region.

On the other hand, this ability to apply Family Finder results to information in SNPedia will be of great interest to a number of test-takers who are interested in this type of genetic analysis.  This type of “do-it-yourself biology” is becoming more and more popular everyday.  Although there is still much debate regarding the utility of such information, exploring one’s genome can be highly interesting, informative, and interesting (and, to date, no one has adequately shown that exploring one’s genetic data is harmful for anything other than a tiny minority of people).

Conclusions

In conclusion, it is important for consumers to realize that ALL genomic information has the potential to reveal medically-relevant information (even Y-DNA and mtDNA results can include health information, for example).  By no means, however, am I suggesting that people should forgo whole-genome SNP scans, or that governmental regulation is needed.  Instead, I think it is vital that consumers understand the testing process and possible outcomes before testing, and I fully believe that it is the consumer, not the government, who should decided whether the consumer should or can undergo testing.

Indeed, rather than expend thousands of dollars in hearings, [faulty] investigations, and regulation, the government could use that money to fund programs that educate the population about genetics and DTC testing.  After all, we are entering a future that will involve our personal genomes in many aspects of our lives.

I’m interested to hear your thoughts on this subject, so please feel free to leave a comment below.

(Disclaimer: Please note that I received my Family Finder test without charge from Family Tree DNA for purposes of this review.  Regardless, I have attempted to review this product as honestly and as objectively as possible in order to provide valuable information about Family Finder to my readers.  I am also a consultant for Pathway Genomics.)

Using Genome-Wide SNP Scans to Explore Your Genetic Heritage

Mary Carmichael, a science editor for Newsweek, is in the midst of a week-long dilemma.  This Friday, after reading a series of articles written by members of the DTC genetic testing community, she will decide whether she should purchase a genome-wide SNP analysis.  Although the decision might be a simple one for some, in light of the recent critique of DTC genetic testing in the media, in the literature, and by the government, it is certainly understandable that Mary is looking for further insight into her decision.

Today, Mary is asking “What Can I Learn From At-Home DNA Tests?” and has gathered answers to her question from a wide variety of writers and scientists, including myself.  Since the Newsweek site only has space for a brief introduction to each topic, this post is meant to be a more in-depth answer what Mary could learn about her ancestry from a DTC test.

Genome-wide SNP scans explore a test-taker’s autosomal DNA, the 22 pair of non-sex chromosomes found inside the nucleus of each of our cells (although some tests examine the sex chromosomes as well as the mtDNA).  Rather than sequence the entire genome, an endeavor that is still too expensive for the average consumer, genome-wide SNP scans analyze roughly 600,000 locations in the human genome.

Using the results of a SNP scan, testing companies offer an array of educational and/or recreational analyses that offer exciting and informative insight into ancestry, medical propensities, and phenotypic traits such as eye color.  However, these tests are not without certain concerns and limitations.

To help Mary – and perhaps you – make an informed decision about whether to purchase a genome-wide SNP test, we will explore several of the most important benefits and limitations of the ancestral side of autosomal DNA testing.

Explore Your Genetic Tree

One of the most important – and confusing – concepts that people who are new to autosomal testing encounter is the fact that everyone has both a Genetic Tree and a Genealogical Tree.  Your genealogical tree includes every one of your ancestors throughout history.  Your genetic tree, however, only includes those ancestors who were lucky enough to contribute DNA to your genome.

Your parents are absolutely in your genetic tree, as are your grandparents and great-grandparents.  Go back a few more generations, however, and your genealogical ancestors start disappearing from your genetic tree.  Thus, your genetic tree is actually a tiny subset of your genealogical tree.  Further, while a genealogical tree remains constant (an ancestor will always be in a particular genealogical tree), a genetic tree changes with every new generation (that is, some ancestors will fall off the genetic tree with each new generation).

I recently posed the following hypothetical questions regarding the genealogical tree vs. genetic tree issue:

“At 10 generations, I have approximately 1024 ancestors (although I know there is some overlap). How many of these ancestors are part of my Genetic Tree? Is it a very small number? A surprisingly large number?”

“What percentage, on average, of an individual’s genealogical tree at X generations is part of their genetic tree?”

Luke Jostins at Genetic Inference kindly looked into my questions and offered some helpful and creative insight. Using a statistical analysis that he based on data from a recent study, Luke concluded that. based on his data, on average only about 120 of our 1024 genealogical ancestors at 10 generations (or 11.7%) are our genetic ancestors.  Luke also concluded that:

“The probability of having DNA from all of your genealogical ancestors at a particular generation becomes vanishingly small very rapidly; there is a 99.6% chance that you will have DNA from all of your 16 great-great grandparents, only a 54% [chance] of sharing DNA with all 32 of your G-G-G grandparents, and a 0.01% chance for your 64 G-G-G-G grandparents. You only have to go back 5 generations for genealogical relatives to start dropping off your DNA tree.”

So what does this mean for Mary?  It means that it is important for her to note that her autosomal ancestry testing results will only apply to her genetic tree, not to her entire genealogical tree.  With this limitation in mind, we can explore some of the analyses offered by most autosomal ancestry testing companies.

Discover Your Ancient Ancestry

Genome-wide SNP scans often explore the test-taker’s Y-chromosome and mitochondrial DNA (or “mtDNA”).  The Y-chromosome is passed down from a father to only his sons, so only males possess a Y-chromosome.  The mtDNA, however, is passed down from a mother to her children, so everyone has mtDNA.

(Image Courtesy of Wapondaponda)

SNP testing has been used for well over 30 years to classify Y-chromosomes and mtDNA into discrete but related groups called ‘haplogroups.’  For example, all humans on Earth are maternally descended (i.e. through our mother’s mother’s mother’s mother…and so on) from Mitochondrial Eve, a woman who is believed to have lived about 200,000 years ago in Africa.  Mitochondrial Eve passed on her mtDNA to all humans who are alive today.  However, in the intervening 200,000 years, her mtDNA has repeatedly branched off into different haplogroups through the accumulation of SNP mutations.  Thus, through analysis of a few SNPs on the mtDNA genome, an individual’s mtDNA can be classified into a particular haplogroup.  And, since research has shown that particular SNP mutations arose in certain areas at certain times, it is often possible to assign the haplogroup – and thus the test-taker’s ancient ancestry – to a broad geographic location.

Similarly, all humans on Earth are paternally descended (i.e. through our father’s father’s father’s father’s father…and so on) from Y-chromosomal Adam, a man who is believed to have lived about 80,000 years ago in Africa.  Just like mtDNA, Y-chromosomes have accumulated SNP mutations that allow scientists to group them into haplogroups and broadly estimate the time and place in which the mutation arose.

SNP testing of my own mtDNA haplogroup, for instance, has shown that it belongs to haplogroup A2 which is most often found in Native populations in the Americas.  This suggests that my mother’s mother’s mother’s…mother was Native American.

SNP testing of the Y-chromosome and mtDNA does have some limitations.  As critics often like to point out, at best Y-chromosome and mtDNA analysis only reveals information about 2 ancestors.  However, this fact typically doesn’t prevent the genetic pioneers and explorers from taking these tests, since learning about those 2 ancestors can be extremely rewarding and enjoyable.

Although Mary does not have a Y-chromosome, her genome-wide SNP scan will include an analysis of her ancient maternal ancestry using her mtDNA.

Reveal Your Genetic Admixture

One of the most exciting products offered by autosomal ancestry testing companies is the admixture analysis.  This analysis examines segments of the autosomal DNA and determines for each segment whether it was likely to have been inherited from ancestors in Africa, Asia (including Native Americans), or Europe (and sometimes sub-populations in those regions, depending on the test).  For example, my 23andMe test suggested that I am 97.89% European, 1.84% Asian, and 0.27% African, as shown in my 23andMe Ancestry Painting:

So do these results mean that exactly 97.89% of my genetic ancestors were European, and that 1.84% were Asian?  Not really.  An individual’s admixture results can differ from company to company based on which SNPs are used for analysis, which reference populations are used for analysis, and which algorithm is used for the analysis.  For example, my deCODEme admixture analysis (also based on my 23andMe results) suggested that I am in fact 87% European, 9% European, and 4% African.

While the exact percentages can vary, the admixture test results provide the test-taker with an important and previously unavailable glimpse into their genetic heritage.  For example, my results unexpectedly revealed African ancestry, which in hindsight this makes perfect sense considering that I likely have Garifuna/Caracol ancestry from the island of Roatan.  As my own results show, the genome can hold long-forgotten information about your personal heritage that can be uncovered and explored for the first time in hundreds of years.

So what does this mean for Mary?  On the one hand, it means that with the purchase of a test she can receive her own admixture results and explore her genetic heritage, thereby possibly uncovering long-hidden ancestry.  On the other hand, it means that: (i) her admixture results will not be absolutes, and that they might change if new SNPs, data, or algorithms are used; and (ii) she should be prepared for the possibility of unexpected results.  In my own experience, however, I’ve found that many consumers are fascinated by unexpected results.

Identify Genetic Cousins

Another primary driver of the autosomal ancestry market is the ability of test-takers to identify and connect with genetic cousins, both close and distant.

Again, however, the dichotomy of the genealogical tree versus the genetic tree is important.  It’s important to note that everyone of us have both genealogical cousins to whom we are related because we share a common ancestor with them, and genetic cousins to whom we are related because we both inherited DNA from a common ancestor.  While a cousin can be both genealogical and genetic, often they will only be one or the other.

The genetic ancestry market has developed specific tools that allow consumers to identify and connect with genetic cousins.  23andMe offers Relative Finder, a service that looks for segments of DNA that a customer shares with other 23andMe customers.  If two people in the 23andMe database share a segment of DNA, this suggests that they share a common ancestor.  The amount of shared DNA, which is reported by the companies, can suggest how recent their common ancestor was.  Thus, in addition to identifying genetic cousins these products offer a suggested relationship range for the cousins.  Similar to 23andMe’s Relative Finder, Family Tree DNA offers a product called Family Finder which compares segments of the test-taker’s DNA to DNA in the Family Finder database.

Once a match is discovered, the genetic cousins can share their genealogical trees as well as their suggested relationship in order to identify an overlap in their trees.  That overlap is potentially the common ancestor from whom they inherited the same stretch of DNA, and thus this ancestor is located in their genetic trees.

For example, I’ve discovered a relative in Family Finder with whom I share at least 12 ancestors in the past 200-400 years.  Many other customers report success finding a single shared ancestor with their genetic cousins.  Identifying a shared genetic ancestor, together with the segment of shared DNA from that ancestor, allows genetic cousins to trace the path of the shared segment through both time and space from that ancestor to themselves.

A genome-wide SNP scan, therefore, will potentially allow Mary to track segments of her genome through both time and space!

Possibly Reveal Medical Information

In addition to ancestral information, most autosomal DNA testing companies offer some type of analysis regarding physical traits and medical propensities.  Other companies offer limited tests that look at only ancestry or only medical information.  Regardless of the type of test purchased, it is important to note that it is almost impossible to separate medically-relevant DNA from ancestrally-relevant DNA.

Although a SNP or short stretch of DNA used for ancestral analysis may not currently harbor any known data regarding physical traits or health information, a paper could be published the very next day which shows that the same bit of DNA actually reveals one’s propensity for a certain medical condition.  As a result, sharing raw DNA data with another individual – even if that data is only believed to reveal ancestral information – can potentially reveal medical information.

What does this mean for Mary?  Education is always the best method of preparation.  Knowing that raw data can reveal medical information, Mary will be aware of the possibilities and can use that information to decide the level of sharing she is most comfortable with.

Do-It-Yourself Biology

In addition to the myriad tools offered by DTC genetic ancestry testing companies, the ability to download raw data gives users the ability to explore their ancestry and genome in other ways.  Several members of the DTC genetic testing community and the genetic genealogy community have created novel tools for further analysis of DTC test results.

David Pike, for example, has created a suite of tools for analyzing raw data from 23andMe and/or FTDNA, including the following:

Dienekes Pontikos of Dienekes’ Anthropology Blog created Euro DNA Calc, which uses 23andMe or deCODEme data to calculate the probability that an individual is Northwest European, Southeast European or Ashkenazi Jewish.

Promethease is a free utility that uses the SNPedia database to generate a report about medical information and traits.  It uses raw data from most of the major testing services and can even pool data from multiple testing services.

Adriano Squecco maintains the Y-Chromosome Genome Comparison database, which is a spreadsheet of raw Y-DNA results from male 23andme customers.  The data is being used to identify new SNPs for Y-DNA testing.

Dr. Doug McDonald performs a BGA Analysis, which analyzes raw data to determine global similarity and admixture percentages.

These tools offer analysis beyond what testing companies currently offer, allowing the user to be an early explorer in this area.

Conclusions

Through autosomal DNA testing, Mary can learn about her genetic heritage and connect with long-lost genetic cousins to explore her genetic family tree.  Further, Mary can use her results in novel do-it-yourself ways using tools developed by the DTC genetic testing community.  By being cognizant of the privacy issues and limitations known to be associated with genetic ancestry testing, Mary can also make informed interpretations of her data and decide with whom she will share her results.

(Potential Biases:  Although I don’t have a direct financial stake in the success of DTC companies, I have performed consulting work for Pathway Genomics, a DTC genetic testing company, and have received a  complementary SNP scan from 23andMe and Family Tree DNA for reviews here on the blog.  I am opposed to any unreasonable or paternalistic regulatory barrier to our genetic information, but I also believe that potential test-takers should perform their own research and investigation into genetic testing in order to understand the benefits and limitations of these tests prior to purchasing a test.)

A Review of Family Tree DNA’s Family Finder – Part I

Since late 2007, several “direct-to-consumer” or “DTC” genetic testing products have entered the marketplace, many of which offered some degree of autosomal ancestry analysis (including 23andMe, deCODEme, and Pathway Genomics, among others).

In early 2010, genetic ancestry testing company Family Tree DNA announced that it would begin offering a new genetic genealogy product (see “Announcing Family Finder – An Autosomal Test From Family Tree DNA”).  The new product, called “Family Finder,” is one of only a very few autosomal genetic genealogy tests available to consumers.

The Family Finder test uses an Affymetrix microarray chip that includes over 500,000 pairs of locations called single nucleotide polymorphisms (SNPs) in your autosomal DNA.  Once the SNPs are analyzed, FTDNA detects linked blocks of DNA that indicate a common ancestor.  The number and size of these linked blocks is used to determine how recently or closely two people are related.  From the Family Finder FAQ page:

“The Family Finder test works by comparing your autosomal DNA to that of other people in our database who have taken the test. Your relationship with a match is calculated based on sharing linked segments of DNA. Although any two people from the same population may have some of their DNA in common, as a matching segment of DNA becomes longer and you share more segments, it becomes more likely that the sharing is due to a recent common ancestor than a chance match.”

Thus, the results of the Family Finder test are used to find stretches of DNA shared by two individuals, to identify your “genetic cousins” (as compared with “genealogical cousins,” who you may or may not share DNA with).

The Family Finder landing page is packed with information, including videos and information about the potential uses of the product:

“We place you in control. When you take the Family Finder test, your results are compared against our Family Finder database. Your list of matches is designed to be quickly sorted to allow you to focus on your near or distant cousins. Because email addresses are provided for easy communication with your near or distant cousins you will be able to share research easily. We notify you by email when you have new matches. Your raw data file is freely available for download.”

Frequently Asked Questions Page

The Family Finder FAQ page is especially well-developed for such an early stage product.  There are currently over 75 FAQs including a wide range of questions and answers, including the following:

Question: What is the probability that my relative and I share enough DNA to be detected by Family Finder?

Answer: If you are related within five generations (3rd or more recent cousins) then Family Finder is almost sure to detect your relationship. Testing will also detect many 4th and 5th cousins and a small percentage of more distant cousins.  Chances of finding a match if the relationship is:

Relationship Match Probability
2nd cousins or closer > 99%
3rd cousin > 90%
4th cousin > 50%
5th cousin > 10%
6th cousin and more distant remote (typically less than a few percent)

Connecting with Cousins:

Unlike 23andMe’s Relative Finder, where communicating with genetic relatives in their database can be challenging (although 23andMe is launching improvements to the system that will make identifying and communicating with relatives easier), this product is intended for and marketed to genealogists.   The results are provided using the following format (picture courtesy of the ISOGG wiki Family Finder page, image has been altered for privacy reasons):

The results provide information about the identified genetic cousin, including the suggested relationship, the predicted relationship range, the shared cM (centimorgans), the longest block of shared DNA, and the ancestral surnames that the user has provided in their profile (if any).  Also provided is a link to the user’s email address to facilitate communication.

As a result, there are several privacy issues involved in the Family Finder test that test-takers should be aware of.  It is important to recognize that your name and the email address you sign up with will be made available to your genetic relatives.  For most genealogists this is a welcome development, but it is worth highlighting.  Additionally, if you share closely-matching DNA with an individual, that individual will see your name in their results and can share that information with other people.  Although ethically all test-takers should always keep these privacy issues in mind, there is nothing to prevent them from sharing the information.  Please be informed before you order this test.

Chromosome Browser:

Family Finder also provides a Chromosome Browser which test-takers can use to explore and compare the blocks of DNA that they share with genetic cousins.  Users can compare the blocks of up to 3 people, and can filter blocks from 10+ cM, 5+ cM, 3+ cM, down to 1+ cM.  Users can also view the comparison information in a table and download it to an Excel file.

Download of Results

Like 23andMe, Family Tree DNA offers customers the ability to download the results of the SNP test.  The autosomal results and X-chromosome results are offered in separate zipped files.

My Results:

I currently have 33 genetic relatives in the Family Finder database with the following break-down:

  • Only one person with a suggested relationship (my closest relative in the database), suggested at the 4th cousin stage, with a range of 3rd to 5th cousin;
  • Eight cousins at the 4th cousin to distant cousin stage; and
  • 24 cousins at the 5th cousin to distant cousin stage.

I am communicating with my matches in order to identify a shared ancestor in our respective trees.   In the one instance where we’ve identified shared ancestry, we share relatives in a minimum of twelve different lines (via the early colonial era).  I’ve also matched several relatives from an isolated geographic region where I have confirmed recent ancestors, although we have not yet identified a common ancestor.

Future Developments

Ancestral Percentages

At the current time, the Family Finder test results do not include information about possible ethnicity or biogeographical ancestry.  However, it appears that Family Tree DNA plans to offer this type of information in the future.  See, for example, “Relative Finder vs. Family Finder” at The Melungeon Historical Society blog.  There Roberta Estes writes the following:

“Family Tree DNA does not initially offer the percentages of ethnicity, but that will be added shortly. The 23andMe ethnicity percentages (European, African and Asian) are very, very conservative and I believe so conservative as to be significantly incorrect. Suffice it to say that I have been involved with the new ethnicity percentage information and presentation at Family Tree DNA, and it will blow the socks off of anything out there today.”

23andMe Results at FTDNA

What if you’ve already tested at 23andMe?  Once again, Roberta Estes writes the following (which includes information I’ve seen at several other places):

Family Tree DNA will (shortly) facilitate an upload of 23andMe raw data for a $40 and they will then compare the 180,000 (280,000 by inference) common locations between their data base participants and your 23andMe data. If you later decide to take the Family Finder FtDNA test, they will credit your $40 to that test. Only the people who ordered the full health traits and ancestry version of the 23andMe product can gain access to their raw data at 23andme. Everyone who participated in the beta can download their raw data.”

Experiences and More Information:

Family Finder Links:

Conclusions

I first had part of my genome sequenced over 7 years ago via an AncestryByDNA test.  Since then I’ve had mtDNA sequencing, Y-DNA sequencing, SNP scans, and a number of other tests performed.  Accordingly, I consider myself to be an early explorer in the field of DTC genetic testing.  I enjoy learning about my genetic ancestry, about genetic cousins, and about my own genome.  Many of the other early adopters of the Family Finder test are also pioneers.  I would recommend this test to anyone who is interested in their genetic ancestry, or anyone that is interested in learning more about their own genetic heritage.

One of the best things about the Family Finder test is that it gives the user information and then allows them to use that information as they so choose.  Although the test does reveal your name and email address to genetic relatives, it is up to you whether you reply to requests or explore those relationships.  Family Finder is yet another tool that allows personal genome explorers to learn about themselves.

Have you used FTDNA’s Family Finder test?  I’d love to hear about your experiences in the comments section.

More Soon…

Stay tuned, in the next week or so I’ll be posting more of my review of Family Tree DNA’s Family Finder, including some advanced tools for Family Finder and/or 23andMe users .

Disclosures

I received my Family Finder test without charge from Family Tree DNA for purposes of this review.  Regardless, I have attempted to review this product as honestly and as objectively as possible in order to provide valuable information about Family Finder to my readers.  I am also a consultant for Pathway Genomics.

A Mother’s Day Post

In honor of mother’s day, I’m reposting a portion of an entry from March 16, 2009 (“Visualizing Your Genetic Genealogy“).  It also follows a SNGF from Randy at Genea-Musings called “Matrilineal Line.”

In my genealogical research, I have sometimes found myself missing the trees by focusing on the forest.  I think it happens to many genealogists – we get caught up in the research, the dates, the places, and we forget that there was so much more to people than their vital statistics.

This can happen to genetic genealogists as well.  The connection between the results of a DNA test and the individuals in our tree can be easy to forget and difficult to visualize.  Take the results of an mtDNA test, for example.  The results are obtained from a tiny piece of DNA that has traveled thousands of years (and often thousands of miles) through hundreds of individuals to end up in your cheek cells and on the tip of a swab.  Everyone’s mtDNA is the product of an amazingly rich story that has largely been lost to history.

However, we as genealogists can do our part to connect the DNA to as much of the story as possible and prevent further loss.  In your own recent past, who were the people that contributed your mtDNA, your Y-DNA, or your autosomal DNA?

Visualizing My mtDNA Line

This is a compilation of the five most recent generations of my mtDNA line over the past 125 years, as shown in photographs:

Cora’s mother was Sarah L. Bodden, born January 1846.  Sarah’s mother was Julia Ann Rebecca, of which very little information is known.  What I do know, however, is that my mtDNA Hapl0group is A2, meaning that my matrilineal line is Native American.  Thus, Julia Ann Rebecca’s mother, grandmother, or more distant maternal ancestor was Native American, most likely of Central American or Caribbean origin.

Happy Mother’s Day to all my maternal ancestors.

Thank You: The Genetic Genealogist Named Among Family Tree Magazine’s 40 Best Genealogy Blogs

Family Tree Magazine 40 Best Genealogy Blogs

Late last fall, Family Tree Magazine requested nominations for the best genealogy blogs, and then opened voting for the nominated list.  Yesterday, they announced the winners of the voting.  Diane Haddad wrote about the announcement on the Genealogy Insider blog, and Maureen Taylor wrote the article that will appear in the May issue of Family Tree Magazine: “Fab Forty.”

I am very pleased and honored to announce that TGG was selected as one of the 40 Best Genealogy Blogs, in the category of genetic genealogy. I would like to thank everyone who nominated and voted for me.  I have been very fortunate over the last few years to interact with a fascinating array of readers, and I am thankful for every one of them.

When I started blogging in February 2007 (I just recently counted my third anniversary of TGG!), there were very few blogs in the genetic genealogy space.  Today there are a number of interesting and well-written genetic genealogy blogs.  See my recent round-up at “10 Great Blogs for Genetic Genealogists.“  Each of these blogs is well worth adding to your reading list.

I would also like to congratulate all the other blogs on the list, as I am truly honored to be listed among them.  I am an avid reader of the vast majority of them, and I look forward to so much more.  Here are few links to their own announcements:

And here is the full list of winners.  A huge congratulations to them, as well as to all the blogs that were nominated:

All-Around

Cemetery

Corporate

Genetic Genealogy

Heritage

How-To

Local & Regional

News & Resources

Photos & Heirlooms

Personal & Family

Columbia Professor Alondra Nelson Reviews The PBS Series “Faces of America”

Faces of America

In October 2008, I reviewed an article by Dr. Alondra Nelson in the journal Social Studies of Science entitled “Bio Science: Genetic Genealogy Testing and the Pursuit of African Ancestry” (Social Studies of Science 2008 38: 759-783).  The article was about the complex interpretation of the results of genetic genealogy testing by African-Americans and black British.  Dr. Nelson is Associate Professor of Sociology at Columbia University in NY.

On Friday, an article by Dr. Nelson appeared in The Chronicle of Higher Education entitled “Henry Louis Gates’s Extended Family,” which is an introduction and review of the current PBS documentary miniseries Faces of America. Regarding the genetic testing aspect of the show, Nelson writes:

If the findings of conventional genealogical research produce fireworks, the results of the DNA analysis generate shock and awe. “Know Thyself,” the final episode, which shares its title with the slogan of Knome Inc., focuses mostly on genetic genealogy. Whereas prior shows relied heavily on analysis of mitochondrial DNA (mtDNA) and Y-chromosome (Y-DNA), yielding results that included at most about 2 percent of one’s complete genetic inheritance, in Faces techniques are used that probe deeper into more of the genome.

The technical aspects of genetic ancestry tracing are explained, but without sufficient social context, much the way a manual can tell you how to operate a car without explaining automobiles’ role in modern industry, the development of suburbia, or the emergence of youth culture. We can’t hold a documentary for a general audience responsible for not presenting a complex metanarrative on the philosophy of genetic science. But we can expect some acknowledgment and interpretation of technology’s limits.

It is likely that some genetic genealogists will instantly disagree with or discredit Nelson after reading this article, since it might appear that she is being critical of genetic genealogy, but I would disagree.  In my opinion, however, it is important to be aware of Nelson’s concerns, since they are concerns shared by many people across the globe.  For better or for worse, Faces of America will be many individual’s first introduction to genetic genealogy, and without seeing the whole series yet, I hope that Gates does a fair job of introducing this wonderful technology without glossing over its limitations, particularly as they might apply to minority or marginalized populations.

That being said, I also believe that the individual shares the responsibility for understanding this technology before deciding to undergo testing.  We are all responsible, in part, for our own education.

Rather than discrediting genetic genealogy, I believe that Nelson embraces the ability of genetic testing to help some people – and ultimately society – understand our present and our past, as well as how we are all so closely related, either through our genetics or through our shared history.  Indeed, the end of the article ends with the note that Nelson “is at work on a book about genetic ancestry tracing and African diaspora culture,” which I look forward to reading.

What are your thoughts after reading Dr. Nelson’s article?

Faces of America continues every Wednesday evening from 8 – 9 p.m. ET on PBS stations through March 3rd.

A New Meme: How Many of Your Ancestors Are In The SSDI?

The Social Security Death Index (SSDI) is a searchable database created from the U.S. Social Security Administration’s Death Master File, which contains the name and social security number of deceased persons reported to the Social Security Administration since roughly 1962.  In addition to being used by genealogists, the Death Master File and SSDI are used by financial firms and government agencies for various reasons such as preventing identity fraud.

A Genealogy Meme Using the SSDI

Michael Neill at RootDig has two posts – “Have You Searched for All Your Ancestors in the SSDI?” and “My in-laws in the SSDI” – that list his and his wife’s ancestors in the SSDI.  Michael has 7 ancestors, while his wife has 6.

This led me to wonder how many ancestors I have in the SSDI, and a very brief search led me to conclude that I currently have a total of 8:

  1. Theodore LaBounty 1927-1983
  2. Jane (Garcia) LaBounty 1931-1984
  3. Theodore LaBounty 1903-1963
  4. Goldiah (Blanchard) LaBounty 1906-1996
  5. Roy Bettinger 1916-1975
  6. Marley (Johnson) Snell 1889-1983
  7. Victor Mullin 1901-1972
  8. Clara (Fitzgerald) Mullin 1907-1997

Eventually I will have a total of 11 ancestors in the SSDI, but my parents and a grandparent are still, thankfully, living.  My wife also has a total of 8 ancestors in the SSDI:

  1. Harlon Conger 1921-2005
  2. Lois (Finney) Conger 1891-1975
  3. John Alden 1900-1971
  4. Margaret (Wolford) Alden 1902-1991
  5. Inez Simmons 1891-1979
  6. Albert Bacon 1895-1963
  7. Guy Simmons 1921-1989
  8. Margaret (Bacon) Simmons 1929-2007

Other Questions

Out of your ancestors in the SSDI, who had the earliest date of birth?  Mine is Marley (Johnson) Snell who was born in 1889, and my wife’s is a tie between Lois (Finney) Conger and Inez Simmons, both born in 1891.

How many of these ancestors did you meet (whether you remember it or not)?  I met 5 of my 8 ancestors in the SSDI, and my wife met 4 of hers.

How many ancestors do you have in the SSDI?

Who Is The Oldest Relative You Remember Meeting?

The Evansville Courier & Press has a great article – “At 97, life is worth a big fuss: Six generations gathered at matriach’s birthday party” – which contains a picture of six generations of the Moore Family of Indiana.  The picture shows a newborn and 5 generations of her ancestors; her mother, grandmother, great-grandmother, great-great-grandfather, and great-great-great-grandmother!  It is truly amazing and I highly recommend clicking over to the article to see it.

My Mother’s Mother’s Mother’s Father’s Mother (whew!)

The picture led me to wonder who was my mother’s mother’s mother’s father’s mother (following the same lineage in the article’s picture), and whether I ever met her.  After consulting my family tree software (maybe I could have done it from memory, but I thought I’d save some time!), I discovered that her name was Jemima Cooper.  I never had the opportunity to meet Jemima because she died 53 years before my birth.  She would be 118 years old today.

Then I wondered how many of my other relatives in this generation I had met.  Unfortunately I never met any of my 32 great-great-great-grandparents since the last one died in 1940 (over 35 years before I was born).  Likewise, I never met any of my 16 great-great-grandparents, although I missed the death of the last one by just 13 years.

Of the 3 great-grandparents who were alive when I was born, I met all 3 (born in 1889, 1906, and 1907).  Marley, born in 1889, died in 1983 and one of my earliest childhood memories is of meeting her.

Who Do You Remember?

Did you know any of your great-great-great-grandparents?  Great-great-grandparents?  Who is the oldest relative you remember meeting?

Article via Thomas MacEntee.