The Implications of Genetic Data in the Public Domain

Exactly one week ago, I published my genetic data on github and placed it into the public domain. The response was overwhelmingly positive and the coverage was far greater than anticipated. Here’s a short list of surprising outcomes from that single blog post:

  • It took less than a day for someone to fork and edit the genetic data – and the pull request was hilarious.
  • This started a very popular discussion thread on YCombinator’s Hacker News Channel. Another one was started for the original post.
  • The sheer number of bioengineering/computer science nerd-jokes was staggering. I loved how much fun everyone was having with the post and the genetic data.
  • Engineers from 23andme (the service that sequenced bits of my genome) and from Illumina (the people that make the beadchip that analyzed my genome) left comments and sent me very nice and supportive e-mails. They’re really passionate about what they do – which is always great to see.
  • My tweet hit the front page of Twitter.
  • The blog post hit the front page of Slashdot shortly thereafter.
  • A large magazine in Russia did a story on me, and I found out that it’s illegal to send any of your genetic material outside of Russia to have it analyzed.
  • The founder of contacted me and performed a free analysis on my genome.

Most importantly, there were tens of thousands of people that read the blog post and learned a little more about this exciting new area of research and discovery. There are already folks that have started projects based on the availability of my genetic data. I’m positively thrilled and humbled.

In the previous post, I had mentioned that there were many privacy implications to releasing your genetic data to the world, whether it is in the public domain or not. This post will cover what some of those implications are and how I reasoned my way through them in order to come to the conclusion that it would be safe to release my genetic data into the public domain. The rest of this post will be broken into two sections – short-term and long-term concerns. Each section will contain questions and brief reasoning on the implications.

Short-Term Implications

The short term privacy concerns revolve around things that may happen in the next week, next year, or during the next 20 years. I am 32 years old now, so these are the things I was concerned about happening before the year 2031.

As you read the rest of this post, keep in mind that I tend to be fairly optimistic about how people treat each other based on prior knowledge about health and ancestry. Yes, there have been horrible exceptions (ethnic cleansing, racism, etc.) throughout history, but the people of the world eventually work these atrocities out. Humankind is self-serving and self-preserving, no doubt – but these inherent traits are not incompatible with treating each other with dignity. I think global society, for a large part, works because people are inherently good – perhaps one day we’ll find the genetic markers that elaborate on why.

In many ways I choose to publish my data because I believe that it will help others. Of the dangers below, I tend to perceive them as small enough, and the potential benefits large enough, such that releasing my genetic data into the public domain will be a net-positive act.

What if someone would use this data against you?

My genetic data is fairly average. It shows no terrible health risks. I don’t think it would necessarily hurt my chances of getting a job or being denied for health insurance any more than any other average American pursuing a job or health insurance with the same genetic profile. One of the questions I found is whether or not your genetics could be used against you when applying for a job or health insurance. Due to the Genetic Information Non-Discrimination Act of 2008 (GINA), it is highly unlikely that this would happen (in a public or documented manner) as it is now a punishable federal offense to discriminate based on genetic information.

That does not mean that an employer won’t Google your genetic information and find something that they don’t like and not hire you because of it. There are many illegal reasons an employer may not hire you, and if they’re smart, they’ll never tell you the real reasons. For example, they may not hire you due to your race, age, or accent. Speaking as the owner of a company, in almost every case, these reasons would be bad decisions on the part of the company.

Typically when you try to hire someone, you want the person that is best qualified. Rarely do their genetics and ancestry come into play. If they do, more likely than not, you’re looking for a reason to not hire them. Their behavior and how they get along with others plays far more heavily into the decision. At the moment, we don’t have clear genetic markers that give us strong clues as to the nature of human behavior. If someone is looking for an excuse not to hire you, any irrational reason will do – including your genetic information. The only words that you will ever hear are “We’re sorry, but you didn’t seem to be a good fit for the position.”

What may come into play more often are genetic differences in height and weight. For example, I’m a fairly small person. If I were to apply for a job as a stone mason against someone else that is a large framed, muscular person with the exact same qualifications, my genetic information could be used to determine that I was not the best candidate for the position. However, even in this case – a quick face-to-face meeting could determine that information. Even though my genetics are being used against me in the interview, it would never surface in the conversation. The equally-skilled, large framed, muscular person could probably get more done in a day than I could. They have that genetic advantage and since a good business owner wants to make the most reasonable decision, picking the larger person over me will give their company an advantage. This sort of “genetic discrimination” may happen, but it’s not something that I’m necessarily interested in protecting myself from because the decision is being made for completely rational reasons.

Now, let’s assume that I have markers for some sort of debilitating disease that is not known now, but the genetic markers are found for it in 10 years. Since my genetic information is already out there – people will eventually know that I have this condition. It may cause me to not be able to perform certain types of jobs, but even in that case – knowing this information could help both me and an employer deal with the condition if it were to arise. Keep in mind that many of these markers just signal an “increased chance”, they’re not a certainty. Genetic markers cannot predict exactly when you’re going to have a stroke or die of cancer because a large part of those diseases are environmental – some more so than others.

So, while the information could be used against me by an employer or a healthcare provider – it would have to be done in such a way where I would never know that it was used against me. There are many other things, other than genetic information, that may fall into the same category. Couple this with the fact that using this information to discriminate is illegal and we find ourselves in a world where, like race and ethnicity, this sort of information will increasingly not be used to discriminate.

What about the privacy of your relatives?

By divulging my genetic data, I am inevitably divulging short sequences of my family’s genome. Using my data, one could find out who my family members are (if they had access to their genetic information) and potentially, whether or not they’re susceptible to the same diseases and health risks that I am. However, keep in mind that if someone wanted to find out my genetic data or one of my family member’s genetic data, all that they would have to do would be to follow them around. We shed genetic information in droves everyday. We do this on the cups we drink from, by blowing our nose, using a public rest room, brushing our hair and eating. We are a fountain of DNA – gushing our genetic information onto every surface that we touch.

I view how this information could be used against my family members in the same way that I view how this information could be used against me. The previous question delved into how the information could realistically be used against them. I don’t think the risk is high for the reasons stated previously.

As for finding out if I’m related to someone else – public records are a better place to look for that sort of information. If someone were to want to find a terrible genetic secret in a family member’s DNA, there are plenty of other places that they could get a sample from to analyze.

Often the shortest path to this information is the most efficient, and computationally analyzing genetic information is very far from the shortest path to the information.

What if someone tried to kill you using your genetic information

As ridiculous as it sounds, I was somewhat irrationally worried about this. I do have a few allergies and one fairly bad one that could cause me to go into anaphylactic shock in the worst case. I was told this by a doctor 15 years ago. I don’t even know if I would still react in the same way – so don’t go trying to kill me using the allergy, it may not work! The marker for this is not in my genetic information yet, but I assume that it is only a matter of time before it is found. However, assume that it is found – if someone wants to get rid of me that badly, I would assume that they’d take a more direct route than trying to use allergies as my downfall.

The people that know me well know that I avoid this particular allergy. If I died from it, there would be a very strong suspicion that something was amiss.

In reality, if someone wants you dead that badly – they’ll find a way or get caught trying. If they’re smart, you’ll never see it coming. A staged suicide may be the way to go. There are also many medications that can cause heart attacks and other life-threatening events that may be untraceable. There are many ways that someone that wanted you gone could get rid of you. Worrying about how your genetic data will play into this will not prevent them from finding an easier way. Assuming that there is nothing in your genetics to help them kill you, unless they’re Jigsaw, they will almost always go for the easiest path.

In the end, not releasing your genetic data to the world because you’re worrying about how someone will kill you with that genetic data is wandering into tin-foil hat territory. It is my opinion that there is far more good that can be done by releasing your genetic data than by not releasing it.

Are you worried about a major company suing you because of some sort of patented gene that you may have?

That’s not really how the genetic patents work. Typically, the RNA process is the thing that is patented, not the SNPs. The patented process usually reprograms DNA in some way, whereas the data I’m publishing are just SNP markers – they don’t do anything and are thus not patentable. The likelihood of a major pharmaceutical company suing me because of the way I was born is abysmally low. The ramifications of the courts allowing such a case to proceed are far reaching – it would place a restriction on freedom to reproduce that the general populace would not allow to exist.

It would be a public relations nightmare for the company that was suing – imagine the headlines: “Pharmacom sues people for being born.” Being afraid of what the most slithery of lawyers and companies will do to you is no way to live. I believe that the likelihood of coming under legal pressure for effectively publishing facts about your genetic makeup is abysmally low. Especially since the benefits of publishing such data is already clear, as outlined in the beginning of this post.

Aren’t you worried about friends or strangers finding out this information about you and potentially judging you because of your genetics?

In short, no. I believe that we are far more than just the genetics that make up our body. There is a great TED presentation by Sebastian Seung about the human Connectome that suggests that the way we wire and re-wire our brains really captures who we are. Our genetics just boot up the hardware – our bodies. It’s the software – our minds – that we use to interact with our friends and strangers. It is largely our mind that determines how strangers and our friends interact with us.

My Connectome is something that I would never, ever share with anyone as it could be used to predict my behavior and that is far more dangerous than being able to know my genetic data. A model that is predictive of what I may do from day to day is an incredibly dangerous thing.

If you think about it, this is what websites and search companies are after when they track our movements online. They want to know more about our Connectome than our genetics. They want to know what we’re thinking, what we want, our fears, what motivates us. I don’t think that one’s genetic data is predictive enough of the thing that truly matters – your Connectome.

I’m not divulging information about my mind, which is very private and not really useful to many people. I’m publishing information about my body, which I don’t view as private information and could be of use to people building tools and software to help other people learn more about themselves.

Long-Term Implications

I define long-term as things that will happen more than 20 years from now to well past when I am dead and buried. I don’t spend much time thinking about the long-term implications not because they’re not important or interesting, but more because it’s impossible for me to predict what may happen with this data more than 5 years out. It’s all just wild speculation. Our global society is making technological breakthroughs at such an increasing pace that it’s difficult for me to grasp what the next 10 years will bring in this area.

Would you ever publish your entire genome?

The 23andme genetic data only contains about 1 million SNPs – about 25MB of data. My entire genome is around 350,000MB of data. Sequencing an entire human genome costs around $20,000 USD today, there are a few people that have done it, but that price is currently far outside of my reach.

However, it’s not always going to be and I think that in the next 20 years, it will be possible to get my entire genome sequenced for less than $250. I haven’t decided whether I would want to publish that data, but I expect that I probably will do so in the future. The reasons will be for the same reasons I published my limited genome last week – to advance science and help people write better tools and software to work with the data.

What if people would use your information to clone you?

The data I released isn’t even close to the required amount necessary to make a clone of yours truly. My answer to this question is heavily influenced by the “Connectome” response above. Your body is not your mind, and I believe that it is your mind that is the thing that is unique to each of us, not necessarily our bodies.

The question really comes down to the intended purpose of the clone. Would I support a clone of me that specifically did not have a brain, but was used to produce biologically compatible organs for life extension purposes? If it was ethically and morally sound, yes. I would prefer it if only the organ necessary for transplant were grown – but the question is more interesting if we don’t have that choice.

If I were to start dying before I wanted to, and an ethically-conscious clone could save my life with multiple organ transplants, I would choose to use the clone to extend my life. If my brain-dead clone could help someone else survive, I would support its use for that purpose. That is, assuming that I have any say over what my clone could be used for. I would expect that I wouldn’t, even in the case where it is brain dead. Would I support the creation of a brain-dead clone of myself and then have my brain transplanted into the clone’s body? Absolutely. I’d love to choose when I died.

To put it another way, if someone wanted to clone my Connectome, I would have strong concerns with publishing that information because the mind is the most private thing you own. You have expended great effort in building it. However, if someone wants to create a clone of me, go right ahead – I had nothing to do with the creation of my DNA sequence. To assert ownership over it is asserting rights that I do not have.

Are you worried about how publishing your genetics will affect your children and descendants?

I am concerned, but I am not worried. There is always the unknown. What happens if there is another Hitler, and it just so happens to be in the country where my children and descendants live? If Hitler had a way of testing genetic information, he would have used it to select those that would be forced into the gas chambers. However, if that happens – my genetic data being public domain would not save any of my descendants from that fate. 23andme already exists and that technology cannot be undone.

I think the day is approaching where we will have a great deal of control over what characteristics our children will have. We may even get the choice to use the best of genomes from around the world – where our children may have some of our DNA, but will also have thousands of other people’s DNA. They will be the best of the best. If you could afford to ensure that your children would have a good body to use throughout their life, wouldn’t you make the decision to do so? Keep in mind that when this technology becomes available, and it is cheap enough, it becomes a viable choice for almost everyone.

When that day comes, wouldn’t you choose the best for your children? Wouldn’t doing anything else be considered negligence on the part of the parent?

Many thanks to Dave Longley for his insight, suggestions and numerous corrections to this article.

Trackbacks for this post

  1. Personal genome in the public domain | Gene Expression | Discover Magazine
  2. Personal genome in the public domain | Gene Expression
  3. Personal genome in the public domain | Biology News by Biologged
  4. Rum and Reason » Personal genome in the public domain | Gene Expression
  5. facebook

Leave a Comment

Let us know your thoughts on this post but remember to play nicely folks!