The Clinical Times
The Front Page of Medicine

Genetics & Molecular · 2003

Human Genome Project Completion

International Human Genome Sequencing Consortium

Logo of the Human Genome Project
U.S. Department of Energy, Human Genome Project / Public domain (Wikimedia Commons)

When James Watson and Francis Crick described the double helix in April 1953, the idea of reading the entire sequence of three billion base pairs was a thought experiment. Fifty years later, almost to the month, it had been done. On 14 April 2003 the International Human Genome Sequencing Consortium announced that the human genome was essentially complete, two years ahead of the schedule set when the project formally launched in 1990.

The publicly funded consortium coordinated sequencing centers in six countries: the United States, the United Kingdom, France, Germany, Japan, and China. Francis Collins directed the effort from the National Human Genome Research Institute in Bethesda, while John Sulston led the Wellcome Sanger Institute's contribution in Hinxton, which alone produced roughly a third of the finished sequence. A parallel and competing effort by Craig Venter's private company Celera Genomics had published a draft sequence in 2001 using a shotgun approach that the public consortium disputed in technical terms, but both publications accelerated the timeline. The finished sequence published in Nature in 2004 covered 99% of the euchromatic genome at an accuracy of 99.99%, resolving roughly 3.2 billion base pairs.

The decision to release data publicly and immediately, formalized in the Bermuda Principles agreed upon in 1996, distinguished the consortium's approach from Celera's and set a precedent for genomic data sharing. Any researcher anywhere could download the sequence from the day of publication, without licensing fees. That open-access foundation allowed the project's results to propagate rapidly through basic science and clinical research communities worldwide.

The immediate downstream effect was the feasibility of genome-wide association studies. By the late 2000s, researchers had used the reference genome to identify hundreds of loci associated with common diseases, including type 2 diabetes, coronary artery disease, inflammatory bowel disease, and multiple cancers. The translation from statistical association to mechanistic understanding and therapeutic target proved considerably slower than early statements from project leaders suggested, but the catalog of variants itself became a foundational tool in human genetics.

In clinical medicine, the completed genome enabled the development of targeted sequencing panels for inherited cancer risk, pharmacogenomic testing for drug metabolism variants, and eventually the whole-genome sequencing pipelines used in neonatal intensive care and rare disease diagnosis. The Human Genome Project also financed and trained a generation of bioinformaticians and genome scientists whose methods underpin the precision medicine programs active in major academic medical centers today.

Key People

Read the original — PubMed

Nature. 2004;431(7011):931-945.

Related landmarks

← All Landmark Moments