Email updates

Keep up to date with the latest news and content from Human Genomics and BioMed Central.

Open Access Open Badges Genome update

Update on the olfactory receptor (OR) gene superfamily

Tsviya Olender1, Doron Lancet1 and Daniel W Nebert2*

Author Affiliations

1 The Crown Human Genome Center, Department of Molecular Genetics, The Weizmann Institute of Science, Rehovot 76100, Israel

2 Department of Environmental Health and Center for Environmental Genetics (CEG), University Cincinnati Medical Center, Cincinnati, OH 45267-0056, USA

For all author emails, please log on.

Human Genomics 2008, 3:87-97  doi:10.1186/1479-7364-3-1-87

The electronic version of this article is the complete one and can be found online at:

Received:5 May 2008
Accepted:5 May 2008
Published:1 September 2008

© 2008 Henry Stewart Publications


The olfactory receptor gene (OR) superfamily is the largest in the human genome. The superfamily contains 390 putatively functional genes and 465 pseudogenes arranged into 18 gene families and 300 subfamilies. Even members within the same subfamily are often located on different chromosomes. OR genes are located on all autosomes except chromosome 20, plus the X chromosome but not the Y chromosome. The gene:pseudogene ratio is lowest in human, higher in chimpanzee and highest in rat and mouse -- most likely reflecting the greater need of olfaction for survival in the rodent than in the human. The OR genes undergo allelic exclusion, each sensory neurone expressing usually only one odourant receptor allele; the mechanism by which this phenomenon is regulated is not yet understood. The nomenclature system (based on evolutionary divergence of genes into families and subfamilies of the OR gene superfamily) has been designed similarly to that originally used for the CYP gene superfamily.

classification of gene families and subfamilies; OR gene superfamily; CYP gene superfamily; nasal olfactory neurone; olfaction; olfactory receptor gene superfamily; allelic exclusion; opossum genome; platypus genome


Before 1980, the names of genes and classification of their encoded proteins were highly variable and non-systematic -- especially to anyone slightly outside a particular field or to a new graduate student entering the field. Professor Margaret Oakley Dayhoff was a pioneer in attempting to create order out of chaos in the naming of genes and gene families by means of computerised protein alignments [1]. She was widely recognised as the founder in this new field of gene/protein classification, before her untimely death in 1983.

Cytochrome P450 (CYP) genes are conveniently arranged into families and subfamilies based on the percentage amino acid sequence identity [2-7]. Enzymes that share approximately ≥ 40 per cent identity are assigned to a particular family designated by an Arabic numeral, whereas those sharing approximately ≥ 55 per cent identity are grouped into a particular subfamily designated by a letter. For example, the sterol 27-hydroxylase enzyme and the 25-hydroxy-vitamin D3 1α-hydroxylase enzyme are both assigned to the CYP27 family because they share > 40 per cent sequence identity. Furthermore, the sterol 27-hydroxylase is assigned to the CYP27 'A' subfamily and the 25-hydroxy-vitamin D3 1 α-hydroxylase to the CYP27 'B' subfamily because their protein sequences are < 55 per cent identical. If an additional enzyme were to be discovered that shared > 55 per cent identity with the sterol 27-hydroxylase, then it would be named CYP27A2. If an additional enzyme were to be discovered that shared < 55 per cent but > 40 per cent identity with the sterol 27-hydroxylase as well as the 25-hydroxy-vitamin D3 1α-hydroxylase, then it would be named CYP27C1. The development and application of this delightfully logical system of nomenclature to the genes of many animals, plants and bacteria [8] has eliminated the confusion that often had plagued the naming of gene families and superfamilies. Subsequently, this 'divergent evolution' nomenclature system was adopted for several hundred other gene families and superfamilies -- including the olfactory receptor superfamily.

Background and history

Vertebrate olfactory receptor (OR) genes represent a category of G-protein-coupled receptors (GPCRs) that contain seven transmembrane α-helical domains and function in the reception of innumerable odour molecules in the environment [9]. The OR gene superfamily is the largest in vertebrate genomes [10-13]. The genomic architecture of mammalian OR gene clusters shows an ancient evolutionary origin, preceding the marsupial-eutherian split; species-specific evolution has further shaped the different OR gene families, by means of both gains and losses of complete clusters, as well as expansion and contraction of existing clusters [11].

This dynamic flexibility is also reflected among individual humans; examining 51 candidate OR genes on DNA chips in 189 ethnically diverse subjects, a striking amount of population diversity was found [14]. Segregating pseudogenes (SPGs) are genes that segregate in populations between intact genes and pseudogenes -- due to a disruptive single nucleotide polymorphism (SNP). A range of 16-24 functional OR genes was found, just in this study alone, indicating that the OR gene superfamily is among the most pronounced examples of functional population diversity in the human genome [14]. Copy number variations (CNVs), another type of polymorphism, are also highly prevalent among human OR genes [15,16]. All these genomic events are evidence of a relatively recent process, whereby the extreme diminution of a functional repertoire in humans has occurred -- a process which is presumably still ongoing.

For most mammalian species, the ability to detect millions of different odourants is critical to their survival. Based on recent OR gene mining data in the platypus, opossum, cow and dog genomes -- compared with that in the rat, mouse, macaque and human genomes [13] -- we are now certain that there has been a substantial expansion of the OR gene superfamily since the mammalian radiation ~ 100 million years ago.

The evolutionary change in the number of OR genes in insects is not nearly as extensive as that in mammals. Drosophila melanogaster has a relatively small receptor repertoire of 62 odourant receptors [17]. A comparison of 12 Drosophila species, encompassing ~ 60 million years of divergence, shows that the number of functional OR genes has remained fairly stable [18]. Caenorhabditis elegans has a highly developed chemosensory system, which enables it to detect a wide variety of volatile (olfactory) and water-soluble (gustatory) cues associated with food, danger or other animals; between 500 and 1,000 different GPCRs are expressed in chemosensory neurones, and these may be supplemented by alternative sensory pathways as well [19]. The vertebrate OR gene repertoire has thus evolved from a subset of ancestral genes in the fly and worm.

There appear to be three important periods in the evolution of the vertebrate olfactory system, as evidenced by comparative genomics: (1) the adaptation to land in amphibian ancestors; (2) the decline of olfaction in primates; and (3) the delineation of putative pheromone receptors concurrent with rodent speciation [20]. The gene: pseudogene ratio is lowest in human, higher in chimpanzee and highest in rat and mouse. This most likely reflects the necessity of olfaction for survival -- more so in the rodent than in the human.

Whereas the chicken, platypus and primate genomes carry <400 functional OR genes, the opossum and rodent genomes, not surprisingly, contain between 1,000 and 1,210 functional OR genes [11,13]. Curiously, however, it is difficult to explain why the cow genome, with 970 functional OR genes, shows more than the dog genome, with ~811 functional OR genes, when dogs are considered to have such a keen sense of olfaction [13]. Thus, the number of OR genes in a species does not appear to be directly related to the environmental 'requirement' or to lifestyle.

Current bioinformatics about the OR gene superfamily

The OR gene superfamily comprises 18 gene families and 300 subfamilies (Table 1). Presently, there are 390 putatively functional (protein-coding) OR genes and 465 OR pseudogenes located in multiple clusters of varying sizes scattered throughout all autosomes except chromosome (Chr) 20, and on the X but not the Y chromosome [21-23]. The members of each subfamily have been placed therein because of divergent evolution, as described above. These subfamilies differ from CYP subfamilies, in that individual members within one subfamily are often located on two or more different chromosomes. The OR2T (Table 2) subfamily contains 16 functional genes -- more than in any other subfamily. Evolutionary divergence of each of the 18 gene families is illustrated in Figure 1.

thumbnailFigure 1. A phylogenetic analysis of one representative from each family of the human OR gene repertoire. In this tree, one can see the following: (1) a general guideline for how the different families relate to one another (although this is very general, and the branching is not always this well defined); (2) the numbers near each branch denote the OR family number; (3) each pie chart size is scaled to represent the number of the OR genes inside that family (black = functional genes, grey = pseudogenes, yellow = segregating pseudogenes [SPGs]). SPGs are genes that segregate in populations between intact genes and pseudogenes -- due to a disruptive SNP [34]. This disruptive mutation can introduce a stop codon, or alter a highly conserved amino acid that is important for proper function of the protein. In Tables 1-5, the SPGs are counted as 'functional genes' or 'pseudogenes', according to the Human Genome Project public version. Additional information can be found at the HORDE database ( webcite).

Table 1. Summary of the olfactory receptor (OR) gene superfamily (18 families)

Table 2. Summary of OR genes in families 1 to 4

Note that, in many instances, some subfamilies contain only a single gene or only a single pseudo-gene (Tables 2-5). In fact, the OR7E subfamily has only one functional gene, and all the other 85 members are pseudogenes (Table 3). The OR7E subfamily is the largest subfamily in the human OR gene repertoire, and probably has expanded in the human genome through a series of segmental gene duplication events [24]. The newly described human OR14 gene family (Table 4) was realised after analysis of the platypus and opossum OR gene repertoires. This analysis revealed that six human OR functional genes and one OR pseudogene (which previously had been classified within the OR5 family) are actually derived from a distinct platypus OR gene family [11,25]. The evolutionary divergence of the OR14 gene family is shown in Figure 2.

Table 3. Summary of OR genes in families 5 to 8

Table 4. Summary of OR genes in families 9 to 13

The 'shotgun' splattering of OR genes throughout the human genome must have happened before speciation of Homo sapiens and the development of its 22 autosomes plus the X and Y chromosomes; this can be inferred from the high conservation of the OR genes' genomic organisation among marsupial and eutherian mammals,[11] and the phylogenetic analysis of the platypus OR gene repertoire--by comparison with that in mammals [13,25]. In contrast to this OR gene arrangement would be the establishment of the CYP gene subfamilies, which arose as syntenic clusters of members within a single chromosomal segment. This finding suggests that gene duplication events within CYP subfamilies occurred after mammalian speciation and development of the autosomes and sex chromosomes.

The two largest OR gene clusters are located on Chr 11, with 38 functional genes (51 per cent of total) on 11q (Cluster 11@5.0) and 44 functional genes (45 per cent) on 11p (Cluster 11@55.6). These genes are predominantly in OR families 51, 52, 55 and 56 (Table 5). Immersed within these two clusters are dozens of other non-OR-related genes. This intrusion of other non-OR-related genes can also be seen in all other OR gene clusters throughout the genome.

Table 5. Summary of OR genes in families 51, 52, 55 and 56

Future directions: Additional subsets of sensory reception genes and identification of ligands

A recently appreciated discovery in olfaction is the unique specialisation of sensory neurones, such that each individual sensory neurone is stochastically chosen to express usually only one odourant receptor allele. This mechanism of 'allelic exclusion', by which mutually exclusive expression of odourant receptor genes is regulated, remains unclear at present [20,26,27].

The vomeronasal-1 receptor genes (VN1R) also encode GPCRs and, while they encode odourant receptors, they are evolutionarily distinct [20] from the very large OR gene superfamily. There are five VN1R genes and nine VN1R pseudogenes. The VN1R1, VN1R2 and VN1R4 genes and VN1R6P pseudogene are located at Chr 19q13.42; the VN1R10P, VN1R11P, VN1R12P, VN1R13P and VN1R14P pseudogenes are located on Chr 6p21; VN1R7P and VN1R8P are on Chr 21p11.2;VN1R3 is alone on Chr 16p11.2; VN1R5 is alone on Chr 1q44; VN1R9P is alone on Chr 22 [28].

thumbnailFigure 2. A phylogenetic analysis of platypus, opossum and human OR genes for the new family 14 only. Opossum = black for intact genes, grey for pseudogenes. Platypus = red for intact genes, pink for pseudogenes. Human = blue. ORI4 is an expansion of three ancestral OR gene subfamilies (A, B and C); the expansion, in both platypus and opossum, took place after speciation, whereas only one branch shows an orthologous relationship between platypus and human (marked with *). The tree was generated with MEGA4, using the nearest-neighbour-joining algorithm, and distances with the Poisson correction model. Bootstrap units are also indicated [34]. This tree is grounded with the OR5IEI gene. Only genes with no more than two frame disruptions were considered in the analysis.

At the present time, information about the ligands for mammalian OR genes is very limited. The smell of lemons (limonene), the perception of a floral or woody smell (acetophenone)[29] and the ability to smell isovaleric acid [30] have been mapped in the mouse to two specific genomic loci on Chr 4 (Ivat1) and Chr 6 (Iva2). In humans, isovaleric acid was found to be highly associated with the OR11H7P segregating pseudogene, which is not syntenic with either Ivat1 or Iva2 [31]. Another recent study found that human OR7D4 is selectively activated in vitro by androstenone; interestingly, this study found that two non-synonymous SNPs account for a significant proportion of the variance in smell perception of androstenone [32].

Members of the gustatory receptor (Gr) gene family in Drosophila are expressed in chemosensory neurones and are known to mediate the perception of sugars, bitter substrates, carbon dioxide and pheromones. Intriguingly, some of these Gr genes have now been shown to be expressed in abdominal multi-dendritic neurones, hygroreceptive neurones of the arista, peripheral proprioceptive neurones in the legs, neurones in the larval and adult brain, and oenocytes [33]. Along these same lines, we and others have observed several OR genes being significantly up- or downregulated in the liver or kidney of knockout mouse lines -- that is, in tissues not normally known to be involved in olfaction. It is therefore tempting to speculate that the receptors encoded by OR genes, as well as by Gr genes, might participate in the roles of detecting endogenous ligands.


We thank our colleagues for valuable discussions and a critical reading of this manuscript. The writing of this article was funded, in part, by NIH grant P30 ES06096 (D.W.N.).


  1. Dayhoff MO, Barker WC, Hunt LT: Establishing homologies in protein sequences.

    Methods Enzymol 1983, 91:524-545. PubMed Abstract OpenURL

  2. Nebert DW, Adesnik M, Coon MJ, et al.: The P450 gene superfamily. Recommended nomenclature.

    DNA 1987, 6:1-11. PubMed Abstract | Publisher Full Text OpenURL

  3. Nebert DW, Gonzalez FJ: P450 genes: Structure, evolution, and regulation.

    Annu Rev Biochem 1987, 56:945-993. PubMed Abstract | Publisher Full Text OpenURL

  4. Nebert DW, Nelson DR, Adesnik M, et al.: The P450 superfamily: Updated listing of all genes and recommended nomenclature for the chromosomal loci.

    DNA 1989, 8:1-13. PubMed Abstract | Publisher Full Text OpenURL

  5. Nebert DW, Nelson DR, Coon MJ, et al.: The P450 super-family: Update on new sequences, gene mapping, and recommended nomenclature.

    DNA Cell Biol 1991, 10:1-14. PubMed Abstract | Publisher Full Text OpenURL

  6. Nelson DR, Kamataki T, Waxman DJ, et al.: The P450 superfamily: Update on new sequences, gene mapping, accession numbers, early trivial names of enzymes, and nomenclature.

    DNA Cell Biol 1993, 12:1-51. PubMed Abstract | Publisher Full Text OpenURL

  7. Nelson DR, Koymans L, Kamataki T, et al.: P450 superfamily: Update on new sequences, gene mapping, accession numbers and nomenclature.

    Pharmacogenetics 1996, 6:1-42. PubMed Abstract | Publisher Full Text OpenURL

  8. [] webcite

  9. Buck L, Axel R: A novel multigene family may encode odorant receptors: A molecular basis for odor recognition.

    Cell 1991, 65:175-187. PubMed Abstract | Publisher Full Text OpenURL

  10. Gaillard I, Rouquier S, Giorgi D: Olfactory receptors.

    Cell Mol Life Sci 2004, 61:456-469. PubMed Abstract | Publisher Full Text OpenURL

  11. Aloni R, Olender T, Lancet D: Ancient genomic architecture for mammalian olfactory receptor clusters.

    Genome Biol 2006, 7:R88. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  12. Henion TR, Schwarting GA: Patterning the developing and regenerating olfactory system.

    J Cell Physiol 2007, 210:290-297. PubMed Abstract | Publisher Full Text OpenURL

  13. Niimura Y, Nei M: Extensive gains and losses of olfactory receptor genes in mammalian evolution.

    PLoS ONE 2007, 2:e708. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  14. Menashe I, Man O, Lancet D, Gilad Y: Different noses for different people.

    Nat Genet 2003, 34:143-144. PubMed Abstract | Publisher Full Text OpenURL

  15. Nguyen DQ, Webber C, Ponting CP: Bias of selection on human copy-number variants.

    PLoS Genet 2006, 2:e20. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  16. Hasin Y, Olender T, Khen M, et al.: High resolution copy-number variation map reflects human olfactory receptor diversity and evolution.

    PLoS Genet in press 2008. OpenURL

  17. Smith DP: Odor and pheromone detection in Drosophila melano-gaster.

    Pflüger's Arch 2007, 454:749-758. PubMed Abstract OpenURL

  18. Nozawa M, Nei M: Evolutionary dynamics of olfactory receptor genes in Drosophila species.

    Proc Natl Acad Sci USA 2007, 104:7122-7127. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  19. Bargmann CI: Chemosensation in C. elegans.

    Wormbook, October 25 2006, 1-29. OpenURL

  20. Kambere MB, Lane RP: Co-regulation of a large and rapidly evolving repertoire of odorant receptor genes.

    BMC Neurosci 2007, 8(Suppl 3):S2. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  21. [] webcite

  22. Safran M, Chalifa-Caspi V, Shmueli O, et al.: Human gene-centric databases at the Weizmann Institute of Science:GeneCards, UDB, CroW 21 and HORDE.

    Nucleic Acids Res 2003, 31:142-146. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  23. Olender T, Feldmesser E, Atarot T, et al.: The olfactory receptor universe -- From whole genome analysis to structure and evolution.

    Genet Mol Res 2004, 3:545-553. PubMed Abstract | Publisher Full Text OpenURL

  24. Yue Y, Haaf T: 7E olfactory receptor gene clusters and evolutionary chromosome rearrangements.

    Cytogenet Genome Res 2006, 112:6-10. PubMed Abstract | Publisher Full Text OpenURL

  25. Warren WC, Hillier LW, Marshall-Graves JA, et al.: Genome analysis of the platypus reveals unique signatures of evolution.

    Nature 2008, 453:175-183. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  26. Serizawa S, Miyamichi K, Nakatani H, et al.: Negative feedback regulation ensures the one receptor-one olfactory neuron rule in mouse.

    Science 2003, 302:2088-2094. PubMed Abstract | Publisher Full Text OpenURL

  27. Rodriguez I: Odorant and pheromone receptor gene regulation in vertebrates.

    Curr Opin Genet Dev 2007, 17:465-470. PubMed Abstract | Publisher Full Text OpenURL

  28. [] webcite

  29. Malnic B, Godfrey PA, Buck LB: The human olfactory receptor gene family.

    Proc Natl Acad Sci USA 2004, 101:2584-2589. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  30. Griff IC, Reed RR: The genetic basis for specific anosmia to isovaleric acid in the mouse.

    Cell 1995, 83:407-414. PubMed Abstract | Publisher Full Text OpenURL

  31. Menashe I, Abaffy T, Hasin Y, et al.: Genetic elucidation of human hyperosmia to isovaleric acid.

    PLoS Biol 2007, 5:e284. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  32. Keller A, Zhuang H, Chi Q, et al.: Genetic variation in a human odorant receptor alters odour perception.

    Nature 2007, 449:468-472. PubMed Abstract | Publisher Full Text OpenURL

  33. Thorne N, Amrein H: Atypical expression of Drosophila gustatory receptor genes in sensory and central neurons.

    J Comp Neurol 2008, 506:548-568. PubMed Abstract | Publisher Full Text OpenURL

  34. Tamura K, Dudley J, Nei M, Kumar S: MEGA4:Molecular evolutionary genetics analysis (MEGA) software version 4.0.

    Mol Biol Evol 2007, 24:1596-1599. PubMed Abstract | Publisher Full Text OpenURL