KEY CONCEPTS:
- Facts suggesting that exons were the building blocks of
evolution and the first genes were interrupted are:
- Gene structure is conserved between genes in very distant species.
- Many exons can be equated with coding for protein sequences that have particular functions.
- Related exons are found in different genes.
If current proteins evolved by combining ancestral proteins
that were originally separate, the accretion of units is likely to have occurred
sequentially over some period of time, with one exon added at a time (for review
see Blake, 1985). Can the different functions from which
these genes were pieced together be seen in their present structures? In other
words, can we equate particular functions of current proteins with individual
exons ?
In some cases, there is a clear relationship between the
structures of the gene and protein. The example par excellence is
provided by the immunoglobulin proteins, which are coded by genes in which every
exon corresponds exactly with a known functional domain of the protein. Figure 2.22 compares the structure of an immunoglobulin with
its gene.
An immunoglobulin is a tetramer of two light chains and two
heavy chains, which aggregate to generate a protein with several distinct
domains. Light chains and heavy chains differ in structure, and there are
several types of heavy chain. Each type of chain is expressed from a gene that
has a series of exons corresponding with the structural domains of the
protein.
In many instances, some of the exons of a gene can be
identified with particular functions. In secretory proteins, the first exon,
coding for the N-terminal region of the polypeptide, often specifies the signal
sequence involved in membrane secretion. An example is insulin.
The view that exons are the functional building blocks of
genes is supported by cases in which two genes may have some exons that are
related to one another, while other exons are found only in one of the genes. Figure 2.23 summarizes the relationship between the receptor
for human LDL (plasma low density lipoprotein) and other proteins. In the center
of the LDL receptor gene is a series of exons related to the exons of the gene
for the precursor for EGF (epidermal growth factor). In the N-terminal part of
the protein, a series of exons codes for a sequence related to the blood protein
complement factor C9. So the LDL receptor gene was created by assembling
modules for its various functions. These modules are also used in
different combinations in other proteins.
Exons tend to be fairly small (see Figure
2.12), around the size of the smallest polypeptide that can assume a stable
folded structure, ~20-40 residues. Perhaps proteins were originally assembled
from rather small modules. Each module need not necessarily correspond to a
current function; several modules could have combined to generate a function.
The number of exons in a gene tends to increase with the length of its protein,
which is consistent with the view that proteins acquire multiple functions by
successively adding appropriate modules.
This idea might explain another feature of protein
structure: it seems that the sites represented at exon-intron boundaries often
are located at the surface of a protein. As modules are added to a protein, the
connections, at least of the most recently added modules, could tend to lie at
the surface.
No comments:
Post a Comment