Gene Translation: RNA -> Protein


The problem: How does a particular sequence of nucleotides specify a particular sequence of amino acids?

The answer: by means of transfer RNA molecules, each specific for one amino acid and for a particular triplet of nucleotides in messenger RNA (mRNA) called a codon. The family of tRNA molecules enables the codons in an mRNA molecule to be translated into the sequence of amino acids in the protein.

This image shows the structure of alanine transfer RNA (tRNAala) from yeast. It consists of a single strand of 77 ribonucleotides. The chain is folded on itself, and many of the bases pair with each other, forming four helical regions. Loops are formed in the unpaired regions of the chain. (The bases circled in blue have been chemically modified following synthesis of the molecule.)

At least one kind of tRNA is present for each of the 20 amino acids used in protein synthesis. (Some amino acids employ the services of two or three different tRNAs, so most cells contain as many as 32 different kinds of tRNA.) The amino acid is attached to the appropriate tRNA by an activating enzyme (one of 20 aminoacyl-tRNA synthetases) specific for that amino acid as well as for the tRNA assigned to it.

Each kind of tRNA has a sequence of 3 unpaired nucleotides (the anticodon) which can bind, following the rules of base pairing, to the complementary triplet of nucleotides (the codon) in a messenger RNA (mRNA) molecule. Just as DNA replication and transcription involve base pairing of nucleotides running in opposite directions, so the reading of codons in mRNA (5' -> 3') requires that the anticodons bind in the opposite direction.
      Anticodon:  3' CGA  5'
      Codon:      5' GCU  3'
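
The antiparallel pairing shown above can be sketched in a few lines of Python. This is only an illustration: the function names are made up for this example, and it applies strict Watson-Crick pairing (A-U, G-C) and ignores the looser "wobble" pairing that some tRNAs use at the third codon position.

```python
# Strict Watson-Crick pairing rules for RNA (an assumption of this sketch;
# wobble pairing at the third codon position is deliberately ignored).
PAIR = {"A": "U", "U": "A", "G": "C", "C": "G"}

def anticodon_3_to_5(codon):
    """Anticodon written 3' -> 5', aligned base-by-base under a codon written 5' -> 3'."""
    return "".join(PAIR[base] for base in codon.upper())

def anticodon_5_to_3(codon):
    """The same anticodon written in the conventional 5' -> 3' direction."""
    return anticodon_3_to_5(codon)[::-1]

print("Anticodon:  3'", anticodon_3_to_5("GCU"), " 5'")
print("Codon:      5'", "GCU", " 3'")
```

Reading the anticodon 3' -> 5' lets it be lined up directly under the codon; reversing it gives the same three bases in the usual 5' -> 3' notation.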

The RNA Codons

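                    Second nucleotide
First   U          C          A          G          Third
U       UUU Phe    UCU Ser    UAU Tyr    UGU Cys    U
        UUC Phe    UCC Ser    UAC Tyr    UGC Cys    C
        UUA Leu    UCA Ser    UAA STOP   UGA STOP   A
        UUG Leu    UCG Ser    UAG STOP   UGG Trp    G
C       CUU Leu    CCU Pro    CAU His    CGU Arg    U
        CUC Leu    CCC Pro    CAC His    CGC Arg    C
        CUA Leu    CCA Pro    CAA Gln    CGA Arg    A
        CUG Leu    CCG Pro    CAG Gln    CGG Arg    G
A       AUU Ile    ACU Thr    AAU Asn    AGU Ser    U
        AUC Ile    ACC Thr    AAC Asn    AGC Ser    C
        AUA Ile    ACA Thr    AAA Lys    AGA Arg    A
        AUG Met    ACG Thr    AAG Lys    AGG Arg    G
G       GUU Val    GCU Ala    GAU Asp    GGU Gly    U
        GUC Val    GCC Ala    GAC Asp    GGC Gly    C
        GUA Val    GCA Ala    GAA Glu    GGA Gly    A
        GUG Val    GCG Ala    GAG Glu    GGG Gly    G

AUG codes for methionine (Met) and also serves as the START codon.

Abbreviations: Ala alanine, Arg arginine, Asn asparagine, Asp aspartic acid, Cys cysteine, Gln glutamine, Glu glutamic acid, Gly glycine, His histidine, Ile isoleucine, Leu leucine, Lys lysine, Met methionine, Phe phenylalanine, Pro proline, Ser serine, Thr threonine, Trp tryptophan, Tyr tyrosine, Val valine.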
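
As a rough illustration of how the codon table is used, the Python sketch below builds the standard table and reads an mRNA three bases at a time. The function name and the test sequence are invented for this example, and starting at the first AUG found is a simplifying assumption, not a description of how ribosomes actually choose a start site.

```python
from itertools import product

BASES = "UCAG"
# One-letter amino acid codes in the order UUU, UUC, UUA, UUG, UCU, ... GGG.
# '*' marks the three STOP codons (UAA, UAG, UGA).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(mrna):
    """Translate from the first AUG (a simplification) until a STOP codon or the end."""
    mrna = mrna.upper()
    start = mrna.find("AUG")
    if start == -1:
        return ""  # no start codon found
    protein = []
    for i in range(start, len(mrna) - 2, 3):
        amino_acid = CODON_TABLE[mrna[i:i + 3]]
        if amino_acid == "*":
            break  # STOP codon: translation ends
        protein.append(amino_acid)
    return "".join(protein)

# Made-up message: two leader bases, then AUG GCU UUU UAA, then two trailing bases.
print(translate("GGAUGGCUUUUUAAGG"))
```

The output uses one-letter amino acid codes (M = Met, A = Ala, F = Phe) rather than the three-letter abbreviations in the table.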


The Steps of Translation

1. Initiation

2. Elongation

Note: the initiator tRNA is the only member of the tRNA family that can bind directly to the P site. The P site is so named because, with the exception of initiator tRNA, it binds only to a peptidyl-tRNA molecule; that is, a tRNA with the growing peptide attached.

The A site is so named because it binds only to the incoming aminoacyl-tRNA; that is, the tRNA bringing the next amino acid. So, for example, the tRNA that brings Met into the interior of the polypeptide can bind only to the A site.

3. Termination



A single mRNA molecule usually has many ribosomes traveling along it, in various stages of synthesizing the protein. This complex is called a polysome.

Codon Bias

All but two of the amino acids (Met and Trp) can be encoded by 2 to 6 different codons. However, the genomes of most organisms reveal that certain codons are preferred over others. In humans, for example, alanine is encoded by GCC four times as often as by GCG. This probably reflects a greater efficiency of the translation apparatus for certain codons over their synonyms.

Codon bias even extends to pairs of codons: wherever a human protein contains the amino acids Ala-Glu, the gene encoding those amino acids is seven times as likely to use the codons GCAGAG rather than the synonymous GCCGAA.
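
Codon bias of this kind is usually measured by counting how often each synonymous codon appears in coding sequences. The Python sketch below shows one simple way to do that. The function names are hypothetical and the toy sequence is invented for illustration (it is not human data), so the numbers it prints say nothing about any real genome.

```python
from collections import Counter

def codon_counts(coding_sequence):
    """Count in-frame codons; any trailing 1 or 2 bases are ignored."""
    seq = coding_sequence.upper()
    usable = len(seq) - len(seq) % 3
    return Counter(seq[i:i + 3] for i in range(0, usable, 3))

def relative_usage(counts, synonyms):
    """Fraction of the time each synonymous codon is used for one amino acid."""
    total = sum(counts[codon] for codon in synonyms)
    if total == 0:
        return {codon: 0.0 for codon in synonyms}
    return {codon: counts[codon] / total for codon in synonyms}

ALA_CODONS = ["GCU", "GCC", "GCA", "GCG"]

# Invented toy sequence: GCC GCC GCG GCC GCU GCC
toy_sequence = "GCCGCCGCGGCCGCUGCC"
usage = relative_usage(codon_counts(toy_sequence), ALA_CODONS)
for codon in ALA_CODONS:
    print(codon, round(usage[codon], 2))
```

The same counting, applied to adjacent pairs of codons instead of single codons, would expose the codon-pair preferences described above.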

Codon bias is exploited by the biotechnology industry to improve the yield of the desired product. The ability to manipulate codon bias may also usher in an era of safer vaccines.

Quality Control

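Defective mRNA molecules can be produced by errors such as mutations in the gene or mistakes in RNA processing. In addition to producing mRNAs with incorrect codons for amino acids, these errors can produce mRNA molecules that have a premature termination codon or no STOP codon at all.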

Nonsense-Mediated mRNA Decay (NMD)

Premature termination codons (PTCs) may be generated by


Mutations that introduce premature termination codons are responsible for some cases of such inherited human diseases as cystic fibrosis and Duchenne muscular dystrophy (DMD).

A drug, designated PTC124 or ataluren, causes the ribosome to skip over PTCs while still enabling normal termination of translation. PTC124 has shown promise in animal models of cystic fibrosis and DMD and phase II clinical trials are now being conducted on humans.

Nonstop mRNA Decay

Nonstop transcripts occur when there is no STOP codon in the message. As a result, the ribosome is unable to recruit the release factors needed to leave the mRNA.

Nonstop transcripts are formed during RNA processing, e.g., by having the poly(A) tail put on before the STOP codon is reached.
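
The two kinds of defect described in this section, a premature termination codon and a missing STOP codon, can be told apart by scanning a reading frame codon by codon. The Python sketch below is only a toy classifier: the function names and the expected_sense_codons parameter are hypothetical stand-ins, since a cell has no such number to compare against and detects these defects by other means.

```python
STOP_CODONS = {"UAA", "UAG", "UGA"}

def first_stop_index(reading_frame):
    """Index (in codons) of the first in-frame STOP codon, or None if there is none."""
    frame = reading_frame.upper()
    for n, i in enumerate(range(0, len(frame) - 2, 3)):
        if frame[i:i + 3] in STOP_CODONS:
            return n
    return None

def classify(reading_frame, expected_sense_codons):
    """Label a reading frame that begins at AUG.

    expected_sense_codons is a hypothetical parameter: the number of
    amino-acid-coding codons the normal message would contain.
    """
    stop = first_stop_index(reading_frame)
    if stop is None:
        return "nonstop"
    if stop < expected_sense_codons:
        return "premature stop"
    return "normal"

# Invented examples; each normal message would have 3 coding codons.
print(classify("AUGGCUUUUUAA", 3))  # AUG GCU UUU UAA
print(classify("AUGUAAUUUUAA", 3))  # AUG UAA ...
print(classify("AUGGCUUUUAAA", 3))  # AUG GCU UUU AAA (no STOP)
```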


Eukaryotes and bacteria handle the problem of no STOP codon differently.

Regulation of Translation

The expression of most genes is controlled at the level of their transcription. Transcription factors (proteins) bind to promoters and enhancers turning on (or off) the genes they control.


However, gene expression can also be controlled at the level of translation.

By General RNA-Degradation Machinery

P bodies

The cytosol of eukaryotes contains protein complexes that compete with ribosomes for access to mRNAs. As these increase their activity, they sequester mRNAs in larger aggregates called P bodies (for "processing bodies", but this processing should not be confused with the processing of pre-mRNA to mature mRNA that occurs in the nucleus).

These protein complexes break down the mRNA by

What controls the dynamic balance between ribosomes and P bodies for access to mRNAs remains to be learned. But this mechanism provides for


Exosomes

These are hollow macromolecular complexes with two openings. They take in unfolded RNA molecules and degrade them in the 3' -> 5' direction.

(In neither structure nor function do these exosomes resemble the exosomes involved in antigen presentation that unfortunately share the same name.)

By MicroRNAs (miRNAs)

Here small RNA molecules bind to a complementary portion in the 3'-UTR of the mRNA and

Both these activities take place in P bodies.

By Riboswitches

It turns out that the regulation of the level of certain metabolites is controlled by riboswitches. A riboswitch is a part of a molecule of messenger RNA (mRNA) with a specific binding site for the metabolite (or a close relative).


It has been suggested that these regulatory mechanisms, which do not involve any protein, are a relict from an "RNA world".

By RNA Thermosensors

Several species of bacteria have been found with a temperature-sensitive region in the 5' untranslated region (UTR) of certain of their mRNAs. For example, at normal temperatures the mRNA encoding a heat-shock protein contains a loop in the 5' UTR that prevents the mRNA from binding to a ribosome and being translated. At elevated temperatures, however, the loop opens, so the mRNA can bind a ribosome and the heat-shock protein can be synthesized.

By Gene-Specific Proteins

Translation of at least one mRNA in humans is repressed by a protein: an aminoacyl-tRNA synthetase. In response to the inflammatory cytokine interferon-gamma (IFN-γ), the synthetase abandons its normal function (adding Glu and Pro to their respective tRNAs) and instead binds to the mRNA, blocking its translation.

In some bacteria, a protein product may inhibit the further translation of its own mRNA (a kind of feedback inhibition). It does so by binding to a site which blocks the mRNA from further association with a ribosome.


Gene expression occurs in two steps: transcription of DNA into mRNA, followed by translation of the mRNA into a polypeptide.

In eukaryotes, the processes of transcription and translation are separated both spatially and in time. Transcription of DNA into mRNA occurs in the nucleus. Translation of mRNA into polypeptides occurs on polysomes in the cytoplasm.

In bacteria (which have no nucleus), both these steps of gene expression occur simultaneously: the nascent mRNA molecule begins to be translated even before its transcription from DNA is complete.

Evidence (reported by Iborra et al. in the 10 August 2001 issue of Science) shows that the distinction between bacteria and eukaryotes is not absolute. They found that 10 to 15% of translation in mammalian cells occurs in the nucleus, and that at least some of this translation occurs while the mRNA is still being synthesized by RNA polymerase (just as in E. coli).


15 February 2020