Skip to main content
Back

Gene Expression II: The Genetic Code and Protein Synthesis

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Gene Expression II: The Genetic Code and Protein Synthesis

Introduction

This topic explores how genetic information encoded in DNA is translated into proteins, focusing on the genetic code, its experimental discovery, and its properties. Understanding these principles is fundamental to cell biology and molecular genetics.

The Genetic Code

Definition and Relationship

The genetic code is the set of rules by which the sequence of bases in DNA is translated into the sequence of amino acids in proteins.

  • DNA base sequence determines the linear order of amino acids in protein products.

  • mRNA acts as the intermediary, carrying instructions from DNA to the ribosome for protein synthesis.

Key Terms

  • Gene: Functional unit of DNA that encodes one or more polypeptides or functional RNAs.

  • Coding strand: The DNA strand whose sequence matches the mRNA (except T is replaced by U).

  • Template strand: The DNA strand that is copied during mRNA synthesis.

  • Codon: A sequence of three nucleotides in mRNA that specifies an amino acid.

  • Alternative splicing: Process by which different combinations of exons are joined to produce multiple mRNA variants from a single gene.

  • Frameshift mutation: Mutation caused by insertion or deletion of nucleotides that shifts the reading frame.

Experimental Evidence for the Genetic Code

Beadle and Tatum: One Gene-One Enzyme Hypothesis

George Beadle and Edward Tatum used the bread mold Neurospora crassa to demonstrate the link between genes and enzymes.

  • Mutations induced by X-rays led to loss of ability to synthesize specific amino acids or vitamins.

  • Mutants survived only when missing substances were provided in the medium.

  • Each mutation disabled a single enzymatic step in a metabolic pathway.

  • Formulated the one gene-one enzyme hypothesis.

Beadle and Tatum Experimental Design (Summary Table)

Mutant Class

Growth on Minimal Medium

Growth on Supplemented Medium

Inference

Wild type

Yes

Yes

No mutation

Class I

No

Yes (with supplement)

Mutation blocks synthesis of specific compound

Class II

No

Yes (with different supplement)

Mutation blocks different step

Ingram: One Gene-One Polypeptide Hypothesis

  • Linus Pauling and Vernon Ingram studied sickle-cell anemia.

  • Electrophoresis showed hemoglobin from sickle cells migrated differently than normal.

  • Trypsin digestion and peptide analysis revealed a single amino acid change: valine replaces glutamic acid in sickle-cell hemoglobin.

  • Led to refinement: one gene-one polypeptide theory.

Normal vs. Sickle-Cell Hemoglobin (Summary Table)

Hemoglobin Type

Amino Acid at Position 6

Charge

Normal (HbA)

Glutamic acid

Negative

Sickle-cell (HbS)

Valine

Neutral

Properties of the Genetic Code

The Triplet Code

  • Four DNA bases (A, T, G, C) and 20 amino acids.

  • Doublet code (2 bases per codon) yields only 16 combinations—insufficient.

  • Triplet code (3 bases per codon) yields 64 possible combinations—more than enough for all amino acids.

  • Each codon consists of three nucleotides.

Frameshift Mutations and Evidence for Triplet Code

  • Crick, Brenner, and others used proflavin to induce indel mutations in bacteriophage T4.

  • Frameshift mutations shift the reading frame, altering downstream amino acid sequence.

  • Revertant mutations (second indel near the first) restore the reading frame, producing pseudo wild-type phenotype.

  • Adding/removing three nucleotides does not alter the reading frame, supporting the triplet nature of the code.

Degeneracy and Nonoverlapping Nature

  • 64 codons, but only 20 amino acids.

  • Degenerate code: Most amino acids are specified by more than one codon.

  • Nonoverlapping code: Each nucleotide is part of only one codon; reading frame advances three nucleotides at a time.

Degeneracy Table (Sample)

Amino Acid

Codons

Leucine

UUA, UUG, CUU, CUC, CUA, CUG

Serine

UCU, UCC, UCA, UCG, AGU, AGC

Messenger RNA and Protein Synthesis

Role of mRNA

  • Genetic code refers to the order of nucleotides in mRNA that direct protein synthesis.

  • mRNA is transcribed from DNA, serving as a template for translation.

Differences Between mRNA Synthesis and DNA Replication

  • In mRNA synthesis, only one DNA strand is copied—the template strand.

  • The other strand is the coding strand, which matches the mRNA sequence (except T is replaced by U).

  • In mRNA, uracil (U) replaces thymine (T).

Example: Coding vs. Template Strand

Strand

Sequence

Coding (DNA)

5'-ATGGGCGGC-3'

Template (DNA)

3'-TACCCGCCG-5'

mRNA

5'-AUGGGCGGC-3'

Experimental Elucidation of the Genetic Code

Cell-Free Systems

  • Marshall Nirenberg and J. Heinrich Matthaei used cell-free systems to study protein synthesis.

  • Synthetic RNAs of known sequence were added to bacterial extracts.

  • Polynucleotide phosphorylase was used to make synthetic RNA molecules.

Homopolymers and Copolymers

  • Homopolymer: RNA made from a single type of nucleotide (e.g., poly(U)).

  • Poly(U) directed incorporation of phenylalanine, showing UUU codes for phenylalanine.

  • Further experiments showed AAA codes for lysine, CCC for proline.

  • Copolymers (mixtures of two nucleotides) helped assign codons to amino acids.

Assignment of Codons

  • H. Gobind Khorana synthesized RNAs with alternating sequences to narrow codon assignments.

  • Eventually, all 64 codons were assigned to specific amino acids or stop signals.

Codon Table (Summary)

Codon

Amino Acid

UUU, UUC

Phe

AUG

Met (Start)

UAA, UAG, UGA

Stop

AAA

Lys

CCC

Pro

... (see full codon table for all assignments)

...

Properties of the Genetic Code

Unambiguous and Degenerate

  • Each codon specifies only one amino acid (unambiguous).

  • Many amino acids are specified by multiple codons (degenerate).

  • Mutations in codons often change the amino acid sequence.

Universality of the Genetic Code

  • The genetic code is nearly universal among all organisms.

  • Exceptions exist in mitochondria and some bacteria, where codon assignments differ.

Key Equations and Concepts

Number of Possible Codons

  • Number of possible codons: $4^3 = 64$

Central Dogma of Molecular Biology

  • DNA → RNA → Protein

Summary Table: Properties of the Genetic Code

Property

Description

Triplet

Three nucleotides per codon

Degenerate

Multiple codons for most amino acids

Nonoverlapping

Each nucleotide is part of only one codon

Unambiguous

Each codon specifies only one amino acid

Nearly Universal

Same code used by most organisms

Conclusion

The genetic code is a fundamental principle of molecular biology, linking DNA sequence to protein structure and function. Its discovery and characterization were achieved through classic experiments and remain central to understanding gene expression in all cells.

Pearson Logo

Study Prep