Gene Expression II: The Genetic Code and Protein Synthesis

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Gene Expression II: The Genetic Code and Protein Synthesis

Introduction

This topic explores how genetic information stored in DNA is used to direct the synthesis of proteins, focusing on the genetic code and the experimental evidence that led to its discovery. Understanding these processes is fundamental to cell biology, as proteins carry out most cellular functions.

The Genetic Code

Definition and Significance

Genetic Code: The set of rules by which the nucleotide sequence of DNA (or RNA) is translated into the amino acid sequence of proteins.
The genetic code is universal (with rare exceptions), triplet-based, and degenerate (multiple codons can specify the same amino acid).
Codon: A sequence of three nucleotides in mRNA that specifies a particular amino acid or a stop signal during translation.

Key Terms

Gene: A segment of DNA that encodes a functional product (polypeptide or RNA).
Coding Strand: The DNA strand whose sequence matches the mRNA (except T is replaced by U).
Template Strand: The DNA strand that is used as a template for mRNA synthesis.
Frameshift Mutation: A genetic mutation caused by insertions or deletions of nucleotides that change the reading frame of the genetic code.
Alternative Splicing: The process by which different combinations of exons are joined together to produce multiple mRNA variants from a single gene.
One-Gene One-Polypeptide Theory: The concept that each gene encodes a single polypeptide chain.

Experimental Evidence for the Genetic Code

Beadle and Tatum Experiment

George Beadle and Edward Tatum used the bread mold Neurospora crassa to demonstrate the relationship between genes and enzymes.

They exposed the mold to X-rays to induce mutations.
Mutants unable to grow on minimal medium but able to grow on complete medium were isolated.
By supplementing minimal medium with specific amino acids or vitamins, they identified which metabolic step was blocked in each mutant.
This led to the One-Gene One-Enzyme Hypothesis, later refined to the One-Gene One-Polypeptide Theory.

Table: Beadle and Tatum's Experimental Design

Type	Growth on Minimal Medium	Growth on Supplemented Medium	Blocked Step
Wild Type	Yes	Yes	None
Class I Mutant	No	Yes (with ornithine)	Gene A (Ornithine synthesis)
Class II Mutant	No	Yes (with citrulline)	Gene B (Citrulline synthesis)
Class III Mutant	No	Yes (with arginine)	Gene C (Arginine synthesis)

Relationship Between DNA, mRNA, and Protein

The sequence of DNA bases determines the sequence of mRNA, which in turn determines the sequence of amino acids in a protein.
Transcription copies the template strand of DNA into mRNA, replacing thymine (T) with uracil (U).
Translation reads mRNA codons to assemble amino acids into a polypeptide chain.

Triplet Nature of the Genetic Code

There are four DNA bases (A, T, C, G) and 20 amino acids.
A doublet code (two bases per codon) yields only 16 combinations, insufficient for 20 amino acids.
A triplet code yields 64 possible codons (), more than enough for all amino acids.

Frameshift Mutations and the Triplet Code

Frameshift mutations (insertions or deletions) shift the reading frame, altering the downstream amino acid sequence.
Experiments by Crick and Brenner showed that adding or removing three nucleotides restored the reading frame, supporting the triplet nature of the code.

Degeneracy and Nonoverlapping Nature of the Code

The genetic code is degenerate: most amino acids are specified by more than one codon.
The code is nonoverlapping: each nucleotide is part of only one codon, and the reading frame advances three nucleotides at a time.

Cell-Free Systems and Deciphering the Code

Marshall Nirenberg and J. Heinrich Matthaei used cell-free systems to study protein synthesis.
They added synthetic RNAs of known sequence to these systems and observed which amino acids were incorporated.
Homopolymers (e.g., poly(U)) led to the identification of codons for specific amino acids (e.g., UUU codes for phenylalanine).
Gobind Khorana synthesized RNAs with alternating sequences to further assign codons to amino acids.

Start and Stop Codons

Of the 64 possible codons, 61 code for amino acids.
AUG is the start codon (also codes for methionine).
UAA, UAG, UGA are stop codons, signaling termination of translation.

Universality and Ambiguity of the Genetic Code

The genetic code is nearly universal across all organisms.
Each codon has a single meaning (unambiguous), but many amino acids are specified by multiple codons (degenerate).

Summary Table: Properties of the Genetic Code

Property	Description
Triplet	Three nucleotides per codon
Degenerate	Multiple codons for most amino acids
Nonoverlapping	Each nucleotide belongs to one codon
Unambiguous	Each codon specifies only one amino acid
Nearly Universal	Same code used by most organisms

Example: DNA to Protein Sequence

Coding strand: 5'-ATGGGCTC-3'
Template strand: 3'-TACCCGAG-5'
mRNA: 5'-AUGGGCUCG-3'
Protein: Met-Gly-Ser

Additional info: The notes above expand on the original slides by providing definitions, context, and tables for clarity. The tables are inferred from the experimental design and properties of the genetic code as described in the slides.