Gene Expression II: The Genetic Code and Protein Synthesis – Study Notes

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Gene Expression II: The Genetic Code and Protein Synthesis

Introduction

Gene expression is the process by which information from a gene is used to synthesize a functional gene product, often a protein. For some genes, the RNA transcript is the final product, but for most, the ultimate product is a protein. Messenger RNAs (mRNAs) encode instructions for translation, the process of assembling amino acids into a polypeptide chain.

The Genetic Code

Definition and Significance

The genetic code is the set of rules by which the nucleotide sequence of DNA (and its RNA transcript) is translated into the linear order of amino acids in proteins. This code is fundamental to the flow of genetic information from DNA to protein.

Codon: A sequence of three nucleotides in mRNA that specifies a particular amino acid or a stop signal during translation.
Triplet Code: The genetic code is read in groups of three nucleotides (codons), allowing for 64 possible combinations.

Beadle and Tatum: One Gene-One Enzyme Hypothesis

George Beadle and Edward Tatum's experiments with Neurospora crassa established the link between genes and enzymes.

They used X-rays to induce mutations in the bread mold, creating mutants unable to survive on minimal medium.
Mutants could grow only when supplemented with specific amino acids or vitamins, indicating a block in a metabolic pathway.
Each mutation affected a single enzymatic step, leading to the one gene-one enzyme hypothesis.

Mutant Class	Growth on Minimal Medium	Growth on Supplemented Medium	Deficiency
Wild Type	Yes	Yes	None
Class I	No	Yes (with precursor 1)	Enzyme 1
Class II	No	Yes (with precursor 2)	Enzyme 2
Class III	No	Yes (with precursor 3)	Enzyme 3
Additional info: Table inferred from slide and experiment description.

Ingram: One Gene-One Polypeptide Theory

Linus Pauling and Vernon Ingram refined the hypothesis through studies of sickle-cell anemia.

Electrophoresis showed that hemoglobin from sickle cells migrated differently than normal hemoglobin.
Trypsin digestion and peptide analysis revealed a single amino acid change: valine replaced glutamic acid in sickle-cell hemoglobin.
This led to the one gene-one polypeptide theory: each gene encodes a single polypeptide chain, not necessarily an enzyme.

Hemoglobin Type	Amino Acid at Position 6	Charge
Normal (Hb-A)	Glutamic acid	Negative
Sickle-cell (Hb-S)	Valine	Neutral

Gene Function Complexity

Gene function is more complex than originally thought.

Most eukaryotic genes contain noncoding sequences (introns) among coding regions (exons).
Alternative splicing allows a single gene to code for multiple polypeptides.
Some genes encode functional RNAs rather than proteins.
Gene: Defined as a functional unit of DNA that encodes one or more polypeptides or functional RNAs.

The Triplet Nature of the Genetic Code

Triplet Code Evidence

There are four DNA bases and 20 amino acids. A doublet code (two bases per amino acid) yields only 16 combinations, which is insufficient. A triplet code yields 64 possible codons, more than enough for all amino acids.

Frameshift mutations (insertions or deletions) shift the reading frame, altering the downstream amino acid sequence.
Crick and Brenner's experiments with bacteriophage T4 showed that adding or removing three nucleotides restored the reading frame, supporting the triplet code model.

Frameshift Mutations

Frameshift mutations are caused by indels (insertions or deletions) of nucleotides, which disrupt the reading frame of the gene.

These mutations can revert to pseudo wild-type if a second, compensatory mutation occurs nearby.
The gene is read in three-letter words (codons); shifting the frame changes the meaning of all subsequent codons.

Mutation Type	Effect on Reading Frame	Protein Product
Insertion/Deletion of 1 or 2 bases	Frameshift	Abnormal
Insertion/Deletion of 3 bases	No frameshift	May be functional

Properties of the Genetic Code

Degeneracy and Nonoverlapping Nature

The genetic code is both degenerate and nonoverlapping.

Degenerate code: Most amino acids are specified by more than one codon.
Nonoverlapping code: Each nucleotide is part of only one codon; the reading frame advances three nucleotides at a time.

Messenger RNA and Polypeptide Synthesis

Messenger RNA guides the synthesis of polypeptide chains by providing the sequence of codons that direct the order of amino acids.

mRNA is transcribed from DNA, but only one DNA strand (the template strand) is copied.
The coding strand of DNA has the same sequence as the mRNA (except T is replaced by U).

Strand	Sequence	Role
Coding Strand	5'-ATGGGCGTC-3'	Same as mRNA (T→U)
Template Strand	3'-TACCCGCAG-5'	Used for mRNA synthesis
mRNA	5'-AUGGGCGUC-3'	Translated into protein

Cell-Free Systems and Deciphering the Code

Marshall Nirenberg and J. Heinrich Matthaei used cell-free systems to study protein synthesis and decipher the genetic code.

Synthetic RNAs of known sequence were added to cell-free extracts.
Polynucleotide phosphorylase was used to make synthetic RNA molecules.
Homopolymers (e.g., poly(U)) led to the incorporation of specific amino acids (e.g., UUU codes for phenylalanine).
Copolymer experiments and alternating sequence RNAs (Khorana) allowed assignment of all codons.

The Codon Dictionary

Of the 64 possible codons in mRNA, 61 code for amino acids, one (AUG) is a start codon, and three (UAA, UAG, UGA) are stop codons.

Codon	Function
AUG	Start codon (also codes for methionine)
UAA, UAG, UGA	Stop codons (terminate translation)
61 other codons	Specify amino acids

Unambiguous and Degenerate Code

Each codon has only one meaning (unambiguous), but many amino acids are specified by multiple codons (degenerate).

Mutations in degenerate codons may change the amino acid sequence, but some changes are silent (do not alter the amino acid).

Universality of the Genetic Code

The genetic code is nearly universal among all organisms, with only a few exceptions (e.g., mitochondria, some bacteria).

Most organisms use the same basic genetic code.
Exceptions include certain codons in mitochondria and some bacteria that specify nonstandard amino acids.

Key Terms and Concepts

Gene: Functional unit of DNA encoding polypeptides or functional RNAs.
Codon: Three-nucleotide sequence in mRNA specifying an amino acid.
Template Strand: DNA strand used for mRNA synthesis.
Coding Strand: DNA strand with the same sequence as mRNA (except T→U).
Frameshift Mutation: Mutation causing a shift in the reading frame due to indel.
Alternative Splicing: Process by which different mRNAs are produced from the same gene.

Summary Table: Properties of the Genetic Code

Property	Description
Triplet	Three nucleotides per codon
Degenerate	Multiple codons for most amino acids
Nonoverlapping	Each nucleotide is part of only one codon
Unambiguous	Each codon specifies only one amino acid
Nearly Universal	Same code used by most organisms

Equations and Formulas

Number of possible codons:
Central Dogma:

Example: Sickle-Cell Anemia

Sickle-cell anemia is caused by a single nucleotide change in the gene encoding the β-globin polypeptide, resulting in the substitution of valine for glutamic acid. This demonstrates the direct relationship between gene sequence and protein structure.

Additional info:

Tables and some details inferred from slide images and standard textbook knowledge.
Expanded explanations provided for clarity and completeness.