BackGene Expression: From Gene to Protein (Transcription and Translation)
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Gene Expression: From Gene to Protein
Introduction to Gene Expression
Gene expression is the process by which the information encoded in a gene is used to direct the synthesis of a protein or functional RNA. This process is fundamental to the manifestation of inherited traits and the functioning of all living cells. The flow of genetic information from DNA to RNA to protein is central to biology and is often referred to as the "central dogma."
Genotype refers to the genetic makeup of an organism, while phenotype is the observable trait.
Proteins are the link between genotype and phenotype, as they carry out most cellular functions.
Gene expression involves two main stages: transcription (DNA to RNA) and translation (RNA to protein).
Historical Foundations: Genes and Enzymes
Evidence from Metabolic Defects
Early studies of inherited diseases, such as alkaptonuria, led to the hypothesis that genes dictate phenotypes through enzymes. Archibald Garrod proposed that the symptoms of certain diseases result from the inability to synthesize specific enzymes.
Alkaptonuria: Characterized by black urine due to the accumulation of alkapton, which cannot be broken down because of a missing enzyme.
Each step in a metabolic pathway is catalyzed by a specific enzyme, and mutations can block these steps, leading to metabolic disorders.
Nutritional Mutants and the One Gene–One Enzyme Hypothesis
Beadle and Tatum's experiments with the bread mold Neurospora crassa provided strong evidence that each gene encodes a specific enzyme. By inducing mutations and analyzing nutritional requirements, they demonstrated that different mutants were blocked at different steps in the arginine biosynthesis pathway.
Wild-type Neurospora can grow on minimal medium, synthesizing all necessary nutrients.
Mutants unable to grow on minimal medium but able to grow on supplemented medium were identified as nutritional mutants.
Each mutant lacked a specific enzyme required for a step in the metabolic pathway.


Refinements to the One Gene–One Enzyme Hypothesis
Further research revealed that not all proteins are enzymes and that some proteins are composed of multiple polypeptides, each encoded by a separate gene. The hypothesis was updated to "one gene–one polypeptide." Additionally, some genes code for functional RNAs rather than proteins.
Example: Hemoglobin consists of two types of polypeptide chains, each encoded by a different gene.
Some genes produce RNA molecules (e.g., tRNA, rRNA) that are not translated into proteins.
Basic Principles of Transcription and Translation
Transcription: DNA to RNA
Transcription is the synthesis of RNA using a DNA template. The resulting RNA molecule is complementary to the DNA template strand and carries the genetic message to the protein-synthesizing machinery of the cell.
Messenger RNA (mRNA): Carries the genetic code from DNA to ribosomes.
RNA differs from DNA by having ribose sugar and uracil (U) instead of thymine (T).
Translation: RNA to Protein
Translation is the synthesis of a polypeptide using the information in mRNA. Ribosomes facilitate the decoding of mRNA codons into a specific sequence of amino acids, forming a protein.
Occurs in the cytoplasm at ribosomes.
Involves tRNA molecules that bring amino acids to the ribosome according to the mRNA codon sequence.
Differences in Prokaryotic and Eukaryotic Gene Expression
In bacteria, transcription and translation are coupled because there is no nucleus. In eukaryotes, transcription occurs in the nucleus, and the mRNA is processed before being exported to the cytoplasm for translation.


The Central Dogma of Molecular Biology
The central dogma describes the directional flow of genetic information: DNA → RNA → Protein.
Exceptions exist, such as reverse transcription in retroviruses, but the general flow holds for most organisms.

The Genetic Code
Codons: Triplets of Nucleotides
The genetic code is based on triplets of nucleotides called codons. Each codon specifies a particular amino acid or a stop signal during translation.
There are 64 possible codons (43), more than enough to code for 20 amino acids.
Codons are read in the 5′ → 3′ direction on the mRNA.
The sequence of codons determines the sequence of amino acids in the resulting polypeptide.

Deciphering the Genetic Code
Experiments in the 1960s revealed the amino acid specified by each codon. Most amino acids are specified by more than one codon (redundancy), but each codon specifies only one amino acid (no ambiguity).
AUG is both the start codon and codes for methionine.
Three codons (UAA, UAG, UGA) are stop signals.


Reading Frame
The correct grouping of nucleotides into codons (the reading frame) is essential for accurate translation. Shifting the reading frame results in a completely different and usually nonfunctional protein.
Universality and Evolution of the Genetic Code
The genetic code is nearly universal among all organisms, providing strong evidence for the common ancestry of life. Genes from one species can often be expressed in another, a principle used in genetic engineering.

Summary Table: Key Concepts in Gene Expression
Concept | Description | Example |
|---|---|---|
Gene Expression | Process by which genetic information is used to synthesize proteins or RNAs | Transcription and translation of the hemoglobin gene |
Transcription | Synthesis of RNA from a DNA template | mRNA production in the nucleus |
Translation | Synthesis of a polypeptide from mRNA | Protein synthesis at the ribosome |
Codon | Three-nucleotide sequence in mRNA specifying an amino acid | AUG codes for methionine |
Central Dogma | Flow of genetic information: DNA → RNA → Protein | Gene expression in all living cells |
Key Equations and Concepts
Number of codons:
Relationship between nucleotides and amino acids:
Conclusion
Gene expression is a central process in biology, linking genetic information to cellular function and phenotype. The universality of the genetic code underscores the shared evolutionary history of all life and enables powerful applications in biotechnology and medicine.