Gene Expression: From Gene to Protein – Transcription, Translation, and the Genetic Code

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Gene Expression: From Gene to Protein

Introduction to Gene Expression

Gene expression is the process by which the information encoded in a gene is used to direct the synthesis of a protein or, in some cases, functional RNAs. This process is fundamental to the manifestation of inherited traits and the functioning of all living cells. The two main stages of gene expression are transcription and translation.

Genotype refers to the genetic makeup of an organism, while phenotype refers to the observable traits.
Proteins are the link between genotype and phenotype, as they carry out the instructions encoded by genes.
Gene expression involves the flow of genetic information from DNA to RNA to protein.

Historical Foundations: Genes and Enzymes

Evidence from Metabolic Defects

Early studies of inherited diseases, such as alkaptonuria, led Archibald Garrod to propose that genes dictate phenotypes through enzymes. He suggested that symptoms of inherited diseases reflect an inability to synthesize a particular enzyme, resulting in metabolic blocks.

Each step in a metabolic pathway is catalyzed by a specific enzyme.
Mutations in genes can lead to the absence or malfunction of these enzymes, causing metabolic disorders.

Beadle and Tatum’s Experiments with Neurospora

George Beadle and Edward Tatum used the bread mold Neurospora crassa to demonstrate the relationship between genes and enzymes. By inducing mutations and analyzing nutritional requirements, they established the one gene–one enzyme hypothesis.

Mutant strains unable to synthesize specific nutrients were identified as nutritional mutants.
Each mutant was defective in a single step of a metabolic pathway, corresponding to a specific enzyme.

Beadle and Tatum's experimental approach with Neurospora One gene–one enzyme hypothesis illustrated with arginine synthesis pathway

Example: Mutants requiring arginine were grouped based on which step in the arginine synthesis pathway was blocked, supporting the idea that each gene encodes a specific enzyme.

Refinements to the One Gene–One Enzyme Hypothesis

Further research revealed that not all proteins are enzymes and that some proteins are composed of multiple polypeptides, each encoded by a separate gene. The hypothesis was refined to the one gene–one polypeptide concept. Additionally, some genes code for functional RNAs rather than proteins.

Alternative splicing allows a single gene to code for multiple related polypeptides.
Some genes produce RNAs (e.g., tRNA, rRNA) that are not translated into proteins.

Basic Principles of Transcription and Translation

Overview of the Flow of Genetic Information

The central dogma of molecular biology describes the directional flow of genetic information: DNA → RNA → Protein. This process involves two main stages:

Transcription: Synthesis of RNA from a DNA template.
Translation: Synthesis of a polypeptide using the information in mRNA.

Central dogma: DNA to RNA to Protein

Transcription

During transcription, a gene's DNA sequence is copied to produce a complementary RNA strand. In protein-coding genes, this RNA is called messenger RNA (mRNA).

RNA differs from DNA by having ribose sugar and uracil (U) instead of thymine (T).
Only one DNA strand (the template strand) is transcribed for each gene.

Translation

Translation is the process by which the sequence of an mRNA is decoded to build a polypeptide, with the help of ribosomes. The mRNA sequence is read in sets of three nucleotides, called codons, each specifying an amino acid.

Translation occurs in the cytoplasm at ribosomes.
Each codon corresponds to a specific amino acid or a stop signal.

Differences Between Prokaryotic and Eukaryotic Gene Expression

In bacteria (prokaryotes), transcription and translation are coupled, as there is no nucleus. In eukaryotes, transcription occurs in the nucleus, and the resulting pre-mRNA undergoes processing before being exported to the cytoplasm for translation.

Comparison of gene expression in bacterial and eukaryotic cells RNA processing in eukaryotic cells

The Genetic Code

Codons: Triplets of Nucleotides

The genetic code is based on triplets of nucleotides called codons. Each codon specifies one amino acid. With four nucleotide bases, there are 64 possible codons, more than enough to code for the 20 amino acids.

The code is redundant (more than one codon may specify the same amino acid) but not ambiguous (no codon specifies more than one amino acid).
The reading frame (grouping of codons) is crucial for correct translation.

Transcription and translation: DNA to mRNA to protein Codon table for mRNA

Cracking the Genetic Code

Experiments in the 1960s, such as those by Marshall Nirenberg, deciphered which codons correspond to which amino acids. The codon table is now a fundamental tool in molecular biology.

Alternative codon table representation

Universality and Evolution of the Genetic Code

The genetic code is nearly universal among all organisms, indicating a common evolutionary origin. Genes from one species can often be expressed in another, a principle used in genetic engineering.

Evidence for evolution: expression of genes from different species

Example: Bacteria can be engineered to produce human proteins, such as insulin, by introducing human genes.

Summary Table: Key Concepts in Gene Expression

Concept	Description
Gene Expression	Process by which genetic information is used to synthesize proteins or functional RNAs
Transcription	Synthesis of RNA from a DNA template
Translation	Synthesis of a polypeptide from an mRNA template
Codon	Three-nucleotide sequence in mRNA that specifies an amino acid
Redundancy	Multiple codons can code for the same amino acid
Universality	Genetic code is shared by almost all organisms

Key Equations and Concepts

Central Dogma:
Codon Calculation: possible codons (4 bases, 3 positions)

Conclusion

Gene expression is a central concept in biology, linking genetic information to cellular function and phenotype. The processes of transcription and translation, governed by the nearly universal genetic code, are fundamental to life and have profound implications for biotechnology and medicine.