Skip to main content
Back

Genomes and Their Evolution: Study Guide for Chapter 21 (Campbell Biology)

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Genomics and Bioinformatics

Introduction to Genomics

Genomics is the study of the entire set of genes (genome) and their interactions within an organism. It encompasses the sequencing, mapping, and analysis of genomes to understand biological function and evolution.

  • Genomics: The comprehensive study of genomes, including their structure, function, evolution, and mapping.

  • Bioinformatics: The application of computational tools to store, analyze, and interpret biological data, especially large-scale genomic data.

  • Applications: Includes gene discovery, evolutionary studies, and medical diagnostics.

Sequencing and Analyzing Genomes

Modern genome sequencing involves fragmenting DNA, sequencing the fragments, and assembling them into a complete genome using computational methods.

  • DNA Fragmentation: DNA is cut into overlapping fragments for sequencing.

  • Cloning and Sequencing: Fragments are cloned into vectors and sequenced individually.

  • Assembly: Computer software orders the sequences into a complete genome.

Steps in genome sequencing and assembly

Gene Cloning and Recombinant DNA Technology

Gene cloning involves inserting a gene of interest into a plasmid vector, which is then introduced into a bacterial cell for replication and expression.

  • Restriction Enzymes: Used to cut DNA at specific sites, producing sticky ends for ligation.

  • DNA Ligase: Seals the strands to form recombinant DNA.

  • Applications: Production of proteins, gene therapy, and genetic engineering.

Formation of recombinant plasmid using restriction enzymes and DNA ligase

Identifying Genes and Their Functions

Gene Annotation and Functional Studies

Gene annotation is the process of identifying protein-coding genes within DNA sequences and predicting their functions.

  • Gene Annotation: Automated comparison of unknown genes to known genes in databases.

  • Functional Confirmation: Techniques like RNA-seq and CRISPR-Cas9 are used to study gene function by observing phenotypic changes after gene knockout.

Proteomics and Systems Biology

Proteomics is the large-scale study of proteins, their properties, and their functional integration in biological systems. Systems biology integrates genomics, proteomics, and other data to understand complex biological networks.

  • Proteome: The entire set of proteins expressed by a cell or organism.

  • Systems Biology: Focuses on the interactions and integration of genes and proteins within biological systems.

Systems biology network diagram

Medical Applications

Genomic technologies are used in medicine to study gene expression patterns in diseases such as cancer.

  • Cancer Genome Atlas: Project comparing gene sequences and expression in cancer vs. normal cells.

  • Microarrays: Silicon chips containing thousands of gene probes used to analyze gene expression.

Gene expression microarray chip

Genome Structure and Variation

Genome Size, Gene Number, and Density

Genomes vary widely in size, number of genes, and gene density across different organisms.

  • Gene Density: Number of genes per megabase (Mb) of DNA; varies between prokaryotes and eukaryotes.

  • Alternative Splicing: Allows vertebrate genomes to produce multiple polypeptides from a single gene.

Genome sizes and estimated numbers of genes in various organisms

Noncoding DNA and Repetitive Elements

Much of the genome consists of noncoding DNA, including pseudogenes and repetitive sequences.

  • Pseudogenes: Former genes that have accumulated mutations and are nonfunctional.

  • Repetitive DNA: Includes transposable elements and other repeated sequences.

  • Sequence Conservation: Some noncoding regions are highly conserved, indicating important functions.

Pie chart of human genome composition

Transposable Elements and Genome Evolution

Transposons and Retrotransposons

Transposable elements are DNA sequences that can move within the genome, contributing to genetic diversity and evolution.

  • Transposons: Move via a DNA intermediate, require transposase enzyme.

  • Retrotransposons: Move via an RNA intermediate, require reverse transcriptase.

  • Alu Elements: A family of repetitive sequences in primates, some of which regulate gene expression.

  • LINE-1 (L1): Retrotransposons that may affect gene expression and neuronal diversity.

Transposon movement mechanismRetrotransposon movement mechanism

Other Repetitive DNA

Simple sequence DNA and short tandem repeats (STRs) are common in centromeres and telomeres, playing structural roles.

  • STRs: Repeating units of 2–5 nucleotides; number of repeats varies among individuals.

Multigene Families and Genome Evolution

Multigene Families

Multigene families are collections of two or more identical or similar genes, often coding for related proteins.

  • Identical DNA Sequences: Often clustered, such as rRNA gene families.

  • Globin Gene Families: Human α-globin and β-globin gene families are examples of multigene families with related functions.

Ribosomal RNA gene family structureHuman α-globin and β-globin gene families

Genome Evolution Mechanisms

Genome evolution is driven by mutation, duplication, rearrangement, and polyploidy.

  • Polyploidy: Extra sets of chromosomes can lead to gene diversification.

  • Chromosome Structure Alterations: Fusion, duplication, and rearrangement of chromosomes contribute to evolutionary change.

Human and chimpanzee chromosome comparisonHuman and mouse chromosome comparisonGene duplication and deletion via transposable elements

Evolution of Genes with Related and Novel Functions

Gene duplication and mutation can lead to families of genes with related functions or novel functions.

  • Globin Genes: Evolved from a common ancestral gene through duplication and divergence.

  • Lysozyme and α-lactalbumin: Example of gene duplication leading to novel functions.

Evolution of globin gene familiesLysozyme and α-lactalbumin sequence comparison

Exon Duplication and Shuffling

Errors in meiosis can result in exon duplication or shuffling, creating new gene variants.

  • Exon Shuffling: Mixing and matching of exons within or between genes, leading to new protein functions.

Role of Transposable Elements in Genome Evolution

Transposable elements can facilitate recombination, alter gene expression, and create new sites for alternative splicing.

  • Effects: Usually detrimental, but occasionally advantageous for evolution.

Comparative Genomics and Evolutionary Development

Comparing Genome Sequences

Comparative genomics reveals evolutionary relationships and clarifies mechanisms of development and diversity.

  • Phylogenetic Trees: Used to represent relationships among species based on genome comparisons.

  • Highly Conserved Genes: Help clarify relationships among distantly related species.

Phylogenetic tree of humans, chimpanzees, and mice

Comparing Closely Related Species

Genomes of closely related species are organized similarly, but differ in sequence and gene expression.

  • Human vs. Chimpanzee: Differ by 1.2% at single base pairs and 2.7% due to insertions/deletions.

  • FOXP2 Gene: Rapidly evolving in humans, associated with speech and language.

FOXP2 gene disruption experiment in mice

Comparing Genomes Within a Species

Genetic variation within humans is due to single nucleotide polymorphisms, inversions, deletions, and duplications.

  • Copy-Number Variants: Useful for studying human evolution and health.

Evolutionary Developmental Biology (Evo-Devo)

Evo-devo compares developmental processes across organisms, revealing conserved genes and regulatory sequences.

  • Homeotic Genes: Include a homeobox sequence, highly conserved across animals.

  • Hox Genes: Control body plan and segment identity in animals.

  • Regulatory Sequence Changes: Can lead to major changes in body form.

Homeotic gene expression in fruit fly and mouseHox gene expression in brine shrimp and grasshopper

Summary Table: Genome Features Across Domains

This table compares genome size, gene number, gene density, introns, and noncoding DNA across Bacteria, Archaea, and Eukarya.

Bacteria

Archaea

Eukarya

Genome size

Most are 1–6 Mb

Most are 1–6 Mb

Most are 10–4,000 Mb, but a few are much larger

Number of genes

1,500–7,500

1,500–7,500

Most are 5,000–45,000

Gene density

Higher than in eukaryotes

Higher than in eukaryotes

Lower than in prokaryotes (Within eukaryotes, lower density is correlated with larger genomes.)

Introns

None in protein-coding genes

Present in some genes

Present in most genes of multicellular eukaryotes, but only in some genes of unicellular eukaryotes

Other noncoding DNA

Very little

Very little

Can exist in large amounts; generally more repetitive noncoding DNA in multicellular eukaryotes

Summary table of genome features across domains

Key Equations and Concepts

  • Gene Density Formula:

  • Alternative Splicing: (in vertebrates)

Pearson Logo

Study Prep