Skip to main content
Back

Genomic Technologies: DNA Sequencing, Genomics, and Multi-Omics Applications

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Recombinant DNA Technology and Genetically Modified Organisms (GMOs)

Introduction to Recombinant DNA and GMOs

Recombinant DNA technology enables the manipulation and combination of DNA from different sources to create genetically modified organisms (GMOs). This technology is foundational for modern genetics, biotechnology, and genomics.

  • Genetically Modified Organisms (GMOs): Organisms whose genetic material has been altered using recombinant DNA methods to introduce new traits or functions.

  • Recombinant DNA: DNA molecules formed by laboratory methods of genetic recombination, such as molecular cloning, to bring together genetic material from multiple sources.

  • Applications: Agriculture (e.g., pest-resistant crops), medicine (e.g., insulin production), and research (e.g., gene function studies).

How are GMO plants made?

Polymerase Chain Reaction (PCR) and DNA Amplification

Principles and Applications of PCR

The polymerase chain reaction (PCR) is a technique used to amplify specific DNA segments, making millions of copies from a small initial sample. PCR is essential for genetic analysis, cloning, and sequencing.

  • Key Steps: Denaturation, annealing, and extension using DNA polymerase.

  • Applications: Genetic testing, forensic analysis, cloning, and preparation for sequencing.

DNA Sequencing Technologies

Overview of DNA Sequencing

DNA sequencing determines the precise order of nucleotides (A, T, C, G) in a DNA molecule. Sequencing technologies have evolved through three generations, each with distinct features and applications.

  • First Generation: Sanger sequencing (chain-termination method).

  • Second Generation: Next Generation Sequencing (NGS), e.g., Illumina platforms.

  • Third Generation: Single-molecule sequencing, e.g., PacBio and Nanopore.

DNA sequencer (NextSeq 500)

First Generation: Sanger Sequencing

Sanger sequencing uses chain-terminating dideoxynucleotides (ddNTPs) to generate DNA fragments of varying lengths, which are then separated and analyzed to determine the DNA sequence.

  • Chain Termination: Incorporation of ddNTPs halts DNA synthesis due to the absence of a 3'-OH group.

  • Detection: Fragments are separated by size using gel electrophoresis and detected by fluorescence or radioactivity.

  • Read Length: Typically 500–1,000 base pairs per reaction.

Sanger sequencing gel and readout

Mechanism of Chain Termination

  • dNTP vs. ddNTP: ddNTPs lack a 3'-OH group, preventing further elongation of the DNA strand.

Structural difference between dNTP and ddNTP

Sanger Sequencing Workflow

  • PCR with fluorescent ddNTPs: DNA is amplified with a mixture of dNTPs and fluorescently labeled ddNTPs.

  • Capillary Gel Electrophoresis: Fragments are separated by size.

  • Laser Detection: Fluorescent signals are detected and translated into sequence data.

PCR with fluorescent, chain-terminating ddNTPs Size separation by capillary gel electrophoresis Laser excitation & detection by sequencing machine

Second Generation: Next Generation Sequencing (NGS)

NGS technologies, such as Illumina sequencing, enable massively parallel sequencing of millions of short DNA fragments, increasing throughput and reducing cost per base.

  • Sequencing by Synthesis: Each nucleotide incorporation is detected in real time.

  • Applications: Whole genome sequencing, transcriptomics, and metagenomics.

  • Read Length: Typically 50–500 base pairs per fragment.

Massively parallel DNA sequencing Illumina sequencing steps

Third Generation: Single-Molecule Sequencing

Third-generation sequencing technologies, such as PacBio and Nanopore, sequence single DNA molecules in real time, producing much longer reads (up to tens of thousands of base pairs).

  • PacBio SMRT Sequencing: Uses nanowells and real-time detection of nucleotide incorporation.

  • Nanopore Sequencing: DNA passes through a nanopore, and changes in electrical current are used to identify bases.

  • Advantages: Long reads, detection of epigenetic modifications, and real-time analysis.

Single molecule DNA sequencing (Nanopore) PacBio SMRT sequencing workflow

Comparison of Sequencing Technologies

Generation

Technology

Read Length

Throughput

Key Features

First

Sanger

500–1,000 bp

Low

High accuracy, single gene

Second

Illumina NGS

50–500 bp

High

Massively parallel, short reads

Third

PacBio, Nanopore

10,000+ bp

Very High

Long reads, real-time

Genomic Analysis and the Human Genome Project

What is Genomics?

Genomics is the study of the complete set of DNA (genome) in an organism, including its structure, function, evolution, and mapping. It encompasses the identification of genes, regulatory elements, and their interactions.

  • Structural Genomics: Determining the DNA sequence and mapping genes.

  • Functional Genomics: Assigning biological functions to genomic elements.

Whole Genome Sequencing and Assembly

Whole genome sequencing (WGS) involves determining the complete DNA sequence of an organism's genome. Genome assembly reconstructs the genome from short sequence reads.

  • DNA Libraries: Collections of DNA fragments representing the genome (genomic or cDNA libraries).

  • Assembly: Overlapping sequence reads are merged to form contigs and scaffolds, which are then mapped to chromosomes.

The Human Genome Project (HGP)

The Human Genome Project was an international effort to sequence and map all human genes. It provided a reference genome for biomedical research and comparative genomics.

  • Timeline: Draft released in 2000, declared complete in 2003, gapless assembly in 2022.

  • Methods: BAC/YAC cloning, Sanger sequencing, and later NGS.

  • Findings: ~20,000 protein-coding genes, 99.9% similarity among humans, identification of SNPs and CNVs.

Human Genome Project

Genome Assembly: Contigs, Scaffolds, and Mapping

Genome assembly involves aligning and merging sequence reads into longer contiguous sequences (contigs), which are then ordered into scaffolds and mapped to chromosomes using genetic and physical maps.

  • Genetic Maps: Based on recombination frequencies (centimorgans, cM).

  • Physical Maps: Based on actual base-pair distances (bp, kb, Mb).

Multi-Omics Technologies

Overview of Multi-Omics

Multi-omics integrates data from genomics, transcriptomics, epigenomics, proteomics, and metabolomics to provide a comprehensive view of biological systems.

  • Genomics: DNA sequence analysis.

  • Transcriptomics: Analysis of all expressed RNA (mRNA, ncRNA).

  • Epigenomics: Study of DNA methylation, histone modifications, and chromatin structure.

  • Proteomics: Analysis of all proteins and their modifications.

  • Metabolomics: Analysis of metabolites and small molecules.

Applications of Multi-Omics

  • Genetic Testing: Identification of disease-associated variants.

  • Genome-Wide Association Studies (GWAS): Linking genetic variants to traits and diseases.

  • Synthetic Biology: Engineering organisms with new functions.

  • Comparative Genomics: Comparing genomes across species to study evolution and function.

  • Metagenomics: Sequencing DNA from entire microbial communities to study diversity and function.

Transcriptomics and RNA Sequencing

Transcriptomics analyzes gene expression at the RNA level, using techniques such as RNA sequencing (RNA-seq) to quantify and compare gene expression across tissues and conditions.

  • Bulk RNA-Seq: Measures average gene expression in a population of cells.

  • Single-Cell RNA-Seq: Resolves gene expression at the single-cell level, revealing cellular heterogeneity.

Epigenomics

Epigenomics studies heritable changes in gene expression that do not involve changes to the DNA sequence, such as DNA methylation and histone modification.

  • Methods: Whole genome bisulfite sequencing (WGBS), ATAC-seq, ChIP-seq.

  • Applications: Cancer research, developmental biology, environmental studies.

Proteomics

Proteomics involves the large-scale study of proteins, including their expression, structure, and function. Mass spectrometry (LC-MS/MS) is a key technique for protein identification and quantification.

  • Applications: Disease biomarker discovery, cancer research, functional annotation of genomes.

Summary Table: Multi-Omics Technologies

Omics Field

Analyte

Key Methods

Applications

Genomics

DNA

WGS, NGS

Gene discovery, GWAS

Transcriptomics

RNA

RNA-seq

Gene expression analysis

Epigenomics

DNA/histones

WGBS, ChIP-seq

Epigenetic regulation

Proteomics

Proteins

LC-MS/MS

Protein function, biomarkers

Metabolomics

Metabolites

MS, NMR

Metabolic pathways

Additional info: The integration of multi-omics data is crucial for systems biology, personalized medicine, and understanding complex traits and diseases.

Pearson Logo

Study Prep