Skip to main content
Back

Genomic Technologies and Applications: DNA Sequencing, Genomics, and Multi-Omics

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Recombinant DNA Technology and Genetically Modified Organisms (GMOs)

Introduction to GMOs and Recombinant DNA

Recombinant DNA technology enables the creation of genetically modified organisms (GMOs) by combining DNA from different sources. This technology is foundational in modern genetics, agriculture, and medicine.

  • Genetically Modified Organisms (GMOs): Organisms whose genetic material has been altered using recombinant DNA methods.

  • Recombinant DNA: DNA molecules formed by laboratory methods of genetic recombination to bring together genetic material from multiple sources.

  • Applications: Agriculture (GMO crops), medicine (insulin production), research (model organisms).

How are GMO plants made?

DNA Sequencing Technologies

Overview of DNA Sequencing

DNA sequencing is the process of determining the precise order of nucleotides within a DNA molecule. Advances in sequencing technology have revolutionized genomics, enabling high-throughput and cost-effective analysis.

  • Goal: To determine the complete sequence of nucleotide bases (A, T, C, G) in a DNA sample.

  • Applications: Genome mapping, disease gene identification, evolutionary studies.

Generations of DNA Sequencing

DNA sequencing technologies are classified into three generations, each with distinct methodologies and capabilities.

  • First Generation: Sanger sequencing – chain-termination method, low throughput, high accuracy for small fragments.

  • Second Generation: Next Generation Sequencing (NGS) – massively parallel, short reads, high throughput, cost-effective.

  • Third Generation: Single-molecule sequencing – long reads, real-time, high throughput, suitable for complex genomes.

Sanger Sequencing (First Generation)

Sanger sequencing, also known as chain-termination sequencing, uses dideoxynucleotides (ddNTPs) to terminate DNA synthesis at specific bases, allowing the sequence to be determined by fragment length analysis.

  • Key Principle: Incorporation of ddNTPs, which lack a 3'-OH group, terminates DNA strand elongation.

  • Detection: Fragments are separated by size using gel electrophoresis, and the sequence is read from the pattern of terminated fragments.

  • Applications: Sequencing single genes or small DNA fragments.

Original Sanger sequencing gel and readout Structural difference between dNTP and ddNTP

Sanger Sequencing Workflow

  1. PCR with fluorescent, chain-terminating ddNTPs

  2. Size separation by capillary gel electrophoresis

  3. Laser excitation and detection by sequencing machine

PCR with fluorescent, chain-terminating ddNTPs Size separation by capillary gel electrophoresis Laser excitation and detection by sequencing machine

Next Generation Sequencing (NGS, Second Generation)

NGS technologies, such as Illumina sequencing, enable massively parallel sequencing of millions of short DNA fragments, greatly increasing throughput and reducing cost per base.

  • Sequencing by Synthesis: Each nucleotide incorporation is detected in real time, allowing for high-throughput data collection.

  • Applications: Whole genome sequencing, transcriptomics, metagenomics.

  • Advantages: High throughput, cost-effective, suitable for large-scale studies.

Massively parallel DNA sequencing Illumina NextSeq 500 DNA sequencer Illumina sequencing steps

Third Generation Sequencing

Third generation sequencing technologies, such as PacBio and Oxford Nanopore, sequence single DNA molecules in real time, producing long reads that are valuable for resolving complex genomic regions.

  • Single-Molecule Real-Time (SMRT) Sequencing: DNA polymerase synthesizes DNA in a nanowell, and nucleotide incorporation is detected in real time.

  • Nanopore Sequencing: DNA passes through a nanopore, and changes in electrical current are used to identify bases.

  • Advantages: Long read lengths, real-time data, ability to resolve repetitive regions and structural variants.

Single molecule DNA sequencing (Nanopore) Nanopore sequencing device PacBio SMRT sequencing workflow

Comparison of Sequencing Technologies

Generation

Technology

Read Length

Throughput

Applications

First

Sanger

500-1,000 bp

Low

Single genes, small fragments

Second

Illumina NGS

50-500 bp

High

Whole genomes, transcriptomes

Third

PACBIO, Nanopore

10,000+ bp

Very High

Complex genomes, structural variants

Genomics and Genome Analysis

What is Genomics?

Genomics is the study of the complete set of DNA (the genome) in an organism. It encompasses the structure, function, evolution, and mapping of genomes.

  • Structural Genomics: Determining the DNA sequence and physical structure of the genome.

  • Functional Genomics: Assigning biological functions to genomic elements, such as genes and regulatory regions.

  • Comparative Genomics: Comparing genomes across species to identify conserved and unique features.

Genome Sequencing and Assembly

Whole genome sequencing (WGS) involves sequencing the entire genome and assembling the sequence reads into a complete genome.

  • DNA Libraries: Collections of DNA fragments representing the genome, used as input for sequencing.

  • Genome Assembly: Overlapping sequence reads are matched and assembled into contiguous sequences (contigs) and scaffolds.

  • Reference Genome: A composite genome assembled from multiple individuals, used as a standard for comparison.

DNA sequencing workflow Genomic DNA library construction cDNA library construction

The Human Genome Project (HGP)

The Human Genome Project was an international effort to sequence and map all human genes. It provided a reference genome and revealed the genetic basis of many diseases.

  • Timeline: Draft released in 2000, declared complete in 2003, final gapless assembly in 2022.

  • Methods: BAC/YAC cloning, Sanger sequencing, later replaced by NGS.

  • Findings: ~20,000 protein-coding genes, 99.9% similarity across humans, identification of SNPs and CNVs.

Human Genome Project Reference genome composition

Genome Assembly: Contigs, Scaffolds, and Chromosomes

Genome assembly is the process of reconstructing the original genome from sequence reads. Contigs are continuous sequences, scaffolds are ordered sets of contigs, and chromosomes are assembled from scaffolds.

  • Contig: A contiguous sequence of DNA assembled from overlapping reads.

  • Scaffold: A series of contigs joined together using additional information, such as genetic or physical maps.

  • Chromosome: The final, ordered assembly of scaffolds.

Genetic Mapping vs. Physical Mapping

Type

Basis

Unit

Resolution

Genetic Map

Recombination frequency

centimorgan (cM)

Low (marker order)

Physical Map

Physical distance

base pairs (bp, kb, Mb)

High (precise location)

Multi-Omics Technologies

Introduction to Multi-Omics

Multi-omics integrates data from various molecular levels to provide a comprehensive view of biological systems. The main omics fields include genomics, transcriptomics, epigenomics, proteomics, and metabolomics.

  • Genomics: Study of the entire DNA sequence.

  • Transcriptomics: Analysis of all expressed RNA molecules.

  • Epigenomics: Study of DNA methylation, histone modifications, and chromatin accessibility.

  • Proteomics: Analysis of all proteins in a cell or tissue.

  • Metabolomics: Study of metabolites and small molecules involved in metabolism.

Transcriptomics

Transcriptomics involves the global analysis of gene expression, identifying which genes are expressed and at what levels in specific tissues or conditions.

  • RNA Sequencing (RNA-seq): Uses NGS to quantify and compare gene expression.

  • Applications: Cancer diagnostics, developmental biology, disease research.

  • Bulk RNA-Seq: Measures average expression across many cells.

  • Single-Cell RNA-Seq: Resolves expression at the level of individual cells, revealing heterogeneity.

Epigenomics

Epigenomics studies heritable changes in gene expression that do not involve changes to the DNA sequence, such as DNA methylation and histone modification.

  • Methods: Whole genome bisulfite sequencing (WGBS), ATAC-seq, ChIP-seq.

  • Applications: Cancer research, developmental biology, environmental studies.

Proteomics

Proteomics is the large-scale study of proteins, including their expression, structure, and function.

  • Methods: Liquid chromatography-mass spectrometry (LC-MS/MS), Western blotting, ELISA.

  • Applications: Disease biomarker discovery, cancer research, functional annotation.

Metagenomics

Metagenomics involves sequencing the genomes of entire microbial communities directly from environmental samples, bypassing the need for culturing.

  • Applications: Microbiome studies, environmental monitoring, discovery of novel genes and organisms.

  • Case Study: Human Microbiome Project, NYC Subway Microbiota.

Summary Table: Multi-Omics Technologies

Omics Field

Analyte

Key Methods

Applications

Genomics

DNA

WGS, NGS

Genome mapping, disease gene discovery

Transcriptomics

RNA

RNA-seq

Gene expression profiling, diagnostics

Epigenomics

Epigenetic marks

WGBS, ATAC-seq, ChIP-seq

Regulation, development, disease

Proteomics

Proteins

LC-MS/MS

Biomarker discovery, functional studies

Metabolomics

Metabolites

Mass spectrometry

Metabolic profiling, disease research

Pearson Logo

Study Prep