Skip to main content
Back

Genomic Technologies: DNA Sequencing, Genomics, and Multi-Omics Applications

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Recombinant DNA Technology and Genetically Modified Organisms (GMOs)

Introduction to Recombinant DNA and GMOs

Recombinant DNA technology enables the manipulation and combination of DNA from different sources to create genetically modified organisms (GMOs). This technology is foundational for modern genetics, biotechnology, and genomics.

  • Genetically Modified Organisms (GMOs): Organisms whose genetic material has been altered using recombinant DNA methods to introduce new traits or functions.

  • Recombinant DNA: DNA molecules formed by laboratory methods of genetic recombination, such as molecular cloning, to bring together genetic material from multiple sources.

  • Applications: Agriculture (e.g., pest-resistant crops), medicine (e.g., insulin production), and research (e.g., gene function studies).

How are GMO plants made?

Polymerase Chain Reaction (PCR) and DNA Amplification

Principles and Applications of PCR

The Polymerase Chain Reaction (PCR) is a technique used to amplify specific DNA segments, making millions of copies from a small initial sample. PCR is essential for genetic analysis, cloning, and sequencing.

  • Key Steps: Denaturation, annealing, and extension cycles using DNA polymerase.

  • Applications: Genetic testing, cloning, forensic analysis, and preparation for sequencing.

DNA Sequencing Technologies

Overview of DNA Sequencing

DNA sequencing determines the precise order of nucleotides (A, T, C, G) in a DNA molecule. Sequencing technologies have evolved through three generations, each with distinct features and applications.

  • First Generation: Sanger sequencing (chain-termination method).

  • Second Generation: Next Generation Sequencing (NGS), e.g., Illumina platforms.

  • Third Generation: Single-molecule sequencing, e.g., PacBio and Nanopore.

DNA sequencer (NextSeq 500)

First Generation: Sanger Sequencing

Sanger sequencing uses chain-terminating dideoxynucleotides (ddNTPs) to generate DNA fragments of varying lengths, which are then separated and analyzed to determine the DNA sequence.

  • Chain Termination: Incorporation of ddNTPs halts DNA synthesis due to the absence of a 3'-OH group.

  • Detection: Fragments are separated by size using gel electrophoresis and detected by fluorescence or radioactivity.

  • Read Length: Typically 500–1,000 base pairs per reaction.

Sanger sequencing gel and readout Structural difference between dNTP and ddNTP

Second Generation: Next Generation Sequencing (NGS)

NGS technologies, such as Illumina sequencing, enable massively parallel sequencing of millions of short DNA fragments, increasing throughput and reducing cost per base.

  • Sequencing by Synthesis: Each nucleotide addition is detected in real time, allowing for high-throughput data generation.

  • Applications: Whole genome sequencing, transcriptomics, and large-scale genetic studies.

  • Read Length: Typically 50–500 base pairs per fragment.

Massively parallel DNA sequencing Illumina sequencing steps

Third Generation: Single-Molecule Sequencing

Third-generation sequencing technologies, such as PacBio and Nanopore, sequence single DNA molecules in real time, producing much longer reads and enabling the analysis of complex genomic regions.

  • PACBIO SMRT Sequencing: Uses nanowells and real-time detection of nucleotide incorporation by DNA polymerase.

  • Nanopore Sequencing: DNA strands pass through nanopores, and changes in electrical current are used to identify bases.

  • Read Length: Up to tens of thousands of base pairs per read.

  • Applications: Structural variant detection, genome assembly, and epigenetic analysis.

Single molecule DNA sequencing (Nanopore) Nanopore sequencing device

Comparison of Sequencing Technologies

Generation

Technology

Read Length

Throughput

Applications

First

Sanger

500–1,000 bp

Low

Single genes, small genomes

Second

Illumina NGS

50–500 bp

High

Whole genomes, transcriptomics

Third

PACBIO, Nanopore

10,000+ bp

High

Genome assembly, structural variants

Genomic Libraries and cDNA Libraries

Genomic DNA Libraries

Genomic libraries are collections of DNA fragments representing the entire genome of an organism, used for sequencing, mapping, and gene discovery.

  • Construction: Genomic DNA is fragmented and cloned into vectors (e.g., BACs, YACs).

  • Coverage: Includes both coding and noncoding regions.

  • Historical Use: Essential for the Human Genome Project; now largely replaced by whole genome sequencing.

Genomic DNA library construction

cDNA Libraries

cDNA libraries are collections of complementary DNA (cDNA) synthesized from mRNA, representing only the expressed genes in a cell or tissue.

  • Construction: mRNA is reverse transcribed into cDNA, which is then cloned.

  • Applications: Studying gene expression, identifying disease-related genes, and comparing normal vs. diseased tissues.

  • Modern Replacement: RNA sequencing (RNA-seq) provides more comprehensive expression analysis.

cDNA library synthesis

Human Genome Project and Genome Assembly

Human Genome Project (HGP)

The Human Genome Project was an international effort to sequence and map all human genes, providing a reference for genetic research and medicine.

  • Timeline: Draft released in 2000, declared complete in 2003, gapless assembly in 2022.

  • Methods: BAC/YAC cloning, Sanger sequencing, and assembly of overlapping fragments.

  • Impact: Revealed genetic similarities and diversity among humans and other species.

Human Genome Project reference sequence composition

Genome Assembly

Genome assembly involves piecing together short DNA sequence reads into longer contiguous sequences (contigs) and scaffolds, ultimately reconstructing entire chromosomes.

  • Contigs: Overlapping sequence reads are merged to form continuous sequences.

  • Scaffolds: Runs of contigs with no gaps, assembled using genetic and physical maps.

  • Reference Genome: Assembled from pooled DNA of multiple individuals, not a single person.

Genome assembly: contigs and scaffolds

Genetic Mapping vs. Physical Mapping

Genetic maps are based on recombination frequencies (centimorgans), while physical maps use actual base-pair distances. Both are essential for genome assembly and annotation.

  • Genetic Maps: Provide marker order but not precise distances.

  • Physical Maps: Offer detailed, base-pair level resolution.

Whole Genome Sequencing (WGS) and Shotgun Sequencing

WGS Workflow

Whole genome sequencing (WGS) uses random fragmentation and high-throughput sequencing to reconstruct entire genomes without cloning.

  • Steps: DNA extraction, fragmentation (sonication or enzymatic), size selection, library construction (adapters and PCR), sequencing, and bioinformatics assembly.

  • Advantages: Faster and more scalable than traditional cloning-based methods.

Features of the Human Genome

Key Characteristics

  • Genome Size: ~3.1 billion nucleotides

  • Protein-Coding Genes: ~20,000 (about 2% of the genome)

  • Genetic Diversity: ~99.9% identical among humans; diversity includes SNPs and CNVs

  • ENCODE Project: Catalogs functional elements, including genes, regulatory elements, and non-coding RNAs.

Multi-Omics Technologies

Overview of Omics

Multi-omics integrates genomics, transcriptomics, epigenomics, proteomics, and metabolomics to provide a comprehensive view of biological systems.

  • Genomics: Study of the complete DNA sequence.

  • Transcriptomics: Analysis of all expressed RNAs (mRNA, ncRNA).

  • Epigenomics: Study of DNA methylation, histone modifications, and chromatin structure.

  • Proteomics: Analysis of all proteins and their modifications.

  • Metabolomics: Study of metabolites and small molecules.

Comparative Genomics

Comparative genomics analyzes genome sequences across species to identify conserved and unique features, gene functions, and evolutionary relationships.

  • Applications: Disease gene discovery, evolutionary biology, and adaptation studies.

  • Example: Human and Neanderthal genomes share 99% similarity; 1–4% of non-African human DNA is inherited from Neanderthals.

Metagenomics

Metagenomics sequences DNA from entire microbial communities, revealing genetic diversity and novel functions without culturing organisms.

  • Applications: Environmental studies, human microbiome research, and discovery of new genes and pathways.

Functional Genomics

Functional genomics aims to determine the roles of genes and genomic elements using high-throughput approaches.

  • Transcriptome: All RNA molecules transcribed from the genome.

  • Epigenome: All chemical modifications to DNA and histones.

  • Proteome: All proteins encoded by the genome.

  • Techniques: RNA-seq, microarrays, ChIP-seq, and mass spectrometry.

Transcriptomics

Transcriptomics studies gene expression patterns in cells or tissues, both qualitatively and quantitatively.

  • RNA-seq: Uses NGS to catalog and quantify RNA molecules.

  • Applications: Cancer diagnostics, developmental biology, and disease research.

  • Bulk vs. Single-Cell RNA-seq: Bulk provides average expression; single-cell reveals cell-specific heterogeneity.

Epigenomics

Epigenomics investigates genome-wide epigenetic modifications, such as DNA methylation and histone modifications, which regulate gene expression without altering DNA sequence.

  • Techniques: Whole genome bisulfite sequencing (WGBS), ATAC-seq, ChIP-seq.

  • Applications: Cancer research, developmental biology, environmental studies.

Proteomics

Proteomics analyzes the structure, function, and interactions of all proteins in a cell or tissue.

  • Techniques: Liquid chromatography-mass spectrometry (LC-MS/MS), Western blotting, ELISA.

  • Applications: Cancer research, biomarker discovery, and functional annotation.

Summary Table: Multi-Omics Technologies

Omics Field

Analyte

Key Techniques

Applications

Genomics

DNA

WGS, NGS

Gene discovery, disease genetics

Transcriptomics

RNA

RNA-seq

Gene expression, diagnostics

Epigenomics

DNA/histones

WGBS, ChIP-seq

Gene regulation, cancer

Proteomics

Proteins

LC-MS/MS

Protein function, biomarkers

Metabolomics

Metabolites

MS, NMR

Metabolic pathways, disease

Pearson Logo

Study Prep