BackGenomic Technologies and Applications: DNA Sequencing, Genomics, and Multi-Omics
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Recombinant DNA Technology and Genetically Modified Organisms (GMOs)
Introduction to GMOs and Recombinant DNA
Recombinant DNA technology enables the creation of genetically modified organisms (GMOs) by combining DNA from different sources. This technology is foundational in modern genetics, agriculture, and medicine.
Genetically Modified Organisms (GMOs): Organisms whose genetic material has been altered using recombinant DNA methods.
Recombinant DNA: DNA molecules formed by laboratory methods of genetic recombination to bring together genetic material from multiple sources.
Applications: Agriculture (GMO crops), medicine (insulin production), research (model organisms).

DNA Sequencing Technologies
Overview of DNA Sequencing
DNA sequencing is the process of determining the precise order of nucleotides within a DNA molecule. Advances in sequencing technology have revolutionized genomics, enabling high-throughput and cost-effective analysis.
Goal: To determine the complete sequence of nucleotide bases (A, T, C, G) in a DNA sample.
Applications: Genome mapping, disease gene identification, evolutionary studies.
Generations of DNA Sequencing
DNA sequencing technologies are classified into three generations, each with distinct methodologies and capabilities.
First Generation: Sanger sequencing – chain-termination method, low throughput, high accuracy for small fragments.
Second Generation: Next Generation Sequencing (NGS) – massively parallel, short reads, high throughput, cost-effective.
Third Generation: Single-molecule sequencing – long reads, real-time, high throughput, suitable for complex genomes.
Sanger Sequencing (First Generation)
Sanger sequencing, also known as chain-termination sequencing, uses dideoxynucleotides (ddNTPs) to terminate DNA synthesis at specific bases, allowing the sequence to be determined by fragment length analysis.
Key Principle: Incorporation of ddNTPs, which lack a 3'-OH group, terminates DNA strand elongation.
Detection: Fragments are separated by size using gel electrophoresis, and the sequence is read from the pattern of terminated fragments.
Applications: Sequencing single genes or small DNA fragments.

Sanger Sequencing Workflow
PCR with fluorescent, chain-terminating ddNTPs
Size separation by capillary gel electrophoresis
Laser excitation and detection by sequencing machine

Next Generation Sequencing (NGS, Second Generation)
NGS technologies, such as Illumina sequencing, enable massively parallel sequencing of millions of short DNA fragments, greatly increasing throughput and reducing cost per base.
Sequencing by Synthesis: Each nucleotide incorporation is detected in real time, allowing for high-throughput data collection.
Applications: Whole genome sequencing, transcriptomics, metagenomics.
Advantages: High throughput, cost-effective, suitable for large-scale studies.

Third Generation Sequencing
Third generation sequencing technologies, such as PacBio and Oxford Nanopore, sequence single DNA molecules in real time, producing long reads that are valuable for resolving complex genomic regions.
Single-Molecule Real-Time (SMRT) Sequencing: DNA polymerase synthesizes DNA in a nanowell, and nucleotide incorporation is detected in real time.
Nanopore Sequencing: DNA passes through a nanopore, and changes in electrical current are used to identify bases.
Advantages: Long read lengths, real-time data, ability to resolve repetitive regions and structural variants.

Comparison of Sequencing Technologies
Generation | Technology | Read Length | Throughput | Applications |
|---|---|---|---|---|
First | Sanger | 500-1,000 bp | Low | Single genes, small fragments |
Second | Illumina NGS | 50-500 bp | High | Whole genomes, transcriptomes |
Third | PACBIO, Nanopore | 10,000+ bp | Very High | Complex genomes, structural variants |
Genomics and Genome Analysis
What is Genomics?
Genomics is the study of the complete set of DNA (the genome) in an organism. It encompasses the structure, function, evolution, and mapping of genomes.
Structural Genomics: Determining the DNA sequence and physical structure of the genome.
Functional Genomics: Assigning biological functions to genomic elements, such as genes and regulatory regions.
Comparative Genomics: Comparing genomes across species to identify conserved and unique features.
Genome Sequencing and Assembly
Whole genome sequencing (WGS) involves sequencing the entire genome and assembling the sequence reads into a complete genome.
DNA Libraries: Collections of DNA fragments representing the genome, used as input for sequencing.
Genome Assembly: Overlapping sequence reads are matched and assembled into contiguous sequences (contigs) and scaffolds.
Reference Genome: A composite genome assembled from multiple individuals, used as a standard for comparison.

The Human Genome Project (HGP)
The Human Genome Project was an international effort to sequence and map all human genes. It provided a reference genome and revealed the genetic basis of many diseases.
Timeline: Draft released in 2000, declared complete in 2003, final gapless assembly in 2022.
Methods: BAC/YAC cloning, Sanger sequencing, later replaced by NGS.
Findings: ~20,000 protein-coding genes, 99.9% similarity across humans, identification of SNPs and CNVs.

Genome Assembly: Contigs, Scaffolds, and Chromosomes
Genome assembly is the process of reconstructing the original genome from sequence reads. Contigs are continuous sequences, scaffolds are ordered sets of contigs, and chromosomes are assembled from scaffolds.
Contig: A contiguous sequence of DNA assembled from overlapping reads.
Scaffold: A series of contigs joined together using additional information, such as genetic or physical maps.
Chromosome: The final, ordered assembly of scaffolds.
Genetic Mapping vs. Physical Mapping
Type | Basis | Unit | Resolution |
|---|---|---|---|
Genetic Map | Recombination frequency | centimorgan (cM) | Low (marker order) |
Physical Map | Physical distance | base pairs (bp, kb, Mb) | High (precise location) |
Multi-Omics Technologies
Introduction to Multi-Omics
Multi-omics integrates data from various molecular levels to provide a comprehensive view of biological systems. The main omics fields include genomics, transcriptomics, epigenomics, proteomics, and metabolomics.
Genomics: Study of the entire DNA sequence.
Transcriptomics: Analysis of all expressed RNA molecules.
Epigenomics: Study of DNA methylation, histone modifications, and chromatin accessibility.
Proteomics: Analysis of all proteins in a cell or tissue.
Metabolomics: Study of metabolites and small molecules involved in metabolism.
Transcriptomics
Transcriptomics involves the global analysis of gene expression, identifying which genes are expressed and at what levels in specific tissues or conditions.
RNA Sequencing (RNA-seq): Uses NGS to quantify and compare gene expression.
Applications: Cancer diagnostics, developmental biology, disease research.
Bulk RNA-Seq: Measures average expression across many cells.
Single-Cell RNA-Seq: Resolves expression at the level of individual cells, revealing heterogeneity.
Epigenomics
Epigenomics studies heritable changes in gene expression that do not involve changes to the DNA sequence, such as DNA methylation and histone modification.
Methods: Whole genome bisulfite sequencing (WGBS), ATAC-seq, ChIP-seq.
Applications: Cancer research, developmental biology, environmental studies.
Proteomics
Proteomics is the large-scale study of proteins, including their expression, structure, and function.
Methods: Liquid chromatography-mass spectrometry (LC-MS/MS), Western blotting, ELISA.
Applications: Disease biomarker discovery, cancer research, functional annotation.
Metagenomics
Metagenomics involves sequencing the genomes of entire microbial communities directly from environmental samples, bypassing the need for culturing.
Applications: Microbiome studies, environmental monitoring, discovery of novel genes and organisms.
Case Study: Human Microbiome Project, NYC Subway Microbiota.
Summary Table: Multi-Omics Technologies
Omics Field | Analyte | Key Methods | Applications |
|---|---|---|---|
Genomics | DNA | WGS, NGS | Genome mapping, disease gene discovery |
Transcriptomics | RNA | RNA-seq | Gene expression profiling, diagnostics |
Epigenomics | Epigenetic marks | WGBS, ATAC-seq, ChIP-seq | Regulation, development, disease |
Proteomics | Proteins | LC-MS/MS | Biomarker discovery, functional studies |
Metabolomics | Metabolites | Mass spectrometry | Metabolic profiling, disease research |