BackGenome-Wide Association Studies (GWAS) and Linkage Mapping in Genetics
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Genome-Wide Association Studies (GWAS) and Linkage Mapping
Introduction to GWAS and Linkage Mapping
Genome-wide association studies (GWAS) and linkage mapping are powerful genetic tools used to identify genetic variants associated with specific traits or diseases. These approaches are central to modern genetics, especially for understanding the genetic basis of complex traits in populations.
GWAS is used to detect associations between genetic variants (usually SNPs) and traits of interest across the entire genome.
Linkage mapping identifies the chromosomal location of genes influencing traits by studying recombination frequencies in families.
Linkage Mapping: Principles and Applications
Linkage mapping relies on the principle that genes or genetic markers that are physically close on a chromosome tend to be inherited together. This method is especially effective for Mendelian (monogenic) traits.
Linkage: The tendency of alleles at different loci to be inherited together due to their physical proximity on a chromosome.
Recombination frequency: Used to estimate the distance between loci; lower recombination frequency indicates closer proximity.
Pedigree analysis: Linkage mapping is often performed in families to track inheritance patterns.
Limitations:
Limited by the number of informative meioses (crosses) in humans.
Pedigree information may be incomplete or incorrect.
Related individuals may share many alleles, reducing mapping resolution.
Most effective for rare, monogenic traits.
Genome-Wide Association Studies (GWAS): Concepts and Methodology
GWAS is a population-based approach to identify genetic variants associated with traits, especially complex (polygenic) traits. It examines the entire genome for statistical associations between SNPs and phenotypes.
GWAS: A method for locating causative genetic variation for a trait of interest by testing the association between SNPs and the trait in a large, unrelated population.
Linkage disequilibrium (LD): Non-random association of alleles at different loci in a population. GWAS relies on LD to detect associations.
Steps of GWAS:
Define the study population.
Determine the phenotype for each individual.
Genotype individuals at many SNPs distributed throughout the genome.
Statistically test for association between each SNP and the trait.
SNPs in LD with causative variants will show statistical association.
Manhattan plot: A graphical representation of GWAS results, showing the statistical significance of associations across the genome.
Linkage Disequilibrium and Its Role in GWAS
Linkage disequilibrium (LD) is central to GWAS, as it allows detection of associations between SNPs and causative variants even if the SNP itself is not causative.
LD occurs when alleles at two loci are inherited together more often than expected by chance.
Physical proximity: LD is strongest between loci that are physically close on the chromosome.
Recombination: Over generations, recombination breaks down LD between distant loci.
Closely linked SNPs remain in LD with causative alleles, while distant SNPs do not.
Comparison: Linkage Mapping vs. GWAS
Feature | Linkage Mapping | GWAS |
|---|---|---|
Population | Families/pedigrees | Unrelated individuals |
Best for | Monogenic (single-gene) traits | Complex (polygenic) traits |
Resolution | Low (large chromosomal regions) | High (can pinpoint small regions) |
Allele frequency | Rare alleles | Common alleles |
GWAS Catalog and Data Interpretation
The GWAS Catalog is a curated resource of published GWAS results, providing information on SNP-trait associations across many studies.
GWAS Catalog: A database of SNP-trait associations from published GWAS studies.
Data interpretation: Manhattan plots and summary tables are used to visualize and interpret GWAS findings.
Example: GWAS has been used to identify genetic variants associated with severe COVID-19 outcomes.
Key Terms and Definitions
Genome-wide association study (GWAS): A study that scans the genome for SNPs associated with a trait.
Linkage disequilibrium (LD): The non-random association of alleles at different loci.
Manhattan plot: A plot used to display GWAS results, with each point representing a SNP and its association p-value.
Equations and Statistical Concepts
Recombination frequency (r):
Statistical association (p-value): The probability that the observed association between a SNP and a trait is due to chance.
Applications and Limitations
Applications:
Identifying genetic risk factors for diseases.
Understanding the genetic architecture of complex traits.
Informing personalized medicine approaches.
Limitations:
GWAS often identifies SNPs in LD with causative variants, not the causative variants themselves.
Further functional studies are needed to confirm causality.
GWAS is less effective for rare variants or traits with low heritability.
Summary Table: GWAS vs. Linkage Mapping
Method | Best for | Population | Resolution | Allele Frequency |
|---|---|---|---|---|
Linkage Mapping | Monogenic traits | Families | Low | Rare |
GWAS | Complex traits | Unrelated individuals | High | Common |
Additional info:
GWAS is based on the "common disease, common variant" hypothesis, which posits that common diseases are influenced by common genetic variants, each with a small effect on phenotype.
Manhattan plots are named for their resemblance to the Manhattan skyline, with tall peaks indicating strong associations.