Definition
A haplotype block is a contiguous segment of the genome within which a limited number of distinct haplotypes—specific combinations of alleles at neighboring genetic loci—account for the majority of observed genetic variation in a population. Within a block, historical recombination events are relatively rare, resulting in high linkage disequilibrium (LD) among the constituent single‑nucleotide polymorphisms (SNPs). Consequently, the allelic composition of a block can often be inferred from a subset of representative “tag” SNPs.
Historical Background
The concept of haplotype blocks emerged from early whole‑genome scans of human genetic variation in the early 2000s. Gabriel et al. (2002) demonstrated that the human genome is organized into regions of high LD interspersed with recombination hotspots, proposing a formal definition of blocks based on confidence intervals for the LD measure D′. Subsequent studies, including those by the International HapMap Consortium, refined block detection algorithms and applied the framework to diverse populations.
Structure and Identification
Haplotype blocks are identified using statistical criteria that quantify LD among markers:
- D′‑based methods: Define a block when the lower confidence bound of D′ exceeds a preset threshold (commonly 0.8) for a majority of marker pairs.
- r²‑based methods: Emphasize the predictive power of one SNP for another; blocks may be defined when r² exceeds 0.8 for most pairs.
- Sliding‑window or clustering approaches: Scan the genome with windows of fixed size or variable SNP density to locate regions of uniform LD.
The resolution of block detection depends on marker density, sample size, and the genetic diversity of the studied population. Blocks observed in one ethnic group may be fragmented or merged in another due to differing recombination histories.
Applications
- Tag SNP selection: By genotyping a reduced set of SNPs that capture the haplotype diversity within each block, researchers can cost‑effectively conduct genome‑wide association studies (GWAS).
- Population genetics: Block patterns provide insight into demographic history, population structure, and the effects of natural selection.
- Clinical genetics: Haplotype blocks facilitate the identification of disease‑associated haplotypes and the design of diagnostic panels.
- Pharmacogenomics: Block structure aids in predicting drug response phenotypes linked to specific haplotypes.
Limitations and Controversies
- Population specificity: Block boundaries are not universal; they shift with allele frequencies and recombination landscapes across populations.
- Marker density dependence: Sparse SNP arrays may artificially inflate block size, while dense sequencing data often reveal a more fragmented LD landscape.
- Methodological variability: Different block‑definition algorithms can yield divergent block maps, complicating cross‑study comparisons.
- Biological relevance: In some genomic regions, recombination hotspots are ubiquitous, resulting in minimal block structure, which challenges the utility of the block concept for certain analyses.
Current Research
Recent work integrates whole‑genome sequencing, high‑resolution recombination maps, and statistical phasing to refine block delineation. Machine‑learning approaches are also being explored to predict functional consequences of haplotype variation within blocks.
See also
- Linkage disequilibrium
- Single‑nucleotide polymorphism (SNP)
- Tag SNP
- HapMap Project
- Recombination hotspot
References (representative)
- Gabriel, S. B. et al. (2002). The structure of haplotype blocks in the human genome. Nature Genetics, 31(2), 210–215.
- International HapMap Consortium (2005). A haplotype map of the human genome. Nature, 437(7063), 1299–1320.
- Slatkin, M. (2008). Linkage disequilibrium—understanding the evolutionary past and mapping the future. Nature Reviews Genetics, 9(6), 477–485.