Genetic drift and neutral evolution

Overview

Genetic drift, the random fluctuation of allele frequencies in finite populations, was formalised by Sewall Wright in the 1930s and recognised as a fundamental mechanism of evolution that operates independently of natural selection.
Motoo Kimura's neutral theory of molecular evolution (1968) and Tomoko Ohta's nearly neutral extension (1973) demonstrated that the majority of DNA sequence changes fixed during evolution are selectively neutral or nearly so, with their fate governed by drift rather than adaptation.
Genomic evidence from synonymous substitution rates, pseudogene evolution, the molecular clock, and coalescent analyses confirms that neutral processes dominate molecular evolution, while natural selection remains the primary architect of phenotypic adaptation — a synthesis that now underpins modern evolutionary genetics.

Genetic drift and neutral evolution are among the most important conceptual developments in the modern understanding of how genomes change over time. While natural selection acts as the primary creative force shaping phenotypic adaptation, a vast body of theoretical and empirical evidence demonstrates that most evolutionary change at the molecular level — the substitutions, insertions, and deletions that accumulate in DNA sequences across lineages — is driven not by adaptation but by the random sampling of selectively neutral or nearly neutral variants in finite populations.^{2, 4} This insight, formalised by Motoo Kimura in 1968 and subsequently refined by Tomoko Ohta and others, fundamentally transformed evolutionary biology by establishing a rigorous null hypothesis against which the action of natural selection can be tested, providing the theoretical basis for the molecular clock, and revealing that chance plays a far larger role in genomic evolution than the architects of the Modern Synthesis had envisioned.^{2, 5, 13}

Historical origins of the drift concept

Three panels of line graphs showing allele frequency simulations over 50 generations in populations of 20, 200, and 2000 individuals, demonstrating that smaller populations show larger random fluctuations — Ten simulations each of allele frequency change over 50 generations in populations of size 20 (top), 200 (middle), and 2,000 (bottom). In small populations, random sampling causes dramatic fluctuations—many alleles reach fixation or are lost entirely within a few dozen generations. In large populations, frequencies barely deviate from the starting point. This size-dependence is the core prediction of Sewall Wright's mathematical theory of genetic drift. Professor marginalia, Wikimedia Commons, CC BY-SA 3.0

The theoretical foundations of genetic drift were laid in the 1930s by Sewall Wright, one of the three architects (alongside R. A. Fisher and J. B. S. Haldane) of the mathematical framework of population genetics. In his landmark 1931 paper "Evolution in Mendelian Populations," Wright demonstrated that in any finite population, allele frequencies will fluctuate from generation to generation due to the random sampling of gametes during reproduction, a process he initially termed "random drift" or the "Sewall Wright effect."¹ Wright showed that the magnitude of these fluctuations is inversely proportional to the population size: in large populations, random sampling produces negligible changes in allele frequency per generation, but in small populations the fluctuations can be substantial enough to drive alleles to fixation or extinction regardless of their selective value.^{1, 11}

Wright's work established two results of lasting importance. First, the probability that a new neutral mutation will eventually reach fixation in a diploid population of effective size N_e is 1/(2N_e), because the mutation initially exists as a single copy among 2N_e total alleles at its locus. Second, the expected time for a neutral allele to reach fixation, conditional on its eventual fixation, is approximately 4N_e generations.^{1, 11} These seemingly simple results would prove central to the neutral theory proposed three decades later. For most of the mid-twentieth century, however, the dominant view was that natural selection was the overwhelmingly important force in evolution, and drift was regarded as a minor factor relevant only to unusually small or isolated populations.⁴

The neutral theory of molecular evolution

The intellectual landscape shifted dramatically in 1968, when Motoo Kimura published a brief but revolutionary paper (see neutral theory of molecular evolution for a detailed treatment) in Nature titled "Evolutionary rate at the molecular level."² Kimura began with a quantitative observation: comparisons of protein sequences between mammals and other vertebrates, made possible by the new techniques of protein sequencing, indicated that amino acid substitutions were accumulating at a rate of approximately one per polypeptide per 28 million years across the entire genome. Extrapolating to the full genome, Kimura estimated that nucleotide substitutions were occurring at a rate of approximately one every two years in mammalian lineages.² This rate was far too high to be explained by positive natural selection, because each selective substitution imposes a "cost" in the form of selective deaths (a concept first quantified by Haldane in 1957 as the "cost of natural selection"), and the cumulative cost of fixing beneficial alleles at such a pace would require an implausibly high reproductive surplus.^{2, 6}

Kimura's solution was elegant: the vast majority of these substitutions must be selectively neutral, having no appreciable effect on the fitness of the organism. If neutral mutations arise at a rate of μ per individual per generation, and each has a fixation probability of 1/(2N_e) in a diploid population, then the total rate of neutral substitution is simply 2N_eμ × 1/(2N_e) = μ. The population size cancels, and the rate of neutral molecular evolution equals the mutation rate, independent of population size.^{2, 4} This prediction is both counterintuitive and powerful: it means that neutral sequences should accumulate substitutions at a roughly constant rate per unit time, providing a theoretical foundation for the molecular clock first observed empirically by Zuckerkandl and Pauling in comparisons of haemoglobin sequences across vertebrates.^{7, 13}

The following year, King and Jukes independently marshalled a wide range of evidence for what they provocatively titled "Non-Darwinian Evolution," arguing that the redundancy of the genetic code, the prevalence of conservative amino acid substitutions, and the roughly constant rate of protein evolution across lineages all pointed to neutral drift as the dominant mode of molecular change.³ Kimura expanded his arguments into a comprehensive monograph in 1983, incorporating data from protein electrophoresis, DNA sequence comparisons, and the emerging field of molecular phylogenetics to build the case that neutral drift, not adaptive selection, accounts for the majority of molecular variation both within and between species.⁴

The empirical puzzle that motivated neutralism

The neutral theory did not emerge from abstract theorising alone; it was a direct response to empirical observations that could not be reconciled with the prevailing selectionist framework. In 1966, Lewontin and Hubby used protein gel electrophoresis to survey genetic variation at multiple enzyme loci in natural populations of Drosophila pseudoobscura and discovered that an unexpectedly high proportion of loci — roughly 30 percent — were polymorphic, with individual flies heterozygous at approximately 12 percent of their genes.⁸ This level of variation was difficult to explain under the classical "balance" model of selection, which posited that most genetic variation is maintained by balancing selection (such as heterozygote advantage), because maintaining so many polymorphisms simultaneously by selection would impose an enormous segregation load on the population.^{4, 8}

Polyacrylamide protein gel showing bands of proteins separated by molecular weight in multiple lanes — A polyacrylamide gel after protein electrophoresis, showing protein bands separated by molecular weight across multiple sample lanes. This technique — used by Lewontin and Hubby in 1966 to reveal unexpectedly high levels of protein polymorphism in *Drosophila pseudoobscura* — provided the empirical puzzle that motivated Kimura's neutral theory: most of the variation turned out to be selectively neutral rather than maintained by balancing selection. optimal tweezers, Wikimedia Commons, CC BY 2.0

Kimura and Ohta argued that most of this protein polymorphism represents transient neutral variation on its way to fixation or loss by drift, rather than alleles maintained by balancing selection.⁹ Under the neutral theory, the expected level of heterozygosity at a neutral locus is determined by the product of the effective population size and the mutation rate (approximately 4N_eμ for a diploid locus), and the observed levels of protein polymorphism fell within the range predicted by plausible values of these parameters.^{4, 9} The debate between selectionists and neutralists over the causes of protein polymorphism dominated population genetics throughout the 1970s and 1980s and proved extraordinarily productive, stimulating the development of new statistical tests and theoretical models that remain in use today.^{4, 20}

The nearly neutral theory

In 1973, Tomoko Ohta refined the neutral theory by recognising that many mutations are not strictly neutral but instead have selection coefficients so small that their evolutionary fate is governed primarily by drift rather than by selection. Ohta proposed that a substantial fraction of molecular mutations are slightly deleterious, with fitness effects too weak to be efficiently removed by purifying selection in small populations but strong enough to be purged in large ones.⁵ The critical parameter is the product of the effective population size and the selection coefficient, N_es. When |N_es| is much less than 1, selection is too weak to overcome the stochastic noise of drift, and the mutation behaves as if it were neutral. When |N_es| is much greater than 1, selection efficiently determines the mutation's fate.^{5, 10}

The nearly neutral theory makes several distinctive predictions. Species with small effective population sizes should accumulate slightly deleterious mutations at higher rates, because the threshold of "effective neutrality" expands as N_e shrinks. This in turn predicts that the molecular clock should tick slightly faster in lineages with smaller N_e, at least for functionally constrained sequences where slightly deleterious mutations are common.^{5, 10} Both predictions have been confirmed by comparative genomic studies. Species with small effective population sizes, such as primates and island-dwelling vertebrates, tend to exhibit higher ratios of nonsynonymous to synonymous substitutions (dN/dS), indicating that a greater proportion of amino acid changes have reached fixation in these lineages — consistent with the fixation of slightly deleterious mutations by drift.^{10, 23}

The nearly neutral theory thus provides a more nuanced framework than strict neutrality, bridging the gap between purely neutral drift and deterministic selection by recognising a continuous spectrum of mutational effects. It also accounts for patterns that the original neutral theory could not fully explain, including the observation that the molecular clock is not perfectly constant across lineages and that the rate of molecular evolution varies systematically with effective population size.^{5, 13}

Effective population size and the drift-selection boundary

The concept of effective population size (N_e), introduced by Wright in 1931 and subsequently elaborated by Crow and Kimura, is the single most important parameter in determining the relative influence of drift and selection on molecular evolution.^{1, 11} N_e is defined as the size of an idealised Wright-Fisher population that would experience the same rate of genetic drift as the actual population under study. In real populations, N_e is almost always substantially smaller than the census population size (N) because of fluctuations in population size over time, unequal sex ratios, variance in reproductive success, and population structure.¹⁰

The consequences of variation in N_e for genome evolution are profound. Michael Lynch has argued that many features of eukaryotic genome architecture — including the proliferation of introns, transposable elements, and noncoding DNA in organisms with small N_e — are not adaptive innovations but rather the consequence of weakened purifying selection in lineages where drift is the dominant evolutionary force.²³ Bacteria and other prokaryotes, which typically maintain effective population sizes orders of magnitude larger than those of multicellular eukaryotes, have correspondingly streamlined genomes with minimal noncoding DNA, because purifying selection in large populations efficiently removes insertions and duplications that would slightly reduce fitness. In contrast, the large, intron-rich genomes of vertebrates and flowering plants reflect the inability of purifying selection to prevent the accumulation of mildly deleterious genomic elements when N_e is small.^{10, 23}

Approximate effective population sizes across diverse taxa^{10, 23}

Organism	Approx. N_e	Genome size (Mb)	Drift dominance
Escherichia coli	~25,000,000	4.6	Low — selection highly efficient
Drosophila melanogaster	~1,000,000	180	Moderate — most coding changes selected
Caenorhabditis elegans	~80,000	100	Moderate
Mouse (Mus musculus)	~500,000	2,700	Moderate
Human (Homo sapiens)	~10,000–15,000	3,200	High — many slightly deleterious mutations fixed
Chimpanzee (Pan troglodytes)	~25,000–50,000	3,300	High

The table illustrates the striking inverse relationship between effective population size and genome complexity. Organisms with very large N_e, such as bacteria, maintain compact genomes under strong purifying selection, while those with small N_e, such as humans, harbour vastly larger genomes laden with noncoding sequences whose persistence reflects the diminished efficiency of selection relative to drift.^{10, 23}

The molecular clock as evidence for neutrality

One of the most striking predictions of the neutral theory is that neutral sequences should evolve at an approximately constant rate per unit time, because the substitution rate equals the mutation rate and is independent of population size. This prediction aligns with the empirical observation, first noted by Zuckerkandl and Pauling in 1965, that the number of amino acid differences between homologous proteins in different species is roughly proportional to the time since those species last shared a common ancestor.⁷ This regularity, termed the molecular clock, has become one of the most widely used tools in evolutionary biology for estimating divergence times between lineages from DNA or protein sequence data.¹³

The approximate constancy of the molecular clock is difficult to explain under a purely selectionist model, because the rate of adaptive substitution would be expected to vary dramatically across lineages in response to differing environmental pressures, population sizes, and ecological circumstances. Under the neutral theory, by contrast, the clock's regularity follows naturally from the fact that the neutral substitution rate depends only on the mutation rate per individual, which is determined by the biochemistry of DNA replication and repair and is therefore relatively (though not perfectly) constant across related lineages.^{2, 4, 13}

The molecular clock is not perfectly metronomic. Rates of molecular evolution vary among lineages due to differences in generation time, metabolic rate, and the efficiency of DNA repair, and they vary among genes due to differences in functional constraint.¹³ These departures from strict clock-like behaviour are accommodated by "relaxed clock" methods in modern molecular phylogenetics, but the approximate regularity of the clock across diverse lineages and genes remains one of the strongest pieces of evidence that neutral drift dominates molecular evolution.^{13, 20}

Genomic evidence for neutral evolution

Phylogenetic tree of animals and their unicellular relatives with a geological timeline showing early animal evolution — A phylogenetic timetree of animals and their unicellular relatives. The molecular clock, grounded in the neutral theory, enables such dated phylogenies by treating selectively neutral substitutions as a consistent temporal signal. Differences in substitution rates across branches reflect variation in effective population size and functional constraint. Ros-Rocher et al., Wikimedia Commons, CC BY 4.0

The advent of genome sequencing has provided overwhelming evidence that neutral processes dominate the evolution of DNA sequences. Several lines of genomic evidence are particularly compelling. First, synonymous substitutions — nucleotide changes in protein-coding genes that do not alter the encoded amino acid due to the redundancy of the genetic code — accumulate at rates substantially higher than nonsynonymous substitutions (those that do change the amino acid). Li, Wu, and Luo developed methods for estimating synonymous (dS) and nonsynonymous (dN) substitution rates and showed that across a large sample of mammalian genes, dS typically exceeds dN by a factor of three to ten, indicating that purifying selection constrains amino acid changes while synonymous sites evolve in a largely neutral fashion.¹⁴

Second, pseudogenes — genomic sequences that originated as functional gene copies but have been inactivated by disabling mutations — evolve at rates essentially equal to the neutral mutation rate, because they are free from selective constraint. Nachman and Crowell used comparisons of processed pseudogenes between humans and chimpanzees to estimate the human mutation rate at approximately 2.5 × 10⁻⁸ per nucleotide site per generation, a value that closely matches the substitution rate at other putatively neutral sites such as intergenic regions and fourfold degenerate coding positions.¹⁵ The concordance between pseudogene evolution rates and synonymous substitution rates provides direct confirmation that these sequences are evolving neutrally by drift.^{4, 15}

Third, the sheer quantity of noncoding DNA in eukaryotic genomes supports the prediction that drift, rather than selection, shapes most of the genome. The human genome comprises approximately 3.2 billion base pairs, yet only about 1.5 percent encodes proteins.²⁵ The ENCODE project reported in 2012 that up to 80 percent of the genome shows some form of "biochemical activity," but evolutionary analyses using the neutral theory as a framework suggest that the functionally constrained fraction is far smaller. Graur estimated an upper bound of approximately 10 to 25 percent for the fraction of the human genome under purifying selection, implying that the majority of the genome evolves under effectively neutral conditions.^{24, 25}

Coalescent theory and neutral genealogies

In 1982, John Kingman introduced the coalescent, a mathematical framework that describes the genealogical relationships among a sample of gene copies drawn from a population, looking backward in time.¹² Rather than tracking allele frequencies forward through generations (as in the classical Wright-Fisher model), the coalescent traces lineages backward until they merge, or "coalesce," into common ancestors. Under the assumption of selective neutrality, the coalescent provides a complete probabilistic description of the genealogy of a sample, with the rate of coalescence determined by the effective population size.¹²

The coalescent has become the dominant analytical framework in population genetics because it provides computationally efficient methods for inferring population history from DNA sequence data. Under the standard neutral coalescent, the expected time to the most recent common ancestor of two gene copies is 2N_e generations (in a diploid population), and the total expected tree length for a sample of n sequences can be computed analytically.^{12, 11} The patterns of genetic variation predicted by the neutral coalescent — including the distribution of pairwise differences, the frequency spectrum of segregating sites, and the pattern of linkage disequilibrium — serve as the null expectations against which signatures of natural selection are tested.^{12, 18}

Fumio Tajima exploited the predictions of the neutral coalescent to develop his widely used test of neutrality in 1989. Tajima's D statistic compares two estimators of the population-scaled mutation parameter θ = 4N_eμ: one based on the number of segregating sites in a sample and another based on the average number of pairwise nucleotide differences. Under neutral equilibrium, these estimators should converge, and significant deviations indicate either the action of natural selection or departures from demographic equilibrium (such as population expansion or contraction).¹⁸ The neutral coalescent thus serves not only as a model of drift-driven evolution but also as the essential baseline from which non-neutral forces are detected.

Statistical tests of neutrality and selection

The neutral theory's greatest methodological contribution has been the establishment of a rigorous null hypothesis for molecular evolution. If most molecular variation is neutral, then deviations from neutral predictions provide evidence for the action of selection. Over the past four decades, population geneticists have developed an arsenal of statistical tests designed to detect such deviations, each exploiting different predictions of the neutral model.^{4, 20}

The Hudson-Kreitman-Aguadé (HKA) test, introduced in 1987, compares levels of within-species polymorphism and between-species divergence across multiple loci. Under neutrality, loci with high divergence should also exhibit high polymorphism, because both are proportional to the neutral mutation rate. Deviations from this expectation at a particular locus suggest the action of selection: excess divergence relative to polymorphism implicates directional selection driving substitutions, while excess polymorphism relative to divergence may indicate balancing selection.¹⁷

The McDonald-Kreitman (MK) test, published in 1991, compares the ratio of nonsynonymous to synonymous changes within species (polymorphism) to the same ratio between species (divergence). Under strict neutrality, these ratios should be equal. McDonald and Kreitman applied their test to the Alcohol dehydrogenase (Adh) locus in Drosophila and found a significant excess of nonsynonymous fixed differences between species relative to the expectation from polymorphism data, providing evidence that some amino acid substitutions at this locus were driven by positive selection.¹⁶ The MK test has since become one of the most widely used methods for quantifying the proportion of adaptive amino acid substitutions across the genome.^{16, 20}

These tests, together with Tajima's D, the Fu and Li statistics, and their many extensions, have collectively demonstrated that while neutral drift dominates the evolution of most genomic sites, natural selection has left detectable signatures at a subset of loci involved in adaptation, immune function, reproduction, and sensory perception.^{18, 19, 20}

The modern synthesis of selectionism and neutralism

The debate between selectionists and neutralists, which dominated evolutionary genetics from the late 1960s through the 1990s, has largely resolved into a modern consensus that integrates the core insights of both positions. The consensus holds that most nucleotide substitutions fixed during the divergence of species are neutral or nearly neutral, that the majority of within-species polymorphism at the DNA level is likewise selectively inconsequential, and that the neutral theory provides the appropriate null model for molecular evolutionary analysis.^{4, 20, 22} At the same time, natural selection is recognised as the primary force responsible for phenotypic adaptation, and a nontrivial fraction of amino acid substitutions — perhaps 10 to 50 percent in Drosophila and other organisms with large N_e, and a smaller fraction in mammals — are driven by positive selection.^{16, 19}

This synthesis was not reached without controversy. In 2018, Kern and Hahn argued that the neutral theory should be "firmly rejected" as a universal description of genome evolution, citing evidence for widespread linked selection (both selective sweeps and background selection) affecting patterns of variation across the genome.²¹ In a detailed response, Jensen and twenty-two co-authors defended the continuing centrality of the neutral theory, arguing that its value lies not as a literal description of every nucleotide but as an indispensable null model and analytical framework without which the detection and quantification of selection would be impossible.²² The distinction is important: the neutral theory need not claim that every single nucleotide is neutral to remain scientifically essential. Its power lies in providing the expected baseline against which the signature of selection is measured.²²

Eyre-Walker and Keightley's comprehensive review of the distribution of fitness effects (DFE) of new mutations has further refined understanding of the drift-selection boundary. Their analysis indicates that a large fraction of new nonsynonymous mutations are effectively neutral or only mildly deleterious in organisms with moderate to small effective population sizes, while a tail of strongly deleterious mutations is efficiently purged by selection and a small fraction of beneficial mutations drives adaptive evolution.¹⁹ The shape of the DFE varies across species and across functional categories within genomes, but in all cases examined, neutral and nearly neutral mutations constitute the majority of new variants, consistent with the core prediction of the neutral and nearly neutral theories.^{19, 20}

The modern understanding of molecular evolution is therefore neither purely selectionist nor purely neutralist. It is a quantitative framework in which the effective population size determines which mutations are "visible" to selection and which are effectively neutral, in which drift and selection jointly shape the genetic composition of populations, and in which the neutral theory serves as the foundational null model against which all claims of adaptive evolution must be tested.^{4, 10, 20, 22} The recognition that chance — embodied in the stochastic sampling of alleles from generation to generation — is not merely a nuisance but a central and pervasive force in the history of life stands as one of the most important intellectual achievements of twentieth-century biology.^{2, 4}

References

Evolution in Mendelian populations

Wright, S. · Genetics 16: 97–159, 1931