Phylogenetics

Updated on Oct 13, 2024

Edit

Comment

In biology, phylogenetics /ˌfaɪloʊdʒəˈnɛtɪks, -lə-/ (Greek: φυλή, φῦλον - phylé, phylon = tribe, clan, race + γενετικός - genetikós = origin, source, birth) is the study of the evolutionary history and relationships among individuals or groups of organisms (e.g. species, or populations). These relationships are discovered through phylogenetic inference methods that evaluate observed heritable traits, such as DNA sequences or morphology under a model of evolution of these traits. The result of these analyses is a phylogeny (also known as a phylogenetic tree) – a diagrammatic hypothesis about the history of the evolutionary relationships of a group of organisms. The tips of a phylogenetic tree can be living organisms or fossils, and represent the "end," or the present, in an evolutionary lineage. Phylogenetic analyses have become central to understanding biodiversity, evolution, ecology, and genomes.

Taxonomy is the classification, identification and naming of organisms. It is usually richly informed by phylogenetics, but remains a methodologically and logically distinct discipline. The degree to which taxonomies depend on phylogenies (or classification depends on evolutionary development) differs depending on the school of taxonomy: phenetics ignores phylogeny altogether, trying to represent the similarity between organisms instead; cladistics (phylogenetic systematics) tries to reproduce phylogeny in its classification without loss of information; evolutionary taxonomy tries to find a compromise between them.

Construction of a phylogenetic tree

Usual methods of phylogenetic inference involve computational approaches implementing the optimality criteria and methods of parsimony, maximum likelihood (ML), and MCMC-based Bayesian inference. All these depend upon an implicit or explicit mathematical model describing the evolution of characters observed.

Phenetics, popular in the mid-20th century but now largely obsolete, used distance matrix-based methods to construct trees based on overall similarity in morphology or other observable traits (i.e. in the phenotype, not the DNA), which was often assumed to approximate phylogenetic relationships.

Prior to 1990, phylogenetic inferences were generally presented as narrative scenarios. Such methods are often ambiguous and lack explicit criteria for evaluating alternative hypotheses.

History

The term "phylogeny" derives from the German Phylogenie, introduced by Haeckel in 1866, and the Darwinian approach to classification became known as the "phyletic" approach.

Ernst Haeckel's recapitulation theory

During the late 19th century, Ernst Haeckel's recapitulation theory, or "biogenetic fundamental law", was widely accepted. It was often expressed as "ontogeny recapitulates phylogeny", i.e. the development of a single organism during its lifetime, from germ to adult, successively mirrors the adult stages of successive ancestors of the species to which it belongs. But this theory has long been rejected. Instead, ontogeny evolves – the phylogenetic history of a species cannot be read directly from its ontogeny, as Haeckel thought would be possible, but characters from ontogeny can be (and have been) used as data for phylogenetic analyses; the more closely related two species are, the more apomorphies their embryos share.

Timeline of key events

14th century, lex parsimoniae (parsimony principle), William of Ockam, English philosopher, theologian, and Franciscan monk, but the idea actually goes back to Aristotle, precursor concept

1763, Bayesian probability, Rev. Thomas Bayes, precursor concept

18th century, Pierre Simon (Marquis de Laplace), perhaps 1st to use ML (maximum likelihood), precursor concept

1809, evolutionary theory, Philosophie Zoologique, Jean-Baptiste de Lamarck, precursor concept, foreshadowed in the 17th century and 18th century by Voltaire, Descartes, and Leibniz, with Leibniz even proposing evolutionary changes to account for observed gaps suggesting that many species had become extinct, others transformed, and different species that share common traits may have at one time been a single race, also foreshadowed by some early Greek philosophers such as Anaximander in the 6th century BC and the atomists of the 5th century BC, who proposed rudimentary theories of evolution

1837, Darwin's notebooks show an evolutionary tree

1843, distinction between homology and analogy (the latter now referred to as homoplasy), Richard Owen, precursor concept

1858, Paleontologist Heinrich Georg Bronn (1800–1862) published a hypothetical tree to illustrating the paleontological "arrival" of new, similar species following the extinction of an older species. Bronn did not propose a mechanism responsible for such phenomena, precursor concept.

1858, elaboration of evolutionary theory, Darwin and Wallace, also in Origin of Species by Darwin the following year, precursor concept

1866, Ernst Haeckel, first publishes his phylogeny-based evolutionary tree, precursor concept

1893, Dollo's Law of Character State Irreversibility, precursor concept

1912, ML recommended, analyzed, and popularized by Ronald Fisher, precursor concept

1921, Tillyard uses term "phylogenetic" and distinguishes between archaic and specialized characters in his classification system

1940, term "clade" coined by Lucien Cuénot

1949, Jackknife_resampling, Maurice Quenouille (foreshadowed in '46 by Mahalanobis and extended in '58 by Tukey), precursor concept

1950, Willi Hennig's classic formalization

1952, William Wagner's groundplan divergence method

1953, "cladogenesis" coined

1960, "cladistic" coined by Cain and Harrison

1963, 1st attempt to use ML (maximum likelihood) for phylogenetics, Edwards and Cavalli-Sforza

1965

Camin-Sokal parsimony, 1st parsimony (optimization) criterion and 1st computer program/algorithm for cladistic analysis both by Camin and Sokal

character compatibility method, also called clique analysis, introduced independently by Camin and Sokal (loc. cit.) and E. O. Wilson

1966

English translation of Hennig

"cladistics" and "cladogram" coined (Webster's, loc. cit.)

1969

dynamic and successive weighting, James Farris

Wagner parsimony, Kluge and Farris

CI (consistency index), Kluge and Farris

introduction of pairwise compatibility for clique analysis, Le Quesne

1970, Wagner parsimony generalized by Farris

1971

Fitch parsimony, Fitch

NNI (nearest neighbour interchange), 1st branch-swapping search strategy, developed independently by Robinson and Moore et al.

ME (minimum evolution), Kidd and Sgaramella-Zonta (it is unclear if this is the pairwise distance method or related to ML as Edwards and Cavalli-Sforza call ML "minimum evolution".)

1972, Adams consensus, Adams

1974, 1st successful application of ML to phylogenetics (for nucleotide sequences), Neyman

1976, prefix system for ranks, Farris

1977, Dollo parsimony, Farris

1979

Nelson consensus, Nelson

MAST (maximum agreement subtree)((GAS)greatest agreement subtree), a consensus method, Gordon

bootstrap, Bradley Efron, precursor concept

1980, PHYLIP, 1st software package for phylogenetic analysis, Felsenstein

1981

majority consensus, Margush and MacMorris

strict consensus, Sokal and Rohlf

1st computationally efficient ML algorithm, Felsenstein

1982

PHYSIS, Mikevich and Farris

branch and bound, Hendy and Penny

1985

1st cladistic analysis of eukaryotes based on combined phenotypic and genotypic evidence Diana Lipscomb

1st issue of Cladistics

1st phylogenetic application of bootstrap, Felsenstein

1st phylogenetic application of jackknife, Scott Lanyon

1986, MacClade, Maddison and Maddison

1987, neighbor-joining method Saitou and Nei

1988, Hennig86 (version 1.5), Farris

Bremer support (decay index), Bremer

1989

RI (retention index), RCI (rescaled consistency index), Farris

HER (homoplasy excess ratio), Archie

1990

combinable components (semi-strict) consensus, Bremer

SPR (subtree pruning and regrafting), TBR (tree bisection and reconnection), Swofford and Olsen

1991

DDI (data decisiveness index), Goloboff

1st cladistic analysis of eukaryotes based only on phenotypic evidence, Lipscomb

1993, implied weighting Goloboff

1994, reduced consensus: RCC (reduced cladistic consensus) for rooted trees, Wilkinson

1995, reduced consensus RPC (reduced partition consensus) for unrooted trees, Wilkinson

1996, 1st working methods for BI (Bayesian Inference)independently developed by Li, Mau, and Rannala and Yang and all using MCMC (Markov chain-Monte Carlo)

1998, TNT (Tree Analysis Using New Technology), Goloboff, Farris, and Nixon

1999, Winclada, Nixon

2003, symmetrical resampling, Goloboff

References

Phylogenetics Wikipedia

(Text) CC BY-SA

Contents

Construction of a phylogenetic tree

History

Ernst Haeckel's recapitulation theory

Timeline of key events

References