Citizendia
Your Ad Here

Part of the Biology series on
Evolution
Mechanisms and processes

Adaptation
Genetic drift
Gene flow
Mutation
Natural selection
Speciation

Research and history

Evidence
Evolutionary history of life
History
Modern synthesis
Social effect / Objections

Evolutionary biology fields

Cladistics
Ecological genetics
Evolutionary development
Human evolution
Molecular evolution
Phylogenetics
Population genetics

Biology Portal · v  d  e 

Cladistics is the hierarchical classification of species based on evolutionary ancestry. Foundations of modern biology There are five unifying principles eVolution is the third Album by eLDee, it was due to be released in 2008 An adaptation is a characteristic of an Organism that has been favored by Natural selection and In Population genetics, genetic drift is the accumulation of random events that change the makeup of a gene pool slightly but often compound over time In Population genetics, gene flow (also known as gene migration) is the transfer of Alleles of Genes from one Population to another In biology mutations are changes to the Nucleotide sequence of the Genetic material of an organism Natural selection is the process by which favorable Heritable traits become more common in successive Generations of a Population of Speciation is the Evolutionary process by which new biological Species arise The wide range of evidence of common descent of living things strongly indicates the occurrence of Evolution and provides a wealth of information on the natural processes Although evidence of early Life is scarce and often difficult to interpret it appears that life appeared on Earth relatively soon (on the Geologic time scale) after Evolutionary thought, the idea that species change over time has roots in antiquity in the ideas of the Greeks, Romans, Chinese and Muslims theory of transmutation had early origins in the speculations and hypotheses of Erasmus Darwin, and Jean-Baptiste Lamarck. Objections to evolution have been raised ever since various evolutionary ideas came to prominence around the start of the nineteenth century Ecological genetics is the study of Genetics in the context of the interactions among organisms and between the organisms and their environment Evolutionary developmental biology ( evolution of development or informally evo-devo) is a field of Biology that compares the developmental processes Human evolution, or anthropogenesis, is the part of biological Evolution concerning the emergence of Homo sapiens as a distinct Species Molecular evolution is the process of evolution at the scale of DNA, RNA, and Proteins Molecular evolution emerged as a scientific field in the 1960s as Population genetics is the study of the Allele frequency distribution and change under the influence of the four evolutionary forces Natural selection, Genetic In Biology, a species is one of the basic units of Biological classification and a Taxonomic rank. Cladistics is distinguished from other taxonomic classification systems because it focuses on evolution rather than similarities between species, and because it places heavy emphasis on objective, quantitative analysis. Taxonomy is the practice and science of classification The word comes from the Greek, taxis (meaning 'order' 'arrangement' and, nomos eVolution is the third Album by eLDee, it was due to be released in 2008 Cladistics generates diagrams called cladograms that represent the evolutionary tree of life. See also Tree of life (disambiguation for other meanings of the Tree of Life DNA and RNA sequencing data are used in many important cladistic efforts. Deoxyribonucleic acid ( DNA) is a Nucleic acid that contains the genetic instructions used in the development and functioning of all known Ribonucleic acid ( RNA) is a Nucleic acid that consists of a long chain of Nucleotide units Computer programs are widely used in cladistics, due to the highly complex nature of cladogram generation procedures. Computer programs (also software programs, or just programs) are instructions for a Computer. Computational phylogenetics is the application of computational Algorithms methods and programs to phylogenetic analyses Cladistics originated in the work of the German entomologist, Willi Hennig, who himself referred to it as phylogenetic systematics; the use of the terms "cladistics" and "clade" was popularized by other researchers. Entomology (from Greek grc ἔντομος entomos, "that which is cut in pieces or engraved/segmented" hence "insect" and grc -λογία Emil Hans Willi Hennig ( April 20, 1913 in Dürrhennersdorf/ Oberlausitz – November 5, 1976 in Ludwigsburg) was a [1] The term phylogenetics is often used synonymously with cladistics. Cladistics originated in the field of biology but in recent years has found application in other disciplines. Foundations of modern biology There are five unifying principles The word cladistics is derived from the ancient Greek κλάδος, klados, "branch. The Ancient Greek language is the historical stage in the development of the Hellenic language family spanning the Archaic (c "

Contents

Cladograms

Cladogram showing the evolutionary relationships of various insect groups.
Cladogram showing the evolutionary relationships of various insect groups.
Cladograms are trees in the graph theoretic sense.
Cladograms are trees in the graph theoretic sense. In Graph theory, a tree is a graph in which any two vertices are connected by exactly one path. In Mathematics and Computer science, graph theory is the study of graphs: mathematical structures used to model pairwise relations between objects
A cladogram showing how Eukaryota and Archaea are more closely related to each other than to Bacteria.  Note the 3-way fork in the middle of the cladogram.
A cladogram showing how Eukaryota and Archaea are more closely related to each other than to Bacteria. Animals Plants fungi, and Protists are eukaryotes (juːˈkærɪɒt or -oʊt Organisms whose cells are organized into complex The Bacteria ( singular: bacterium) are a large group of unicellular Microorganisms Typically a few Micrometres in length bacteria have Note the 3-way fork in the middle of the cladogram.

The starting point of cladistic analysis is a group of species and molecular, morphological, or other data characterizing those species. The end result is a tree-like relationship diagram called a cladogram. In Graph theory, a tree is a graph in which any two vertices are connected by exactly one path. [2] The cladogram graphically represents a hypothetical evolutionary process. Cladograms are subject to revision as additional data become available.

Synonyms — The terms evolutionary tree, and sometimes phylogenetic tree are often used synonymously with cladogram,[3] but others treat phylogenetic tree as a broader term that includes trees generated with a nonevolutionary emphasis. A phylogenetic tree, also called an evolutionary tree, is a tree showing the Evolutionary relationships among various biological Species or other A phylogenetic tree, also called an evolutionary tree, is a tree showing the Evolutionary relationships among various biological Species or other

Subtrees are Clades — In cladograms, all organisms lie at the leaves. [4] The two taxa on either side of a split are called sister taxa or sister groups. A taxon (plural taxa) or taxonomic unit, is a name designating an organism or a group of Organisms In Biological nomenclature according to Each subtree, whether it contains one item or a hundred thousand items, is called a clade. A clade is a taxonomic group comprising a single Common ancestor and all the descendants of that ancestor

2-Way versus 3-Way Forks — Many cladists require that all forks in a cladogram be 2-way forks. Some cladograms include 3-way or 4-way forks when the data is insufficient to resolve the forking to a higher level of detail. See phylogenetic tree for more information about forking choices in trees. A phylogenetic tree, also called an evolutionary tree, is a tree showing the Evolutionary relationships among various biological Species or other

Depth of a Cladogram — If a cladogram represents N species, the number of levels (the "depth") in the cladogram is on the order of log2(N). [5] For example, if there are 32 species of deer, a cladogram representing deer will be around 5 levels deep (because 25 = 32). A deer is a Ruminant Mammal belonging to the family Cervidae. A cladogram representing the complete tree of life, with about 10 million species, would be about 23 levels deep. This formula gives a lower limit: in most cases the actual depth will be a larger value because the various branches of the cladogram will not be uniformly deep. Conversely, the depth may be shallower if forks larger than 2-way forks are permitted.

Number of Distinct Cladograms — For a given set of species, the number of distinct cladograms that can be drawn (ignoring which cladogram best matches the species characteristics) is:[6]

Number of Species 2 3 4 5 6 7 8 9 10 N
Number of Cladograms 1 3 15 105 945 10,395 135,135 2,027,025 34,459,425 1*3*5*7*. . . *(2N-3)

This exponential growth of the number of possible cladograms explains why manual creation of cladograms becomes very difficult when the number of species is large.

Extinct Species in Cladograms — Cladistics makes no distinction between extinct and nonextinct species,[7] and it is appropriate to include extinct species in the group of organisms being analyzed. Cladograms that are based on DNA/RNA generally do not include extinct species because DNA/RNA samples from extinct species are rare. Cladograms based on morphology, especially morphological characteristics that are preserved in fossils, are more likely to include extinct species.

Time Scale of a Cladogram — A cladogram tree has an implicit time axis,[8] with time running forward from the base of the tree to the leaves of the tree. If the approximate date (for example, expressed as millions of years ago) of all the evolutionary forks were known, those dates could be captured in the cladogram. Thus, the time axis of the cladogram could be assigned a time scale (e. g. 1 cm = 1 million years), and the forks of the tree could be graphically located along the time axis. Such cladograms are called scaled cladograms. Many cladograms are not scaled along the time axis, for a variety of reasons:

Cladistics compared with traditional taxonomy

A highly resolved, automatically generated tree of life based on completely sequenced genomes
A highly resolved, automatically generated tree of life based on completely sequenced genomes[10]

Prior to the advent of cladistics, most taxonomists used Linnaean taxonomy and later Evolutionary taxonomy to organize life forms. See also Tree of life (disambiguation for other meanings of the Tree of Life Linnaean taxonomy is a method of classifying living things originally devised by (and named for Carolus Linnaeus, although it has changed considerably since his time Evolutionary Taxonomy or evolutionary systematics seeks to classify Organisms using a combination of Phylogenetic relationship and overall similarity These traditional approaches, still in use by some researchers (especially in works intended for a more general audience[11]) use several fixed levels of a hierarchy, such as kingdom, phylum, class, order, and family. A phylum ( Plural: phyla) is a Taxonomic rank between Kingdom and above Class. A class is the Taxonomic rank in the Biological classification of organisms in Biology below phylum and above order. This article is about the taxonomic rank for the sequence of species in a taxonomic list see Taxonomic order In scientific classification used In Biological classification, family ( Latin Cladistics does not use those terms, because one of the fundamental premises of cladistics is that the evolutionary tree is so deep and so complex that it is inadvisable to set a fixed number of levels.

Evolutionary taxonomy insists that groups reflect phylogenies. In contrast, Linnean taxonomy allows both monophyletic and paraphyletic groups as taxa. A taxon (plural taxa) or taxonomic unit, is a name designating an organism or a group of Organisms In Biological nomenclature according to Since the early 20th century, Linnaean taxonomists have generally attempted to make genus-level and lower-level taxa monophyletic. A genus (plural genera from Γένος Latin genus "descent family type gender" is a low-level Taxonomic Ernst Mayr drew a distinction between the terms cladistics and phylogeny, using the term cladistics to refer to classifications which only take into account genealogy, as opposed to phylogeny, which had previously been used in a broader sense to refer to the combination of genealogy and amount of divergence from an ancestor (i. Ernst Walter Mayr ( July 5, 1904, Kempten, Germany &ndash February 3, 2005, Bedford Massachusetts U Genealogy (from Greek: el γενεά el-Latn genea, "descent" and el λόγος el-Latn logos, "knowledge" is the study of e. Evolutionary taxonomy). Mayr wrote, in 1985:

It would seem to me to be quite evident that the two concepts of phylogeny (and their role in the construction of classifications) are sufficiently different to require terminological distinction. The term phylogeny should be retained for the broad concept of phylogeny, promoted by Darwin and adopted by most students of phylogeny in the ensuing 90 years. Charles Robert Darwin (February 12 1809 &ndash April 19 1882 was an English naturalist, who realised and demonstrated that all Species of life The concept of phylogeny as mere genealogy should be terminologically distinguished as cladistics. To lump the two concepts together terminologically could not help but produce harmful equivocation. [12]

Willi Hennig's pioneering work provoked a spirited debate[13] about the relative merits of cladistics versus traditional taxonomy which has continued down to the present. Emil Hans Willi Hennig ( April 20, 1913 in Dürrhennersdorf/ Oberlausitz – November 5, 1976 in Ludwigsburg) was a [14] Some of the debates that the cladists engaged in had been running since the 19th century, but they entered these debates with a new fervor,[15] as can be seen from the Foreword to Hennig (1979) by Rosen, Nelson, and Patterson:

Encumbered with vague and slippery ideas about adaptation, fitness, biological species and natural selection, neo-Darwinism (summed up in the "evolutionary" systematics of Mayr and Simpson) not only lacked a definable investigatory method, but came to depend, both for evolutionary interpretation and classification, on consensus or authority. (Foreword, page ix)

Cladistics strictly and exclusively follows phylogeny, and has arbitrarily deep trees with binary branching: each taxon is a clade. Linnaean taxonomy, while following phylogeny, also subjectively considers morphology, and has a fixed hierarchy, whose taxa are not always clades. The term morphology in Biology refers to the outward appearance ( Shape, Structure, Colour, Pattern) of an Organism

Example: Tetrapoda

For example, Linnaean taxonomy contains the taxon Tetrapoda, defined morphologically as vertebrates with four limbs (as well as animals with four-limbed ancestors, such as snakes), which is often given the rank of superclass, and divides into the classes Amphibia, Reptilia, Aves, Mammalia, and some extinct families. Tetrapods ( Greek τετραποδη tetrapoda, Latin Quadruped, "four-footed" are Vertebrate Animals Vertebrates are members of the Subphylum Vertebrata, Chordates with backbones or spinal columns The grouping sometimes includes A class is the Taxonomic rank in the Biological classification of organisms in Biology below phylum and above order. A class is the Taxonomic rank in the Biological classification of organisms in Biology below phylum and above order. Prehistoric amphibian Amphibians (class Amphibia such as Frogs Toads Salamanders Newts Gymnophiona, Sirens and Reptiles, or members of the class Reptilia are air-breathing Cold-blooded Vertebrates that have skin covered in scales as opposed to hair or feathers Birds ( class Aves) are bipedal endothermic ( Warm-blooded) Vertebrate animals that lay eggs. Mammals ( class Mammalia) are a class of Vertebrate Animals characterized by the presence of Sweat glands, including sweat glands In Biological classification, family ( Latin

Cladistics also contains the taxon Tetrapoda, whose living members can be classified phylogenically as "the clade defined by the common ancestor of amphibians and mammals", or more precisely the clade defined by the common ancestor of a specific amphibian and mammal (or bird or reptile), but whose tree is still being worked out (there are a number of extinct branches). Tetrapods ( Greek τετραποδη tetrapoda, Latin Quadruped, "four-footed" are Vertebrate Animals The taxon does not have a rank, and its subtaxa are subclades: these can be contained within one another, but one does not divide the clade into several non-overlapping taxa (as in traditional taxonomy): one can split into two clades at the first branching, but that is all. With regards to the traditional classes, Aves and Mammalia are subclades, contained in the subclade Amniota, but Reptilia* is a paraphyletic taxon, not a clade — "At best, the cladists suggest, we could say that the traditional Reptilia are "non-avian, non-mammalian amniotes"[16] — and instead one divides Amniota into the two clades Sauropsida (which contains birds and all living amniotes other than mammals, including all living traditional reptiles) and Theropsida (mammals and the extinct "mammal-like reptiles"). Birds ( class Aves) are bipedal endothermic ( Warm-blooded) Vertebrate animals that lay eggs. Mammals ( class Mammalia) are a class of Vertebrate Animals characterized by the presence of Sweat glands, including sweat glands The amniotes are a group of Tetrapod Vertebrates that include the Synapsida ( Mammals and Mammal-like reptiles and Sauropsida Reptiles, or members of the class Reptilia are air-breathing Cold-blooded Vertebrates that have skin covered in scales as opposed to hair or feathers Synapsids ('fused arch' also known as theropsids ('beast face' are a class of Animals that includes Mammals and everything closer to mammals than Similarly, Amphibia* is a paraphyletic taxon. See Reptilia#History of classification for further details. Reptiles, or members of the class Reptilia are air-breathing Cold-blooded Vertebrates that have skin covered in scales as opposed to hair or feathers

Distinctions

Proponents of cladistics enumerate key distinctions between cladistics and Linnaean taxonomy as follows:[17]

Cladistics Linnaean Taxonomy
Treats all levels of the tree as equivalent. Treats each tree level uniquely. Uses special names (such as Family, Class, Order) for each level.
Handles arbitrarily deep trees. Often must invent new level names (such as superorder, suborder, infraorder, parvorder, magnorder) to accommodate new discoveries. Biased towards trees about 4 to 12 levels deep.
Discourages naming or use of groups that are not monophyletic Acceptable to name and use paraphyletic groups
Primary goal is to reflect actual process of evolution Primary goal is to group species based on morphological similarities
Assumes that the shape of the tree will change frequently, with new discoveries New discoveries often require renaming or releveling of Classes, Orders, and Kingdoms
Definitions of taxa are objective, hence free from personal interpretation Definitions of taxa require individuals to make subjective decisions. A clade is a taxonomic group comprising a single Common ancestor and all the descendants of that ancestor A clade is a taxonomic group comprising a single Common ancestor and all the descendants of that ancestor For example, various taxonomists suggest that the number of Kingdoms in Biology is two, three, four, five, or six. In biological Taxonomy, a kingdom or regnum is a Taxonomic rank in either (historically the highest rank or (in the new three-domain system
Taxa, once defined, are permanent (e. g. "taxon X comprises the most recent common ancestor of species A and B along with its descendants") Taxa can be renamed and eliminated (e. g. Insectivora is one of many taxa in the Linnaean system that have been eliminated). The order Insectivora (from Latin insectum "insect" and vorare "to eat" is a now-abandoned biological grouping within the

Proponents of Linnaean taxonomy contend that it has some advantages over cladistics, such as:[18]

Cladistics Linnaean Taxonomy
Limited to entities related by evolution or ancestry Supports groupings without reference to evolution or ancestry
Does not include a process for naming species Includes a process for giving unique names to species
Difficult to understand the essence of a clade, because clade definitions emphasize ancestry at the expense of meaningful characteristics Taxa definitions based on tangible characteristics
Ignores sensible, clearly defined paraphyletic groups such as reptiles Permits clearly defined groups such as reptiles
Difficult to determine if a given species is in a clade or not (e. Reptiles, or members of the class Reptilia are air-breathing Cold-blooded Vertebrates that have skin covered in scales as opposed to hair or feathers Reptiles, or members of the class Reptilia are air-breathing Cold-blooded Vertebrates that have skin covered in scales as opposed to hair or feathers g. if clade X is defined as "most recent common ancestor of A and B along with its descendants", then the only way to determine if species Y is in the clade is to perform a complex evolutionary analysis) Straightforward process to determine if a given species is in a taxon or not
Limited to organisms that evolved by inherited traits; not applicable to organisms that evolved via complex gene sharing or lateral transfer Applicable to all organisms, regardless of evolutionary mechanism

Cladistics compared to phenetics

For some decades in the mid to late 20th century, a commonly used methodology was phenetics ("numerical taxonomy"). Phenetics should not be confused with Phonetics, the study of speech sounds despite the similarity in pronunciation This can be seen as a predecessor[19] to some methods of today's cladistics (namely distance matrix methods like neighbor-joining), but made no attempt to resolve phylogeny, only similarities. Distance matrices are used in phylogeny as non-parametric distance methods were originally applied to Phenetic data using a matrix of pairwise distances In Bioinformatics, neighbor-joining is a bottom-up clustering method used for the construction of phylogenetic trees Usually used for trees based on

Considered cutting edge in their time as they were among the first bioinformatics applications, phenetic methods are today superseded by cladistic analyses due to the inability of phenetics to provide an evolutionary hypothesis, except by chance: as phenetics does not distinguish between plesiomorphies (ancient common retained characters) and apomorphies (novel characters that arose after the last common ancestor), it will consider groups as "natural" even if they are only united by "primitive" (i. Bioinformatics is the application of information technology to the field of molecular biology eVolution is the third Album by eLDee, it was due to be released in 2008 e. , retained) characters.

Consider for example a cow, a whale and a human. Cattle, colloquially referred to as cows, are domesticated Ungulates a member of the Subfamily Bovinae of the family Whales are marine mammals which are neither Dolphins (ie members of the families Delphinidae or Platanistoidae) nor Porpoises Orcas Human beings, humans or man (Origin 1590–1600 L homō man OL hemō the earthly one (see Humus A cladistic analysis would recognize the whale's lack of legs as an apomorphy, whereas the presence of legs in cows and humans is plesiomorphic. It thus does not provide information on their relationships in a cladistic analysis, except that their last common ancestor had legs too. In a phenetic analysis, the presence of legs in cows and humans could be considered to indicate that they are closer relatives of each other than either is to whales. In fact, whales and cows are closer related to each other than either is to humans.

In short, phenetic analysis tend to resolve evolutionary grades as presumably monophyletic groups. In Alpha taxonomy, a grade refers to a level of morphological and/or Physiological complexity A clade is a taxonomic group comprising a single Common ancestor and all the descendants of that ancestor

Monophyletic groups encouraged

Many cladists discourage the use of paraphyletic groups because they detract from cladistics' emphasis on clades (monophyletic groups). In contrast, proponents of the use of paraphyletic groups argue that any dividing line in a cladogram creates both a monophyletic section above and a paraphyletic section below. They also contend that paraphyletic taxa are necessary for classifying earlier sections of the tree – for instance, the early vertebrates that would someday evolve into the family Hominidae cannot be placed in any other monophyletic family. They also argue that paraphyletic taxa provide information about significant changes in organisms' morphology, ecology, or life history – in short, that both paraphyletic groups and clades are valuable notions with separate purposes.

Simplified step by step procedure

Unrooted cladogram of the myosin supergene family
Unrooted cladogram of the myosin supergene family[20]

A simplified procedure for generating a cladogram is:[21]

  1. Gather and organize data
  2. Consider possible cladograms
  3. Select best cladogram

Step 1: Gather and organize data

A cladistic analysis begins with the following data:

For example, if analyzing 20 species of birds, the data might be:

Molecular versus morphological data

The characteristics used to create a cladogram can be roughly categorized as either morphological (synapsid skull, warm blooded, notochord, unicellular, etc. The notochord is a flexible rod-shaped body found in Embryos of all Chordates It is composed of cells derived from the Mesoderm and defines the ) or molecular (DNA, RNA, or other genetic information). [21] Prior to the advent of DNA sequencing, all cladistic analysis used morphological data.

As DNA sequencing has become cheaper and easier, molecular systematics has become a more and more popular way to reconstruct phylogenies. The term DNA sequencing encompasses biochemical methods for determining the order of the Nucleotide bases Adenine, Guanine, Cytosine [22] Using a parsimony criterion is only one of several methods to infer a phylogeny from molecular data; maximum likelihood and Bayesian inference, which incorporate explicit models of sequence evolution, are non-Hennigian ways to evaluate sequence data. Maximum likelihood estimation ( MLE) is a popular statistical method used for fitting a mathematical model to some data Bayesian inference is Statistical inference in which evidence or observations are used to update or to newly infer the Probability that a hypothesis may be true Another powerful method of reconstructing phylogenies is the use of genomic retrotransposon markers, which are thought to be less prone to the problem of reversion that plagues sequence data. Retrotransposon markers are Retrotransposons that are used as cladistic markers In biology mutations are changes to the Nucleotide sequence of the Genetic material of an organism They are also generally assumed to have a low incidence of homoplasies because it was once thought that their integration into the genome was entirely random; this seems at least sometimes not to be the case, however. In classical genetics the genome of a Diploid Organism including Eukarya refers to a full set of chromosomes or genes in a Gamete, thereby

Ideally, morphological, molecular, and possibly other phylogenies should be combined into an analysis of total evidence: All have different intrinsic sources of error. For example, character convergence (homoplasy) is much more common in morphological data than in molecular sequence data, but character reversions that are unrecognizable as such are more common in the latter (see long branch attraction). Convergent evolution describes the acquisition of the same biological trait in unrelated lineages Long branch attraction (LBA is a phenomenon in Phylogenetic analyses (most commonly those employing Maximum parsimony) when rapidly evolving lineages are inferred Morphological homoplasies can usually be recognized as such if character states are defined with enough attention to detail.

Plesiomorphies and synapomorphies

The researcher decides which character states were present before the last common ancestor of the species group (plesiomorphies) and which were present in the last common ancestor (synapomorphies) by considering one or more outgroups. This makes the choice of an outgroup an important task, since this choice can profoundly change the topology of a tree. Note that only synapomorphies are of use in characterising clades.

Avoid homoplasies

A homoplasy is a character that is shared by multiple species due to some cause other than common ancestry. Convergent evolution describes the acquisition of the same biological trait in unrelated lineages [23] Typically, homoplasies occur due to convergent evolution. Use of homoplasies when building a cladogram is sometimes unavoidable but is to be avoided when possible.

A well known example of homoplasy due to convergent evolution would be the character, "presence of wings". Though the wings of birds, bats, and insects serve the same function, each evolved independently, as can be seen by their anatomy. Anatomy (from the Greek anatomia, from ana separate apart from and temnein, to cut up cut open is a branch of Biology that is the consideration If a bird, bat, and a winged insect were scored for the character, "presence of wings", a homoplasy would be introduced into the dataset, and this would confound the analysis, possibly resulting in a false evolutionary scenario.

Homoplasies can often be avoided outright in morphological datasets by defining characters more precisely and increasing their number. When analyzing "supertrees" (datasets incorporating as many taxa of a suspected clade as possible), it may become unavoidable to introduce character definitions that are imprecise, as otherwise the characters might not apply at all to a large number of taxa; to continue with the "wings" example, the presence of wings would be hardly be a useful character if attempting a phylogeny of all Metazoa, as most of these don't have wings at all. Cautious choice and definition of characters thus is another important element in cladistic analyses. With a faulty outgroup or character set, no method of evaluation is likely to produce a phylogeny representing the evolutionary reality.

Step 2: Consider possible cladograms

When there are just a few species being organized, it is possible to do this step manually, but most cases require a computer program. Computational phylogenetics is the application of computational Algorithms methods and programs to phylogenetic analyses There are scores of computer programs available to support cladistics. [24] See phylogenetic tree for more information about tree-generating computer programs. A phylogenetic tree, also called an evolutionary tree, is a tree showing the Evolutionary relationships among various biological Species or other

Because the total number of possible cladograms grows exponentially with the number of species, it is impractical for a computer program to evaluate every individual cladogram. A typical cladistic program begins by using heuristic techniques to identify a small number of candidate cladograms. heuristic (hyu̇-ˈris-tik is a method to help solve a problem commonly an informal method Many cladistic programs then continue the search with the following repetitive steps:

  1. Evaluate the candidate cladograms by comparing them to the characteristic data
  2. Identify the best candidates that are most consistent with the characteristic data
  3. Create additional candidates by creating several variants of each of the best candidates from the prior step
  4. Use heuristics to create several new candidate cladograms unrelated to the prior candidates
  5. Repeat these steps until the cladograms stop getting better

Computer programs that generate cladograms use algorithms that are very computationally intensive,[25] because the cladogram algorithm is NP-hard. NP-hard (nondeterministic Polynomial-time hard in Computational complexity theory, is a class of problems informally "at least as hard as the hardest problems

Step 3: Select the best cladogram

There are several algorithms available to identify the "best" cladogram. In Mathematics, Computing, Linguistics and related subjects an algorithm is a sequence of finite instructions often used for Calculation [26] Most algorithms use a metric to measure how consistent a candidate cladogram is with the data. In Mathematics, a metric or distance function is a function which defines a Distance between elements of a set. Most cladogram algorithms use the mathematical techniques of optimization and minimization. In Mathematics, the term optimization, or mathematical programming, refers to the study of problems in which one seeks to minimize or maximize a real function

In general, cladogram generation algorithms must be implemented as computer programs, although some algorithms can be performed manually when the data sets are trivial (for example, just a few species and a couple of characteristics).

Some algorithms are useful only when the characteristic data is molecular (DNA, RNA) data. Other algorithms are useful only when the characteristic data is morphological data. Other algorithms can be used when the characteristic data includes both molecular and morphological data.

Algorithms for cladograms include least squares, neighbor-joining, parsimony, maximum likelihood, and Bayesian inference. The method of least squares is used to solve Overdetermined systems Least squares is often applied in statistical contexts particularly Regression analysis. In Bioinformatics, neighbor-joining is a bottom-up clustering method used for the construction of phylogenetic trees Usually used for trees based on Parsimony is a 'less is better' concept of frugality economy stinginess or caution in arriving at a hypothesis or course of action Maximum likelihood estimation ( MLE) is a popular statistical method used for fitting a mathematical model to some data Bayesian inference is Statistical inference in which evidence or observations are used to update or to newly infer the Probability that a hypothesis may be true

Biologists sometimes use the term parsimony for a specific kind of cladogram generation algorithm and sometimes as an umbrella term for all cladogram algorithms. Parsimony is a 'less is better' concept of frugality economy stinginess or caution in arriving at a hypothesis or course of action [27]

Algorithms that perform optimization tasks (such as building cladograms) can be sensitive to the order in which the input data (the list of species and their characteristics) is presented. Inputting the data in various orders can cause the same algorithm to produce different "best" cladograms. In these situations, the user should input the data in various orders and compare the results.

Using different algorithms on a single data set can sometimes yield different "best" cladograms, because each algorithm may have a unique definition of what is "best".

Because of the astronomical number of possible cladograms, algorithms cannot guarantee that the solution is the overall best solution. A nonoptimal cladogram will be selected if the program settles on a local minimum rather than the desired global minimum. [28] To help solve this problem, many cladogram algorithms use a simulated annealing approach to increase the likelihood that the selected cladogram is the optimal one. Simulated annealing (SA is a generic probabilistic Meta-algorithm for the Global optimization problem namely locating a good approximation to the [29]

How complex is the Tree of Life?

One argument in favor of cladistics is that it supports arbitrarily complex, arbitrarily deep trees. Especially when extinct species are considered (both known and unknown), the complexity and depth of the tree can be very large. Every single speciation event, including all the species that are now extinct, represents an additional fork on the hypothetical, complete cladogram representing the full tree of life. Fractals can be used to represent this notion of increasing detail: as a viewpoint zooms into the tree of life, the complexity remains virtually constant[30]. A fractal is generally "a rough or fragmented geometric shape that can be split into parts each of which is (at least approximately a reduced-size copy of the whole" This great complexity of the tree, and the uncertainty associated with the complexity, are among the reasons that cladists cite for the attractiveness of cladistics over traditional taxonomy.

Proponents of noncladistic approaches to taxonomy point to punctuated equilibrium to bolster the case that the tree of life has a finite depth and finite complexity. Punctuated equilibrium is a theory of evolutionary biology which states that most sexually reproducing populations experience little change for most of their geological If the number of species currently alive is finite, and the number of extinct species that we will ever know about is finite, then the depth and complexity of the tree of life is bounded, and there is no need to handle arbitrarily deep trees.

Phylocode approach to naming species

A formal code of phylogenetic nomenclature, the PhyloCode[31], is currently under development for cladistic taxonomy. The International Code of Phylogenetic Nomenclature, known for short as the PhyloCode, is a developing draft for a formal set of rules governing Phylogenetic It is intended for use by both those who would like to abandon Linnaean taxonomy and those who would like to use taxa and clades side by side. In several instances (see for example Hesperornithes) it has been employed to clarify uncertainties in Linnaean systematics so that in combination they yield a taxonomy that is unambiguously placing the group in the evolutionary tree in a way that is consistent with current knowledge. Hesperornithes is an extinct and highly specialized Clade of Cretaceous toothed Birds Hesperornithine birds apparently limited to former aquatic habitats

Terminology

The yellow group (sauropsids) is monophyletic, the blue group (reptiles) is paraphyletic, and the red group (warm-blooded animals) is polyphyletic.
The yellow group (sauropsids) is monophyletic, the blue group (reptiles) is paraphyletic, and the red group (warm-blooded animals) is polyphyletic. Reptiles, or members of the class Reptilia are air-breathing Cold-blooded Vertebrates that have skin covered in scales as opposed to hair or feathers A clade is a taxonomic group comprising a single Common ancestor and all the descendants of that ancestor Reptiles, or members of the class Reptilia are air-breathing Cold-blooded Vertebrates that have skin covered in scales as opposed to hair or feathers In Phylogenetics, a group of organisms is said to be paraphyletic if the group contains its most recent common ancestor but does not contain all In Phylogenetics, a Taxon is polyphyletic ( Greek for "of many races" if the trait its members have in common evolved separately in different

Origin of the term "cladistics"

Hennig's major book, even the 1979 version, does not contain the term cladistics in the index. He referred to his own approach as phylogenetic systematics, as implied by the book's title. A review paper by Dupuis observes that the term clade was introduced in 1958 by Julian Huxley, cladistic by Cain and Harrison in 1960, and cladist (for an adherent of Hennig's school) by Mayr in 1965. Sir Julian Sorell Huxley FRS ( 22 June 1887 &ndash 14 February 1975) was an English Evolutionary biologist [32]

Three definitions of clade

There are three ways to define a clade for use in a cladistic taxonomy. [33]

Applying cladistics to other disciplines

The processes used to generate cladograms are not limited to the field of biology[34]. The generic nature of cladistics means that cladistics can be used to organize groups of items in many different academic realms. The only requirement is that the items have characteristics that can be identified and measured.

A triple family tree of Linux distributions.
A triple family tree of Linux distributions.

Recent attempts in the use of cladistic methods outside of biology attack problems in anthropology[35], the filiation of manuscripts in textual criticism, and the lineage of Linux distros, a class of computer operating system. Textual criticism (or lower criticism) is a branch of Literary criticism that is concerned with the identification and removal of Transcription errors in A Linux distribution (also called GNU/Linux by distributions such as Debian, Fedora, Ubuntu, Linux Mint, Mandriva and An operating system (commonly abbreviated OS and O/S) is the software component of a Computer system that is responsible for the management and coordination

The attempts to apply cladistic programs on languages overlook that languages are learnt and often subject to sudden loss or acquisition of features, contrasting to biology, where most traits are inherited. Historical linguistics (also called diachronic linguistics) is the study of language change The results therefore, are generally unsatisfactory[36].

Footnotes

  1. ^ Phylogenetic Systematics is the title of Hennig's 1966 book
  2. ^ pp. 45, 78 and 555 of Joel Cracraft and Michael J. Donaghue, eds. (2004). Assembling the Tree of Life. Oxford, England: Oxford University Press.
  3. ^ Singh, Gurcharan (2004). Plant Systematics: An Integrated Approach. Science, 203-4. ISBN 1578083516.  
  4. ^ Albert, Victor (2006). Parsimony, Phylogeny, and Genomics. Oxford University Press, 3-55. ISBN 0199297304.  
  5. ^ Aldous, David (1996), “Probability Distributions on Cladograms”, Random Discrete Structures, Springer, p. 13 
  6. ^ Lowe, Andrew (2004). Ecological Genetics: Design, Analysis, and Application. Blackwell Publishing, 164. ISBN 1405100338.  
  7. ^ Scott-Ram, N. R. (1990). Transformed Cladistics, Taxonomy and Evolution. Cambridge University Press, 83. ISBN 0521340861.  
  8. ^ Freeman, Scott (1998). Evolutionary Analysis. Prentice Hall, 380. ISBN 0135680239.  
  9. ^ Carroll, Robert Lynn (1997). Patterns and Processes of Vertebrate Evolution. Cambridge University Press, 80. ISBN 052147809X.  
  10. ^ Letunic, I (2007). "Interactive Tree Of Life (iTOL): an online tool for phylogenetic tree display and annotation." (Pubmed). PubMed is a free search engine for accessing the MEDLINE database of citations and abstracts of biomedical research articles Bioinformatics 23(1): 127–8. doi:10.1093/bioinformatics/btl529. A digital object identifier ( DOI) is a permanent identifier given to an Electronic document.  
  11. ^ Unwin, David M. (2006). The Pterosaurs: From Deep Time. New York: Pi Press, 246. ISBN ISBN 0-13-146308-X.  
  12. ^ Mayr, E. (1985). "Darwin and the Definition of Phylogeny. " Systematic Zoology, 34(1): 97-98.
  13. ^ Wheeler, Quentin (2000). Species Concepts and Phylogenetic Theory: A Debate. Columbia University Press. ISBN 0231101430.  
  14. ^ Benton, M. (2000). "Stems, nodes, crown clades, and rank-free lists: is Linnaeus dead?". Biological Reviews 75 (4): 633–648.  
  15. ^ Hull, David (1988). David Lee Hull (born 15 June 1935) is a philosopher with a particular interest in the Philosophy of biology. Science as a Process. University of Chicago Press, 232-276. ISBN 0226360512.  
  16. ^ Colin Tudge (2000). Colin Tudge (born 22 April 1943) is a British science writer A biologist by training he is the author of numerous works on food agriculture genetics 2000 ( MM) was a Leap year that started on Saturday of the Common Era, in accordance with the Gregorian calendar. The Variety of Life. Oxford University Press. ISBN 0198604262.  
  17. ^ Hennig, Willi (1975). Emil Hans Willi Hennig ( April 20, 1913 in Dürrhennersdorf/ Oberlausitz – November 5, 1976 in Ludwigsburg) was a "'Cladistic analysis or cladistic classification': a reply to Ernst Mayr". Systematic Zoology 24: 244–256. doi:10.2307/2412765. A digital object identifier ( DOI) is a permanent identifier given to an Electronic document.  
  18. ^ Mayr, Ernst (1976). Ernst Walter Mayr ( July 5, 1904, Kempten, Germany &ndash February 3, 2005, Bedford Massachusetts U Evolution and the diversity of life (Selected essays). Cambridge, MA: Harvard Univ. Press. ISBN 0-674-27105-X.  
  19. ^ Mayr, Ernst (1982). Ernst Walter Mayr ( July 5, 1904, Kempten, Germany &ndash February 3, 2005, Bedford Massachusetts U The growth of biological thought: diversity, evolution and inheritance. Cambridge, MA: Harvard Univ. Press, 221. ISBN 0-674-36446-5.  
  20. ^ Hodge T, Cope M (2000). "A myosin family tree". J Cell Sci 113 Pt 19: 3353–4. PMID 10984423.  
  21. ^ a b DeSalle, Rob (2002). Techniques in Molecular Systematics and Evolution. Birkhauser. ISBN 376436257X.  
  22. ^ Hillis, David (1996). Molecular Systematics. Sinaur. ISBN 0878932828.  
  23. ^ West-Eberhard, Mary (2003). Developmental Plasticity and Evolution. Oxford Univ. Press, 353-376. ISBN 0195122356.  
  24. ^ List of Cladistics Software Programs.
  25. ^ Hodkinson, Trevor (2006). Reconstructing the Tree of Life: Taxonomy and Systematics of Species Rich Taxa. CRC Press, 61-128. ISBN 0849395798.  
  26. ^ Kitching, Ian (1998). Cladistics: The Theory and Practice of Parsimony Analysis. Oxford University Press. ISBN 0198501382.  
  27. ^ Stewart, Caro-Beth (1993). "The Powers and Pitfalls of Parsimony". Nature 361: 603–607. doi:10.1038/361603a0. A digital object identifier ( DOI) is a permanent identifier given to an Electronic document.  
  28. ^ Foley, Peter (1993). Cladistics: A Practical Course in Systematics. Oxford Univ. Press, 66. ISBN 0198577664.  
  29. ^ Nixon K. C. (1999). "The Parsimony Ratchet: a new method for rapid parsimony analysis". Cladistics 15: 407–414. doi:10.1111/j.1096-0031.1999.tb00277.x. A digital object identifier ( DOI) is a permanent identifier given to an Electronic document.  
  30. ^ Gordon, Richard (1999). The Hierarchical Genome and Differentiation Waves. World Scientific, 632. ISBN 9810222688.  
  31. ^ Pennisi, E. (2001). "Evolutionary Biology: Preparing the Ground for a Modern 'Tree of Life'". Science 293: 1979–1980. doi:10.1126/science.293.5537.1979. A digital object identifier ( DOI) is a permanent identifier given to an Electronic document.  
  32. ^ Dupuis, Claude (1984). "Willi Hennig's impact on taxonomic thought". Annual Review of Ecology and Systematics 15: 1–24. ISSN 0066-4162. An International Standard Serial Number ( ISSN) is a unique eight-digit number used to identify a print or electronic Periodical publication.  
  33. ^ de Queiroz, K. and J. Gauthier (1994). "Toward a phylogenetic system of biological nomenclature". Trends in Research in Ecology and Evolution 9 (1): 27–31. doi:10.1016/0169-5347(94)90231-3. A digital object identifier ( DOI) is a permanent identifier given to an Electronic document.  
  34. ^ Mace, Ruth (2005). The Evolution of Cultural Diversity: A Phylogenetic Approach. Routledge Cavendish. ISBN 1844720993.  
  35. ^ Lipo, Carl (2005). Mapping Our Ancestors: Phylogenetic Approaches in Anthropology and Prehistory. Aldine Transaction. ISBN 0202307514.  
  36. ^ Holm, Hans J. (2007). "The new Arboretum of Indo-European 'Trees'. Can New Algorithms Reveal the Phylogeny and Even Prehistory of Indo-European?". Journal of Quantitative Linguistics 14,2 - 3: 167–214. doi:10.1080/09296170701378916. A digital object identifier ( DOI) is a permanent identifier given to an Electronic document.  

See also

References

External links

An International Standard Serial Number ( ISSN) is a unique eight-digit number used to identify a print or electronic Periodical publication.

Dictionary

cladistics

-noun

  1. (systematics): An approach to biological systematics in which organisms are grouped based upon synapomorphies (shared derived characteristics) only, and not upon symplesiomorphies (shared ancestral characteristics).
© 2009 citizendia.org; parts available under the terms of GNU Free Documentation License, from http://en.wikipedia.org
Dapyx Software network: MP3 Explorer | Ebook Manager | Zenithic