67
Anatomy of a Cladistic Analysis Nico M. Franz School of Life Sciences, Arizona State University XXXI Willi Hennig Meeting UC Riverside June 26, 2012; Riverside, CA

Franz. Anatomy of a Cladistic Analysis

Embed Size (px)

DESCRIPTION

The sequential stages culminating in the publication of a morphological cladistic analysis of weevils in the Exophthalmus genus complex (Coleoptera: Curculionidae: Entiminae) are reviewed, with an emphasis on how early- stage homology assessments were gradually evaluated and refined in light of intermittent phylogenetic insights. In all, 60 incremental versions of the evolving character matrix were congealed and analysed, starting with an assembly of 52 taxa and ten traditionally deployed diagnostic characters, and ending with 90 taxa and 143 characters that reflect significantly more narrow assessments of phylogenetic similarity and scope. Standard matrix properties and analytical tree statistics were traced throughout the analytical process, and series of incongruence length indifference tests were used to identify critical points of topology change among succeeding matrix versions. This kind of parsimony-contingent rescoping is generally representative of the inferential process of character individuation within individual and across multiple cladistic analyses. The expected long-term outcome is a maturing observational terminology in which precise inferences of homology are parsimony-contingent, and the notions of homology and parsimony are inextricably linked. This contingent view of cladistic character individuation is contrasted with current approaches to developing phenotype ontologies based on homology-neutral structural equivalence expressions. Recommendations are made to transparently embrace the parsimony-contingent nature of cladistic homology.

Citation preview

Page 1: Franz. Anatomy of a Cladistic Analysis

Anatomy of a Cladistic Analysis

Nico M. Franz

School of Life Sciences, Arizona State University

XXXI Willi Hennig Meeting – UC Riverside

June 26, 2012; Riverside, CA

Page 2: Franz. Anatomy of a Cladistic Analysis

• "The monophyly of the Neotropical entimine weevil genus Exophthalmus Schoenherr, 1823 (Curculionidae: Entiminae: Eustylini Lacordaire) is reassessed."

• […] "The present study scrutinizes these traditional perspectives, based on a cladistic analysis of 143 adult morphological characters and 90 species, representing 30 genera and seven tribes of Neotropical entimine weevils."

• The character matrix yielded eight most-parsimonious cladograms (length = 239 steps; consistency index = 66; retention index = 91), with mixed clade support that remains particularly wanting for some of the deeper in-group divergences."

From the abstract:

Page 3: Franz. Anatomy of a Cladistic Analysis

(A) E. agrestis (Boheman); (B) E. consobrinus (Marshall); (C) E. hieroglyphicus Chevrolat

(D) E. impressus (Fabricius); (E) E. nicaraguensis Bovie; (F) E. quadrivittatus (Olivier)

(G) E. quinquedecimpunctatus (Olivier); (H) E. roseipes (Chevrolat); (I) E. sulcicrus Champion

(J) E. triangulifer Champion; (K) E. verecundus (Chevrolat); (L) E. vittatus (Linnaeus)

Page 4: Franz. Anatomy of a Cladistic Analysis

Fig. 2. Preferred cladogram & character state optimizations

Page 5: Franz. Anatomy of a Cladistic Analysis

Dissecting the process – motivating themes

"Cladists may use the congruence test to iteratively refine assessments of homology, and thereby increase the odds of reliable phylogenetic inference under parsimony. This explanation challenges alternative views which tend to ignore the effects of parsimony on the process of character individuation in systematics."

Franz. 2005. Outline of an explanatory account of cladistic practice. Biol. Phil. 20: 489–515.

Page 6: Franz. Anatomy of a Cladistic Analysis

"Cladists may use the congruence test to iteratively refine assessments of homology, and thereby increase the odds of reliable phylogenetic inference under parsimony. This explanation challenges alternative views which tend to ignore the effects of parsimony on the process of character individuation in systematics."

Franz. 2005. Outline of an explanatory account of cladistic practice. Biol. Phil. 20: 489–515.

"The identification of the valid scope for character statements cannot be a matter

of mere ostension, or rigid designation, but must be a matter of scientific theory

construction. Scope expansion of character statements can result in a situation

where purportedly similar structures, apparently denoted by the same name

(proper name or kind name), are in fact not the same. The nonhomology of such

characters may be revealed through morphological complexity at the comparative

level, by tree topology at the analytical level, or both."

Rieppel. 2007. The performance of morphological characters in broad-scale phylogenetic analyses. Biol. J. Linn. Soc. 92: 297–308.

Dissecting the process – motivating themes

Page 7: Franz. Anatomy of a Cladistic Analysis

Sneak preview: 60 matrices, 8 stages – tracking the analysis

Page 8: Franz. Anatomy of a Cladistic Analysis

Legacy character assembly

Page 9: Franz. Anatomy of a Cladistic Analysis

Source: Champion. 1911. Otiorhynchinae Alatae. In: Biologia Centrali-Americana, Vol. 4, Part 3. London.

Stage 1: legacy character assembly

Champion (1911) – winged vs. wingless entimine groups

Page 10: Franz. Anatomy of a Cladistic Analysis

Character has a length of 10 steps in a narrowly scoped analysis (Exophthalmus and closely related genera).

Stage 1: legacy character assembly

Page 11: Franz. Anatomy of a Cladistic Analysis

Initial characters and states had a reputable pedigree

Lacordaire. 1863. Histoire naturelle des insectes, Vol 6. Paris, Roret.Champion. 1911. Otiorhynchinae Alatae. Biologia Centrali-Americana, Vol. 4, Part 3. London; pp. 178–317.van Emden. 1944. A key to the genera of Brachyderinae of the world. Ann. Mag. Nat. Hist. 11: 503–532, 559–586.Anderson. 2002. Family 131. Curculionidae. In: American B, Vol. 2. Boca Raton, CRC Press; pp. 722–815.Franz. 2010. Redescriptions of critical type species in the Eustylini Lacordaire (Coleoptera: Curculionidae: Entiminae). J. Nat. Hist. 44: 41-80.

Page 12: Franz. Anatomy of a Cladistic Analysis

Early cladistic outcomes (Stage 1)

52 taxa

Page 13: Franz. Anatomy of a Cladistic Analysis

Early cladistic outcomes (Stage 1)

100 characters

Page 14: Franz. Anatomy of a Cladistic Analysis

547 steps!(~ 5.5 / char.)

Early cladistic outcomes (Stage 1)

Page 15: Franz. Anatomy of a Cladistic Analysis

Matrix 10

Select legacy characters

Page 16: Franz. Anatomy of a Cladistic Analysis

Consistency index and retention index

rapid decline

Page 17: Franz. Anatomy of a Cladistic Analysis

Character coding – binary, multi-state

~ 40%multi-state characters

Page 18: Franz. Anatomy of a Cladistic Analysis

Character provenance – external, internal

initiallyall external

Page 19: Franz. Anatomy of a Cladistic Analysis

Number of MPTs and nodes collapsed in consensus

3 trees,3 collapsed

Not publication worthy.

Return to initial homology assessments (chars./states).

Page 20: Franz. Anatomy of a Cladistic Analysis

Rescoping starts…

Page 21: Franz. Anatomy of a Cladistic Analysis

Matrices 11-17: rescoping phase I – some examples

Page 22: Franz. Anatomy of a Cladistic Analysis

Intermediate stages 2-5 – rescoping, taxon/character addition

addressingpoorest

characters

Page 23: Franz. Anatomy of a Cladistic Analysis

Intermediate stages 2-5 – rescoping, taxon/character addition

evidence of large gaps in

sampling

Page 24: Franz. Anatomy of a Cladistic Analysis

Intermediate stages 2-5 – rescoping, taxon/character addition

addition of out-/ingroup taxa (52 90)

Page 25: Franz. Anatomy of a Cladistic Analysis
Page 26: Franz. Anatomy of a Cladistic Analysis

Intermediate stages 2-5 – rescoping, taxon/character addition

new taxa new (unscoped)characters

Page 27: Franz. Anatomy of a Cladistic Analysis

Intermediate stages 2-5 – rescoping, taxon/character addition

rescoping all in expanded context

Page 28: Franz. Anatomy of a Cladistic Analysis

Consistency index and retention index

steadyincrease

Page 29: Franz. Anatomy of a Cladistic Analysis

Character coding – binary, multi-state

> 90% binary characters

"exploratory" reductive

coding

Page 30: Franz. Anatomy of a Cladistic Analysis

Character provenance – external, internal

> 20% internal

characters

Page 31: Franz. Anatomy of a Cladistic Analysis

Number of MPTs and nodes collapsed in consensus

Still not ready for publication.

> 1200 trees,> 35 collapsed

Focus on detail homology, perform "aggressive scope reduction"

Page 32: Franz. Anatomy of a Cladistic Analysis

And so…

Page 33: Franz. Anatomy of a Cladistic Analysis

Contingent rescoping – tricarinate rostrum of Diaprepes spp.

17. "Rostrum tricarinate, …

7 species

Page 34: Franz. Anatomy of a Cladistic Analysis

17. "Rostrum tricarinate, …

Contingent rescoping – tricarinate rostrum of Diaprepes spp.

Exophthalmus Otiorhynchus

Pachnaeus Phaops Rhinospathe

7 species

Are these rostra also tricarinate (in homology to Diaprepes)?

Page 35: Franz. Anatomy of a Cladistic Analysis

17. "Rostrum tricarinate, with a characteristic combination of one median carina and two (dorso-) lateral, apically slightly diverging carinae, each carina narrow, moderately sharp."

Contingent rescoping – tricarinate rostrum of Diaprepes spp.

Exophthalmus Otiorhynchus

Pachnaeus Phaops Rhinospathe

7 species

(1) present (0) absent

(0) absent

(–) inapplicable

(–) inapplicable (–) inapplicable

No, not if intermittent phylogenetic insights are transparently included.

Page 36: Franz. Anatomy of a Cladistic Analysis

Narrowly scoped, deep-level homologies

Page 37: Franz. Anatomy of a Cladistic Analysis

Narrowly scoped, deep-level homologies

Page 38: Franz. Anatomy of a Cladistic Analysis

Narrowly scoped, deep-level homologies

Page 39: Franz. Anatomy of a Cladistic Analysis

Terminal stages 6-8 – added resolution, robustness

narrowly rescoped characters, more stable tree length

Page 40: Franz. Anatomy of a Cladistic Analysis

Consistency index and retention index

approaching "maximum" levels

Page 41: Franz. Anatomy of a Cladistic Analysis

Character coding – binary, multi-state

> 20% characters with inapplicables

Page 42: Franz. Anatomy of a Cladistic Analysis

Character provenance – external, internal

~ 40% internalcharacters

Page 43: Franz. Anatomy of a Cladistic Analysis

Number of MPTs and nodes collapsed in consensus

8 trees,3 collapsed

Ready for publication.

Page 44: Franz. Anatomy of a Cladistic Analysis

Review…from matrix 10 to 60

Topology, Optimization, Language

Page 45: Franz. Anatomy of a Cladistic Analysis

Overview of significant topology changes – matrices 1-60

Transition of matrix 30-31 yielded earliest "reliable" results.

Page 46: Franz. Anatomy of a Cladistic Analysis

Matrix 10[topology]

• 52 taxa• 100 characters• 4 MPTs• L = 547 steps• CI = 28• RI = 66• 3 nodes collapsed• Bremer support

Exophthalmus spp. Nested within Exoph.

Page 47: Franz. Anatomy of a Cladistic Analysis

• 52 taxa (=)• 69 characters (– 31)• 3 MPTs (– 1)• L = 159 steps (– 388) • CI = 48 (+ 20)• RI = 83 (+ 17)• 3 nodes collapsed (=)• Bremer support

Matrix 20[topology]

Exophthalmus spp. Nested within Exoph.

Page 48: Franz. Anatomy of a Cladistic Analysis

• 90 taxa (+ 38)• 91 characters (+ 22)• 2192 MPTs (+ 2189)• L = 205 steps (+ 46) • CI = 45 (– 3)• RI = 83 (=)• 38 nodes collapsed (+ 35)• Bremer support

Matrix 30[topology]

Exophthalmus spp. Nested within Exoph.

Page 49: Franz. Anatomy of a Cladistic Analysis

• 90 taxa (=)• 143 characters (+ 52)• 8 MPTs (– 2184)• L = 239 steps (+ 24) • CI = 66 (+ 21)• RI = 91 (+ 8)• 3 nodes collapsed (– 35)• Bremer support

Matrix 60[topology]

Exophthalmus spp. Nested within Exoph.

Page 50: Franz. Anatomy of a Cladistic Analysis

• 52 taxa• 100 characters• 4 MPTs• L = 547 steps• CI = 28• RI = 66• 3 nodes collapsed• Diagnoses unwieldy• Synapomorphies rare

Matrix 10[optimization]

Select legacy characters

Page 51: Franz. Anatomy of a Cladistic Analysis

• 90 taxa (+ 38)• 143 characters (+ 43)• 8 MPTs (+ 4)• L = 239 steps (– 308) • CI = 66 (+ 38)• RI = 91 (+ 25)• 3 nodes collapsed (=)• Diagnoses concise• Synapomorph. common

Matrix 60[optimization]

Page 52: Franz. Anatomy of a Cladistic Analysis

Matrix 10 [language]

Lacordaire. 1863. Histoire naturelle des insectes, Vol 6. Paris, Roret.Champion. 1911. Otiorhynchinae Alatae. Biologia Centrali-Americana, Vol. 4, Part 3. London; pp. 178–317.van Emden. 1944. A key to the genera of Brachyderinae of the world. Ann. Mag. Nat. Hist. 11: 503–532, 559–586.Anderson. 2002. Family 131. Curculionidae. In: American B, Vol. 2. Boca Raton, CRC Press; pp. 722–815.Franz. 2010. Redescriptions of critical type species in the Eustylini Lacordaire (Coleoptera: Curculionidae: Entiminae). J. Nat. Hist. 44: 41-80.

Page 53: Franz. Anatomy of a Cladistic Analysis

Matrix 60 [language]

Page 54: Franz. Anatomy of a Cladistic Analysis

Related issue…

Which characters should constitute

phenotype anatomy ontologies?

Page 55: Franz. Anatomy of a Cladistic Analysis

Related issue – the value of parsimony-contingent homology

Special emphasis on constructing phenotypic anatomy ontologies.

"We have taken an integrative approach in the building of Uberon, and in doing so embrace multiple axes of classification. […] This homology-neutrality of Uberon is a deliberate design feature of the ontology. We believe that specifying homology relationships and descent from common ancestral structures is of obvious high value, but that this need not be tightly coupled to the development of an upper anatomical ontology."

Mungall et al. 2012. Uberon, an integrative multi-species anatomy ontology. Genome Biol. 13: R5.

Page 56: Franz. Anatomy of a Cladistic Analysis

Related issue – the value of parsimony-contingent homology

Special emphasis on constructing phenotypic anatomy ontologies.

"We have taken an integrative approach in the building of Uberon, and in doing so embrace multiple axes of classification. […] This homology-neutrality of Uberon is a deliberate design feature of the ontology. We believe that specifying homology relationships and descent from common ancestral structures is of obvious high value, but that this need not be tightly coupled to the development of an upper anatomical ontology."

"Explanatory homology hypotheses should not be mistaken and blended with morphological descriptions, which in their turn are by nature descriptive and not explanatory. […] Instead, we differentiate phylogenetic investigations into the step of producing data and the step of phylogenetic reasoning."

Mungall et al. 2012. Uberon, an integrative multi-species anatomy ontology. Genome Biol. 13: R5.

Vogt et al. 2010. The linguistic problem of morphology: structure versus homology and the standardization of morphological data. Cladistics 26: 301–325.

Page 57: Franz. Anatomy of a Cladistic Analysis

Source: Davis. 2011. Delimiting baridine weevil evolution (Coleoptera: Curculionidae: Baridinae). Zool. J. Linn. Soc. 161: 88–156.

So then is this what we have in mind for classes in phenotype ontologies?

Page 58: Franz. Anatomy of a Cladistic Analysis

• Analytical phylogenetic methods not only organize character information, but may furthermore have the purpose of shaping character individuation.

• In the case of the Exophthalmus analysis, I would have been hard pressed to arrive at the final descriptions of characters and states without benefitting from intermittent parsimony-driven inferences that led to the reweighting and rescoping of earlier homology assessments. It is not always conducive to a researcher's reputation to expose these practices, but they do and must occur frequently.

Conclusions

Page 59: Franz. Anatomy of a Cladistic Analysis

• Analytical phylogenetic methods not only organize character information, but may furthermore have the purpose of shaping character individuation.

• In the case of the Exophthalmus analysis, I would have been hard pressed to arrive at the final descriptions of characters and states without benefitting from intermittent parsimony-driven inferences that led to the reweighting and rescoping of earlier homology assessments. It is not always conducive to a researcher's reputation to expose these practices, but they do and must occur frequently.

• Under the cladistic paradigm, the most precise inferences of homology are often parsimony-influenced and parsimony-contingent, and the two notions are inextricably linked and entrenched in our maturing observational terminology.

• By integrating expressions of structural equivalence at increasingly greater scales, phenotype ontologies also run the risk of 'dialing down' the most precise and phylogenetically scoped assessments of homology that systematics can produce.

Conclusions

Page 60: Franz. Anatomy of a Cladistic Analysis

Acknowledgments

WHS XXXI Organizers

Juliana Cardona Duque, Jennifer Girón, Anyimilehidi Mazo Vargas, Quentin Wheeler

NSF-DEB 1155984: "Systematics of eustyline and geonemine weevils: Connecting and contrasting Caribbean and Neotropical mainland radiations"

Page 61: Franz. Anatomy of a Cladistic Analysis
Page 62: Franz. Anatomy of a Cladistic Analysis

Source: Lacordaire. 1863. Histoire naturelle des insectes, Vol. 6. Paris: Roret.

Lacordaire (1863) – typically a 1-3 character system for tribes/genera.

Stage 1: legacy character assembly

Page 63: Franz. Anatomy of a Cladistic Analysis

van Emden (1944) state-of-the-art for entimine tribes / genera

Source: van Emden. 1944. A key to the genera of Brachyderinae of the world. Annals and Magazine of Natural History 11: 503-532, 559-586.

Page 64: Franz. Anatomy of a Cladistic Analysis

Anderson (2002) – identifies subfamilies, genera; not tribes

Source: Anderson. 2002. Family 131. Curculionidae. In: Arnett et al. (Eds.): American Beetles, Vol. 2. Boca Raton, CRC Press; pp. 722-815.

Page 65: Franz. Anatomy of a Cladistic Analysis

Initial characters and states had a reputable pedigree

Page 66: Franz. Anatomy of a Cladistic Analysis

Examples of poorly performing characters ( recode / eliminate)

Page 67: Franz. Anatomy of a Cladistic Analysis

Matrix 18: Examples of deactivated characters