More taxa or more characters revisited: combining data from nuclear protein-encoding genes for phylogenetic analyses of Noctuoidea (Insecta: Lepidoptera)

Mitchell, A; Mitter, C; Regier, J.C.

More taxa or more characters revisited: combining data from nuclear protein-encoding genes for phylogenetic analyses of Noctuoidea (Insecta: Lepidoptera)

Author

Mitchell, A; Mitter, C; Regier, J.C.
Year

2000
Journal

Systematic Biology

Abstract

A central question concerning data collection strategy for molecular phylogenies has been, is it better to increase the number of characters or the number of taxa sampled to improve the robustness of a phylogeny estimate? A recent simulation study concluded that increasing the number of taxa sampled is preferable to increasing the number of nucleotide characters, if taxa are chosen specifically to break up long branches. We explore this hypothesis by using empirical data from noctuoid moths, one of the largest superfamilies of insects. Separate studies of two nuclear genes, elongation factor-1 alpha (EF-1 alpha) and dopa decarboxylase (DDC), have yielded similar gene trees and high concordance with morphological groupings for 49 exemplar species. However, support levels were quite low for nodes deeper than the subfamily level. We tested the effects on phylogenetic signal of (1) increasing the taxon sampling by nearly 60%, to 77 species, and (2) combining data from the two genes in a single analysis. Surprisingly, the increased taxon sampling, although designed to break up long branches, generated greater disagreement between the two gene data sets and decreased support levels for deeper nodes. We appear to have inadvertently introduced new long branches, and breaking these up may require a yet larger taxon sample. Sampling additional characters (combining data) greatly increased the phylogenetic signal. To contrast the potential effect of combining data from independent genes with collection of the same total number of characters from a single gene, we simulated the latter by bootstrap augmentation of the single-gene data sets. Support levels for combined data were at least as high as those for the bootstrap-augmented data set for DDC and were much higher than those for the augmented EF-1 alpha data set. This supports the view that in obtaining additional sequence data to solve a refractory systematic problem, it is prudent to take them from an independent gene.

Bibliographic Data

Title: More taxa or more characters revisited: combining data from nuclear protein-encoding genes for phylogenetic analyses of Noctuoidea (Insecta: Lepidoptera)
Author: Mitchell, A; Mitter, C; Regier, J.C.
Year: 2000
Publication Type: Refereed Article
Journal: Systematic Biology
Number of pages: 202-224
Volume: 49
Issue: 2
Language: en

More taxa or more characters revisited: combining data from nuclear protein-encoding genes for phylogenetic analyses of Noctuoidea (Insecta: Lepidoptera)

Contents

Abstract

Bibliographic Data

Molecular phylogenetics of Caenogastropoda (Gastropoda: Mollusca)

The phylogenetic position of the Isopoda in the Peracarida (Crustacea: Malacostraca)

Phylogenetic utility of the nuclear gene dopa decarboxylase in noctuoid moths (Insecta: Lepidoptera: Noctuoidea)

A phylogenetic study of the parrotfish family Scaridae (Pisces: Labroidea), with a revision of genera

Multi-Gene Analyses of the Phylogenetic Relationships among the Mollusca, Annelida, and Arthropoda

The Marsupial Dystrophin Gene

Phylogenetic relationships within Serpulidae (Annelida: Polychaeta) inferred from molecular and morphological data.

Phylogeny of the gastropod superfamily Cerithioidea using morphology and molecules

The phylogenetic position of Neritimorpha based on the mitochondrial genome of Nerita melanotragus (Mollusca: Gastropoda)

Phylogenetic relationships of rock-wallabies, Petrogale (Marsupialia: Macropodidae) and their biogeographic history within Australia

A highly conserved nuclear gene for low-level phylogenetics: Elongation Factor-1a recovers morphology-based tree for heliothine moths

Further phylogenetic studies of the Polychaeta using 18S rDNA sequence data