ANGIOSPERM PHYLOGENY WEBSITE, version 14.
On classifications in general, and in particular on the classification used here.
On forming clade characterizations (and thinking about apomorphies).
SUMMARY OF APG IV SYSTEM AND LINKS TO MAIN PAGES.
On some poorly-known taxa that are in need of study.
On the organization and design of this site.
On the interpretation of the text, etc.
Important - Warning to All Users!
History of the site.
Website developed and maintained by Hilary Davis: email@example.com
Systematics is a profoundly historical discipline, and we forget this at our peril. Only with a phylogeny can we begin to understand diversification, regularities in patterns of evolution, or simply suggest individual evolutionary changes within a clade. Our recovery of that phylogeny is the recovery of evidence of a series of unique events that comprises the history of life. These pages are a series of characterizations of all orders and families of extant angiosperms (flowering plants) and gymnosperms, i.e. all seed plants, as well as many of clades grouping families and orders and some smaller clades, especially within larger families; non-seed plants are covered more briefly. The pages are designed to help in teaching seed plant phylogeny at a time when our knowledge of the major clades of seed plants and the relationships within and between them are still somewhat in a state of flux, even if much of the broad outline is becoming clear (but c.f. A.P.G. IV 2016), one can see a daily-updated Tree of Life based on genome sequences as they appear, for example (Fang et al. 2013), a comprehensive Tree of Life is being developed (Hinchcliff et al. 2015). Nevertheless, as of viii.2018, I estimate that there are still substantial questions about relationships within about twelve orders of seed plants, less serious questions about five more, and four or more internal nodes/areas on the main tree where there are important problems. Perennial aggravations have been the position of Gnetales and relationships of and within the bryophytes, although there seems to be progress with these issues (e.g. Ran et al. 2018, and Wickett et al. 2014, Puttick et al. 2018, and Morris et al. 2018, respectively). The now almost routine analysis of massive amounts of data from choroplast genomes is helping to shed light on relationships in some of these areas, but analyses of nuclear and mitochondrial data question some relationships that we thought we understood (e.g. Sun et al. 2014: rosids; Rydin et al. 2017: Rubiaceae). As details of phylogeny are clarified and new findings made in anatomy, morphology, etc. (but there are far fewer of these than one would like), they can be rapidly integrated into the Angiosperm Phylogeny Group system that is followed here. Indeed, there are discussions as how best to provide the means to produce trees and analyses of features in the context of phylogenies using vast amounts of data in a repeatable way and on a regularly updatable basis (Eiserhart et al. 2018) - we shall see. Certainly books are out-of-date before they appear, but the best comprehensive phylogeny-based treatment of angiosperms is by D. Soltis et al. (2005b, 2017, ed. 2); see also K. Bremer et al. (2004b). There are useful treatments of the main European families in Sitte et al. (2002) and the 37th edition of Strasburger's Lehrbuch (Kadereit et al. 2014), and of North American - and some other - families in Judd et al. (1999, 2015, ed. 4) and Simpson (2010). Neotropikey (Milliken et al. 2009 onwards) - more than a simple interactive key - is a resource for students of the neotropical flora while the most recent edition of Adolf Engler's Syllabus of Plant Families/Syllabus der Pflanzenfamilien (Frey 2015) that is coming out is based on recent findings in phylogeny. There is a regularly updated and printable Angiosperm Phylogeny Poster (now in more than 20 languages: Cole et al. 2017 onwards) and a different visualization of the seed plant universe at Botanical Chart. Johansson (2013) and Byng (2014) are comprehensive phylogeny-based accounts of angiosperms both of which are being updated, Byng (2015) covers gymnosperms, the beautifully-illustrated Christenhusz et al. (2017) includes accounts of all vascular plant families, while Lecointre and Guyadet (2016) are going to cover everything.
The focus of this site is on angiosperm families, although treatments of gymnosperms were added in 2005 and other embryophytes/land plants are covered in a less detailed fashion. Emphasis is placed on plant families because they are the groups - admittedly partly arbitrary as to circumscription, but now monophyletic (= including all and only the known species of a common ancestor, but see below) - around which many of us organize our understanding of plant diversity. I also pay attention to groupings of families and orders (for which, see A.P.G. IV 2016), while infrafamilial groupings in larger families like Annonaceae, Apocynaceae, Ericaceae, Fabaceae, Malvaceae, Poaceae, Rubiaceae, Rutaceae, Scrophulariaceae, etc., are being added as studies become available. I tend to focus on literature that deals with clades with fifty or more species, although in smaller families the coverage is more detailed.
Relationships between families, etc., are shown as branching diagrams or trees; Baum and Smith (2012) provide an excellent account of how to interpret and think about such trees. Trees are a means to an end, that is, to help us understand - in the broadest sense - phylogeny. Throughout the site, the treatment of variation is as hierarchical as I can make it, with putative apomorphies being mentioned at the appropriate place in the tree, although in the ultimate groupings both apomorphies and characters varying within these groups are mentioned. Finding out the composition of clades is quite often easier than finding the synapomorphies for those clades, indeed, recent developments in ancestral state reconstruction make it clear how difficult the latter can be (see the discussion below), and uncertainties about the ages of clades remain pervasive. And knowing about synapomorphies and ages, difficult although this may be, is just the beginning of understanding the whys and wherefores of the evolution and diversification of seed plants, our ultimate goal. Hence the sections on various aspects of evolution that include "Divergence & Distribution", "Ecology & Physiology", "Pollination Biology & Seed Dispersal", "Plant-Animal Interactions" and "Bacterial/Fungal Associations" in these pages. And of course phylogeny is just one way of looking at land plants, but I hope it will become clear that it is a very important one that allows the incorporation of other viewpoints, too, I hope giving them interesting new perspectives.
There is a separate and more extended discussion on the evolution and diversification of angiosperms as a whole, and here in particular the emphasis is less on species numbers, more on what species/clades "do", and the major role that plants play in constructing the environment in which they (and we) live. Here thinking about the microbiome, other organisms closely associated with plants such as bacteria and endophytic and mycorrhizal fungi, is important, since it is the plant and microbiome together that help shape the ecological relationships between plants and their environment. Indeed, knowledge of the microbiome and its interactions with the host plant changes what we think of as an individual plant - a plant is part of a holobiont, a group of more or less closely integrated and interacting organisms, and genomes of all these organisms make up the hologenome, a microcosm - maybe not so micro - that can be thought of as the holobiome (Bordenstein & Theis 2015; Gilbert & Tauber 2016; Martin et al. 2017; Tripp et al. 2017a: symbiome). Thus the ectomycorrhizal (ECM) associates of seedlings of pinyon pines (Pinus edulis) differed, but the ECM fungi and the genotype of the seedlings resembled those of the parent pines, which were either drought-tolerant or -intolerant (Gehring et al. 2017); and as those authors (ibid.: p. 11170) noted, "ECM community composition represents a heritable plant trait". Along the same lines, the effect of a caterpillar on a plant can be manipulated by parasitoids of that caterpillar, and ultimately by polydnaviruses in the parasitoids - the saliva of the infected caterpillar is less likely to elicit plant defences than that of an uninfected caterpillar (Tan et al. 2018; see also Zhu et al. 2018) - all for the benefit of the polydnavirus.
Further information about all families mentioned on this page may readily be found by using the "Families" link in the top bar or the Summary below; general information about characters mentioned may also be found in the characters page.
ON CLASSIFICATIONS IN GENERAL, AND THIS CLASSIFICATION IN PARTICULAR
[Back to Top]
On classifications in general
Godfray and Knapp (2004: p. 562) observed that "users want stable, informative and accessible classifications that enable easy identification" - although invoking "users" without specifying those who make up this group rather begs the question. The classification here is for all interested in comparative biology, hence the emphasis on monophyly; there are, as we shall see, many ways of making such a classification accessible to all.
Classifications in the broad sense are box-in-box, group-in-group, or part/whole naming devices that we use to communicate aspects of our knowledge of things in general (see Parsons & Wand 2008 for an introduction). For any classification system to be effective, it must be stable, universal, i.e., be used by a wide range of people, and it must enhance communication of knowledge by helping us to relate things in our minds. From this point of view biological classification systems are no different from any others (Stevens 2006a for references). The phylogeny-based classification used here conveys aspects of our knowledge about the phylogenetic relationships about land plants. Thus a family is clearly flagged as such and is a monophyletic group that can contain several genera, also flagged as such and also for the most part monophyletic, but a genus can never include families. Generic, family, etc., names are simply words we use to denote appropriate parts of phylogenies and convey some, if rather minimal, information about their relationships. The circumscription of individual taxa may as much reflect the fact that the birthplace of contemporary systematics was Europe and North America, not the Antipodes (e.g. Walters 1961), rather than any philosophical ideas of the authors describing these taxa (some nineteenth century botanists were aware of this local, eurocentric bias).
Here I use what may be called a flagged hierarchy (Stevens 2006a) for naming taxa. The only thing that the rank terminations used (-ales, -aceae, etc.) suggest is relative inclusion relationships of groups of species in the local hierarchy. If we are talking about a monophyletic group, say Ericaceae, and Vaccinioideae are mentioned, then the latter must refer to a clade contained within the former, but neither is necessarily comparable with Polemoniaceae and Cobaeoideae, or any other family-subfamily combination. All are monophyletic groups (I hope - but see below), but that is all they have in common. Taxa at the same rank are equivalent only by designation and have nothing necessarily in common other than their monophyly. Rank as used here has no meaning other than signifying a monophyletic group that includes other monophyletic groups with appropriately subordinate rank terminations. Even sister taxa have only age and their immediate common ancestor and its ancestors necessarily in common. The non-equivalence of taxa at the same rank has long been clear (Darwin 1859; see also e.g. Stevens 1997; Bertrand et al. 2006; Giribet et al. 2015 and references).
A flagged hierarchy is useful in memory and communication (e.g. Stevens 2006a). It improves memorization by tapping in to the hierarchical structure of language (an extension of the noun-adjective structure of binomials); emphasis on genera, families and orders, as here, is simply a didactic device. Most genera, families, orders, etc., are monophyletic and for the most part include largish groups of species, they are units useful in communication that are used in general conversation by biologists and others world-wide.
However, Valentine and May (1996) describe the Linnaean hierarchy as an example of an aggregative hierarchy with emergent properties at the different levels, while they dismiss phylogenies as being simply positional structures lacking emergent properties (for hierarchies, see also Eldredge 1985; Salthe 1985). Indeed, in the past rank terminations denoted absolute rank, the classification of Linnaeus (at least in theory) at the level of genus and species being such an example. Classifications where rank is absolute, taxa at the same rank being legitimately comparable entities and with particular (kinds of) properties, are class hierarchies in the strict sense (Stevens 2002, 2006a).
Taxa at the same rank are unfortunately still sometimes treated as if they were equivalent by those attempting to understand evolutionary or biogeographic problems (e.g. Ricotta et al. 2012), and some still worry about the properties of genera (e.g. Strand & Panova 2014). Hendricks et al. (2014) discuss the (mis)use of the generic rank by palaeontologists. Interestingly, in the middle of the last century, at least, taxonomists thought that the rank of genus was more "real" or "natural" than that of species (Anderson 1940), although a recent (rather small) survey suggests that opinions have flipped, with most thinking species are more "real" (Barraclough & Humphreys 2015). However, discussions about "reality" seem somewhat passé. However, the focus of this site is not on species and speciation. The general lineage species concept (de Queiroz 1998, 2007) and its descendants, which - not for the first time - makes a clear distinction between species concepts and the criteria that can be used in connection with particular concepts. Indeed, coupling uncertainty over what might be an individual (e.g. a dandelion; different parts of an ancient clone that now have no organic connection), and the availability of massive amounts of data (e.g. Novikova et al. 2016: Arabidopsis and relatives), means that deciding on the limits of species, let alone applying concepts such as monophyly there, is becoming increasingly difficult (see also e.g. Freudenstein et al. 2016b).
Understanding the distinction between grouping and ranking is important. We can both agree that there is a genus Acer, yet disagree as to whether it should be in Aceraceae or submerged in Sapindaceae. Although from one point of view such disagreements are utterly trivial, they still exercise the botanical community, perhaps because this is more a matter of personal preference than anything much to do with whatever science might be (e.g. c.f. Heenan & Smissen 2013 - splitting Nothofagus and Hill et al. 2015 - lumping it; Schuettpelz et al. 2018 - splitting ferns, and Christenhusz & Chase 2018 - lumping them; Ohashi et al. 2018 - splitting Desmodium).
Turning more specifically to phylogenetic classifications in general, and to the classification used here, Backlund and Bremer (1998) provide a very useful discussion on the principles of phylogenetic classification that is applicable at all levels apart from species. The zoologists Vences et al. (2013) list classificatory principles that are largely in agreement with those of Backlund and Bremer (1996) and those followed in this site, although Vences et al. are less concerned about having small genera that I am, perhaps partly because the groups on which they focus are small compared to those under discussion here. See also Stevens (1998), Albach et al. (2004), Entwisle and Weston (2005), Pfeil and Crisp (2005), Pauly et al. (2009), Bateman (2012 and references), The Legume Phylogeny Working Group (2013b), Y. Tang et al. (2015: pp. 22-24), Giribet et al. (2016), Kadereit (2017), etc., for additional discussion, and many other papers are mentioned in the "Classification" section at the end of each family discussion.
The discussions in these and other papers can be applied to supraspecific classifications in general. Whereever possible taxa that are recognised formally should be monophyletic, that is, they should include all and only the descendants of a hypothesized common ancestor (but see below). However, this does not indicate which particular clades we might wish to name as families, genera, etc., and talk about in general conversation: If a well-supported hypothesis of monophyly is a necessary prerequisite for a group to be named, it is not a sufficient prerequisite (but c.f. the PhyloCode - Cantino & de Queiroz 2006). Additional criteria should be invoked. Other things being equal, it is helpful if 1, taxa recognised formally are easily recognizable, 2, groups that are well-established in the literature are preserved, 3, the size of groups is taken into account, numerous small groups having little to recommend them since individually they summarise little information and tend to clog the memory, while groups that are too big may be amorphous, and 4, nomenclatural changes are minimized, the accessory criteria of Backlund and Bremer (1998). The broader phylogenetic context should also be taken into account, that is, formal recognition of a taxon may necessitate the recognition of additional taxa at that rank if, for example, there are several pectinations immediately below the taxon of immediate interest that will also have to recognized at the same rank (e.g. Stevens 1998). Examples are the three or four small families needed in Dipsacales if Valerianaceae and Dipsacaceae are maintained (see the broadly-drawn Caprifoliaceae) and the proliferation of genera caused by the piece-meal dismemberment of Spermacoce (Rubiaceae). This emphasizes that it is essential to understand the general phylogeny of a group before changing names in it (Bateman 2012; see Y. Tang et al. 2015 for an example), and this should be considered another accessory criterion.
The accessory principles of Backlund and Bremer (1998) should be used in combination. Thus keeping the monogeneric Platanaceae separate from their sister taxon, Proteaceae, is justifiable: Both are much-used names that signal well supported, well defined and easily recognisable groups that have long been recognised as distinct, and both have several synapomorphies and do indeed look very unlike each other. Combining the two because Platanaceae are small (one genus, ca ten species) would yield a clade with few obvious apomorphies, not to mention the fact that Nelumbonaceae should by the same logic (it is also monogeneric, with two species) also be included in the expanded family... On the other hand, it is difficult to justify the continued recognition of Callitrichaceae or Hippuridaceae, monophyletic and distinctive although they may be. If they were recognised, several poorly characterised clades would also have to be carved out of Plantaginaceae in any classification that aimed to convey a comprehensive view of the world's flora (see also Pfeil & Crisp 2005; Orthia et al. 2005; Albach 2008, etc., for good practical discussions of such matters). But there are no absolute guidelines. Podostemaceae are sister to Hypericaceae, previously in Clusiaceae s.l.. The dismemberment of Clusiaceae s.l. allows the continued recognition of the very distinctive and long-established Podostemaceae, and Hypericaceae, Calophyllaceae and Clusiaceae s. str. are all characterizable and of moderate size; the recognition of Rafflesiaceae is a similar example. A somewhat similar situation: If Lemna and its relatives are a clade sister to most other Araceae, "should" they be recognised as a separate family? Gymnostachys, a phenetically fairly distinctive taxon, as well as the less phenetically distinct Orontioideae would also have to be recognised as a separate families, too, and Araceae in a somewhat more restricted sense would indeed be morphologically more coherent, although not greatly so, yet not so notably distinct... Unfortunately, it is in the very nature of such decisions that they are ultimately somewhat arbitrary and unsatisfactory.
To summarize: The emphasis here is on establishing consensus classifications, and on classifications as simply being a means to an end, not an end in themselves. It is the phylogeny that is central, and the names in classifications are simply names attached to larger or smaller branches of the phylogenetic tree that facilitate our discussion about larger or smaller parts of that phylogeny.
It is sometimes suggested that taxa at the same rank should have a similar level or number of morphological and/or molecular differences, or be based on similar characters, or be similar in these respects to taxa elsewhere in a larger clade under consideration (Heenan & Smissen 2013 for an example). This will lead to name instability if used in taxa where such criteria had not previously been used, especially if attempts are then made to extend such criteria beyond the local group of interest, and they may also imply that taxa at the same rank are equivalent and comparable in a biological context (for an example, see Fritsch et al. 2008). Of course, equivalencies in morphological and/or molecular differences are unlikely to be fixed (new apomorphies will continue to be found, or what were thought to be apomorphies turn out not to be) or easily quantifiable. It has also been suggested that rank could reflect the age of the clade (e.g. Hennig 1966; Holt & Jønsson 2014; Eguchi & Tamura 2016). However, dating clades is still a very difficult enterprise (see below), furthermore, if age were introduced as a ranking criterion, huge - if arguably only temporary - disruptions to the names that we use would result, certainly if this criterion were used across all of life. In this context, some recent proposals which invoke the use of age in classifications focus more on providing a standardized timeclip, i.e. set of letters referring to a particular geological period, that could be added to a conventional taxon name (Avise & Mitchell 2007; Avise & Liu 2011; see also Vences et al. 2013).
There is a useful distinction to be made between crown and stem groups. The former are the monophyletic groups that include the extant members of a particular clade and their immediate common ancestor (see the figure below), and also any fossils that can be placed in this part of the tree. The groups characterized in this site are such groups. Thus Proteaceae are crown-group Proteaceae, apomorphies like the single carpel, four-merous perianth, etc., being found in their common ancestor. Stem groups, on the other hand, include all the members of that clade below the crown group to immediately after its split from its sister group - and all branches (which have fossil representatives only, of course) of this part of the tree. Thus stem Proteaceae would include everything after the split from its sister group, Platanaceae, but they would not be the Proteaceae of these pages. Obviously, most of the organisms in that part of the tree are unknown, only a few fossils being placed there, and it is also not known when/where particular apomorphies of crown group Proteaceae evolved along this branch. In the case of the stem group of angiosperms, not only is it largely unknown and probably well over 100 m.y.o., but almost certainly most of the organisms to be placed along it will be gymnospermous.
Problems with an emphasis on monophyly can be caused by reticulation events such as endosymbiosis (Gray & Archibald 2012 for literature), lateral gene transfer (see e.g. Dunning Hotopp et al. 2007; Boto 2010; F.-W. Li et al. 2014) and hybridization, but the first two are unlikely to be major confusing factors in embryophytes. The major endosymbiotic events that characterize the clade of which flowering plants are a part (and gave rise to chloroplasts and mitochondria) are very ancient and cause no problems for the student of multicellular organisms, indeeed, patterns of subsequent movement of genes from the mitochondrion and chloroplast to the nucleus are phylogenetically interesting. Thus there is substantial lateral movement of mitochondrial genes in particular in some host-parasite relationships such as Tetrastigma-Rafflesia (Xi et al. 2012a; Yoshida et al. 2010), via chloroplasts in grafts (Stegemann et al. 2012: exchange between Nicotiana), and in Poaceae just about any pollen grain at least germinates on grass stigmas, and zygotes are formed in crosses between e.g. Poöideae and Panicoideae (Lausser et al. 2010; Kellogg 2015 and references). Here, too, there are no major problems, providing one is careful, since the genome in such cases seems to be overwhelmingly from one source; such transfers do, however, raise all sorts of interesting biological questions (see Richardson & Palmer 2007 for a summary). There are other wide but inexplicable transfer of mitochochondrial genes to plants like Amborella from its epiphytes (Rice et al. 2013 and references), and even transfers of nuclear genes (Vallenback et al. 2008).
Genome duplications - hybridization is a common cause of them - are common, and are turning out to be a major driver of seed plant evolution. They have occurred at various times during the evolution of seed plants (e.g. Soltis et al. 2009; Litt 2013: summary; Y. Yang et al. 2015; Landis et al. 2018; see also below). Many of these duplications do not currently pose major problems for either phylogeny reconstruction or the adoption of monophyly as the major criterion for groups to be recognised formally in phylogenetic classifications, certainly for genera and deeper clades.
Of course, more or less extensive reticulation can be common between species, and in a number of genera - Medicago is just one example - relationships at the species level are indeed turning out to be highly reticulating (Maureira-Butler et al. 2008). Members of the one species of Senecio and Elymus may have two or more independent origins (Yan & Sun 2012). Thus there are many problems if one tries to apply a concept of strict monophyly to species, and many - but not all - biologists suggest that invoking strict monophyly would be inappropriate (e.g. de Queiroz 1998; Funk & Omland 2003; much of the discussion in Hörandl 2006), and the extent of this problem is becoming generally recognized - it is also common in vertebrates, for example (Pennisi 2016). So why not peg genera to above the level at which such hybridization is common? Would that life were so simple. Hybridization is widespread in Poaceae-Pöoideae-Triticeae and causes intractable problems where well established common usage and the principle of monophyly are likely to remain at odds; many genera here are certainly not monophyletic, being allopolyploids, and the genera are ultimately based on different genome combinations (Dewey 1984; Löve 1984; Brassac et al. 2012: Barkworth 2000 for a history of Triticeae classification; Petersen et al. 2006). There is also extensive reticulation reported within Danthonioideae (Pirie et al. 2009) and Bambusoideae (Triplett et al. 2011, 2014; Y.-X. Zhang et al. 2012; H.-M. Yang et al. 2013). Indeed, there is increasing evidence of old hybridization events elsewhere in flowering plants that at the very least cause discordance between relationships suggested by different genomic compartments, as in Smedmark and Anderberg (2007: Sapotaceae), Fehrer et al. (2007: Asteraceae-Lactucoideae), Y. Liu et al. (2013: perhaps Crepidinae x Lactucinae), Pelser et al. (2008, 2010: Asteraceae-Asteroideae), Tripp et al. (2013b: Acanthaceae-Acanthoideae, Acantheae x Justicieae), Folk et al. (2016: Heuchera area, Saxifragaceae), García et al. (2017: Amaryllidaceae-Hippeastreae). Similarly, incomplete lineage sorting, which one often thinks of as a species-level problem, may also pose problems at surprisingly deep levels (e.g. Jarvis et al. 2014: birds).
Such reports are becoming more common as nuclear genes are increasingly being used, and the chloroplast genome, the mitochondrial genome, morphology, and the nuclear genome may suggest conflicting relationships. Thus there are cases of hybridization in which organellar genomes become introgressed yet the nuclear genome is little or not at all affected (Folk et al. 2016), and there may even have been hybridization between ancestors of the fabid/N-fixing clade and the malvid clade the descendents of which are now represented by the COM clade (Sun et al. 2014: see elsewhere). In Rubiaceae phylogenies based on mitochondrial genome variation question the hitherto accepted three-subfamily classification of the family (Rydin et al. 2017). Since many of the phylogenies discussed here were based initially entirely or mainly on chloroplast genes, warnings against reliance on uniparental markers are very much to the point (Marcussen et al. 2014).
When there are reticulations the principle of strict monophyly as the basis of classification is comprimised. Wheeler (2014) suggests ways of dealing with such situations, at least if they are fairly simple. A taxon A can be recognised that includes all the descendents of its common ancestor, even if one of these is also the result of a hybridization event with a member of its sister clade, clade B, a situation that he dubbed epiphyly (c.f. epiphylly!). This sister clade, B, will then necessarily include all descendents of a common ancestor bar the one involved in hybridization with the member of clade A (= periphyly). Another alternative might be to recognise the clade that results from hybridization as a separate taxon at the same level as the remains of clades A and B - both would then be periphyletic and the "hybrid clade" would be polyphyletic. However, hybridization scenarios may be very complex, for example, in Bambusoideae hybridizations involve members of fairly deep clades that have no extant diploid members (e.g. Triplett et al. 2014 and references). If there is a preponderance of nuclear genes from one of the parents of a hybridisation event in the descendants, it would seem reasonable to place the hybrids in the same genus (say) with that parent. In all cases, clearly describing and justifying the reasons for classificatory decisions is essential, and in cases of extensive hybridization there is a need for consensus, as in Triticeae, where the different classifications make things very difficult for the poor user who has to disentangle what is essentially an unnecessary mess (e.g. Card et al. 2014: p. 102).
Thinking of aspects of number and size of groups recognized, findings in ethnobiology and cognitive psychology can be used to suggest that a moderate number - probably fewer than 500 - of families is a reasonable goal at which to aim, and that groupings of taxa throughout any system should be rather small in size (e.g. Berlin 1992; Stevens 1994, 1997). Major systems such as those of Linnaeus and Bentham and Hooker were constructed explicitly to ease the burden on the memory, the latter in particular ensuring that all groups in their classification were relativeley small, often containing three to eight immediately subordinate taxa - but by no means all their groups were formally named (Stevens 1997, 2002; see also Scharf 2007; c.f. in part Vences et al. 2013). Along the same lines, Burtt (1977b) suggested that the number of names at any rank should be at most one third of those at the immediately lower rank - and monotypic taxa might not need formal names (Burtt 1977b). Consistent with such ideas, a fairly broad view of families and orders is taken here, as in the APG system, whenever the constraints of monophyly and other criteria used when constructing classifications (see above) permit. Indeed, there is a principle in evolutionary classification that is quite useful here: The size of the gap between two groups tends to be inversely proportional to the sizes of the groups involved (Davis & Heywood 1963). In some situations a large group is formally divided even although the distinguishing characters of the two parts are weak, whereas a smaller group that is similarly divisable may be left intact.
Takhtajan (1997) suggested that smaller families are more "natural" than larger families. This is incorrect. Although "having more features in common" is one common meaning of "more natural", monophyletic groups that include fewer taxa - Takhtajan's smaller families, for example - do not necessarily have more apomorphies than larger groups, although all members of smaller groups are likely to have more features in general in common, since they will have more plesiomorphies - features that are apomorphies of the larger clades of which they are parts - in common. A group of any size may have apomorphies, and any inverse correlation between size of group and apomorphy number is at best weak - think of all the apomorphies of angiosperms and monocots. However, unreversed apomorphies are more common in smaller, less inclusive clades. But by this kind of argument all families should be very small, since then their members will have a great deal in common, and so will be most "natural".
The slippery slope is obvious. As families (for example) are split, the relationships evident between the segregates and that were responsible for their being placed in a single family in the first place will seem to necessitate the recognition of a new order, etc., as is evident in Takhtajan's own work - general taxonomic inflation is the result (see also comparable suggestions in a cladistic context for Brassicales in particular - Ronse de Craene & Haston 2006). Such splitting is also questionable when teaching and learning families, since the student needs to understand the system as a whole. Nevertheless, for some genera removed from the families that until now have included them, the phenetic-classificatory-phylogenetic structure in their new home may mandate the recognition of small families. Note that Takhtajan's suggestion that narrowly defined families (better: taxa in general) are more useful for phylogenetic studies may be true, but this is for other reasons and is a separate issue. Indeed, I have more than once regretted prematurely combining groups, whether species (in the context of monographic work) or families (in the course of preparing these notes).
There are ultimately no reasons other than convention or convenience why any group should not be segregated into several smaller monophyletic groups, or merged to produce a larger unit; we can talk about one large thing, or about several smaller things. Here I follow van Steenis (1978), Philipson (1987b), and others who have questioned the utility of splitting a group when ideas of the relationships of its constituent members have not changed - that is, very good reasons have to be provided for splitting a family if the genera within it remain part of the same clade, rather than belonging to another clade. Thus A.P.G. (2003) broadened the circumscription of Malvaceae because of the para/polyphyly of some of the families that had historically been associated with it (Judd & Manchester 1997; Alverson et al. 1999; Bayer et al. 1999). These families, particularly Tiliaceae and Sterculiaceae, were not at all easy to distinguish, their close relationship had long been recognized (see e.g. Brown 1814), and to some workers, at least, their combination was something of a relief. Although most of the larger clades within Malvaceae s.l. remain difficult to distinguish, even with flowers, Cheek (2007) opted for their wholesale and novel dismemberment into ten families; the "very good reasons" for doing this are wanting.
The same principles are of course applicable when it comes to dividing genera. Groups in which there is substantial disagreement over taxon circumscription despite basic agreement over classificatory philosophy include Orchidaceae, Asparagaceae-Scilloideae, Rubiaceae-Spermacoceae. However, little other than a headache is gained by splitting genera such as Drosera and Gnetum as has been proposed (e.g. Doweld 2000). Indeed, even if there is well supported phylogenetic structure within a genus, this is certainly no signal that two or more genera have to be recognized, even if the new clades are consistent with morphology and had been recognized as genera some time in the past. Thus I think that the dismemberment of Pterostylis (Jones & Clements 2002b) was somewhat unfortunate (see Janes & Duretto 2010 and Schuitema & Adama 2011 for something close to a return to the status quo ante); that a previously unnamed clade is morphologically distinct and has synapomorphies does not necessitate its formal recognition (c.f. Clements et al. 2011). Along the same lines, if a newly-discovered taxon is sister to an existing named genus, this does not mean that a separate genus is needed for the newly described species (c.f. Davis 2002; Léveillé-Bourret et al. 2017). Humphreys and Linder (2009) provide a well-documented survey of generic concepts in plants which the reader should consult; they note that generic limits (broad versus narrow) have oscillated historically, and that currently larger genera tend to be recognised because studies tend to be on a broader scale than in the past - and broad limits are my preference, too.
Note that invoking "similarity" or "difference" - whether qualified ("considerable similarities", "substantial differences") or not - in a cladistic context as justification for combining or splitting taxa is not a particularly strong argument (see e.g. Cardiopteridaceae/Stemonuraceae - Kårehed 2002c). Similarity and difference cannot be defined precisely, since what may seem to be substantial similarities to me, may not to the next person, nor are they likely to be stable, our knowledge of morphology and what might be synapomorphies being in a stste of flux.
I might have prefered to merge some families recognised here or split others, but by and large I do not think my own preferences matter very much - and I take the same position with regards to comparable preferences expressed by others. Indeed, the bottom line is that in flagged hierarchies of the kind used here, the limits of any monophyletic unit generally taught and discussed can be established only by convention and consensus (e.g. Stevens 2002, 2006a; Entwisle & Weston 2005; James & Duretto 2010). Given the increasing molecular and morphological support for the outlines of angiosperm phylogeny detailed in these pages, a stable classification based on this phylogeny seems attainable (e.g. Reveal 2011). Indeed, in addition to providing current ideas of relationships of seed plants in a synthesised form, this site is part of an attempt to build such a consensus over taxon circumscription (see A.P.G. 1999, 2003, 2009, 2016; Grass Phylogeny Working Group 2001, 2010; Mabberley 2017: also Hibbett et al. 2007 for a good example in fungi).
Reaching a consensus is important, since what we know of angiosperm phylogeny allows a very large number of classifications to be based on it, and unfortunately, "nature" does not dictate what the classification should be. All classifications are constructed by humans to communicate particular aspects of groups, their characters, and relationships, they are means to an end, not an end in themselves - in this context, the papers by Bob O'Hara (e.g. 1988, 1992, 1993) are still very much to the point. Our goals as systematists are surely to produce robust hypotheses of relationships, to understand the evolution of morphology, and the like - having a tree allows onr to think about the evolution of evolutionary novelties or apomorphies, and how or even why homologies change over time, in a way that was previously very largely impossible (O'Hara 1997; Wagner 2015). However, to argue ad nauseam whether something "should" be a family or a subfamily, or whether a genus, known to be monophyletic, should be split, or a group of genera, all monophyletic, should be merged, is largely a waste of time. It is not simply that it becomes difficult to read literature in which different taxon limits have been adopted, or different names used for the same clade, but the unintended consequences can be considerable. Thus the acceptance of generic name changes in some Orchidaceae-Epidendroideae in Pridgeon et al. (2005) meant over 10,000 changes to the names of orchid hybrids which had to be transferred to a hybrid genus other than the one in which they were originally registered (Royal Horticultural Society 2008) - note that here something did have to be done about generic limits. Changing names unnecessarily surely leads to madness, and worse: The discredit of our discipline.
There are similar issues whatever naming system is used. Thus in phylogenetic naming (Baum et al. 1998 for an example, but c. f. Baum et al. 2004; for the PhyloCode, see Cantino & De Queiroz 2006) an unflagged hierarchy is used in which terminations of names used are uninformative about the relative position of taxa. Such unflagged hierarchies have serious deficiences as communication devices. They lack one aspect integral to classifications, whether of vehicles, of organisms by local people, or of organisms in the context of a phylogeny - they contain no intrinsic information about the relationships of the group in question to any others (e.g. Pfeil & Crisp 2005; Stevens 2006a). Recent suggestions for using prefixes like "Apo-" and "Pan-" to PhyloCode names will, however, allow limited information of this kind to be conveyed, but only as it pertains to individual branches, and current proposals do not even mandate that such prefixes be employed consistently. In any event, such proposals simply prevent the tripling (or substantially more) of the number of quite different names that could be used to describe different aspects of the one phylogenetic tree. In general, where n is the number of extant species in a group, the number of clades in such a group = n-1. Other dodges that may help include the use of prefixes and suffixes like Holo-, -formae, -morpha, Eo-, Eu-, and Neo-. (Species will also need names, too; for their names, see e.g. Dayrat et al. 2008; current PhyloCode thought is to leave the naming of species alone for the time being. However, for practitioners of the phylocode the species epithet is the only part of the name that must be used.) Of course, lumping and splitting mercifully have no nomenclatural consequences in phylogenetic naming and the circumscription of clades associated with names remains largely invariant; any well-supported clade can be named without affecting the name of more or less inclusive clades. How the PhyloCode will deal with reticulation is unclear.
Consensus over the clade names commonly learned by students, used in herbaria, and, importantly, used by the general biological community will still be needed, otherwise communication will be impeded. Alternative classifications are an impediment both in bioinformatics and in general communication (e.g. Vorontsova et al. 2015). PhyloCode names in themselves contain no guidelines as to which names should be chosen. But the situation is still more complicated. Terminations that convey ideas of rank in a phylogenetic classification can also be used as PhyloCode names - the use of such names is neither encouraged nor discouraged - however, there they will carry no implications of rank. How they will be used is another matter.
Of course, there are other ways of constructing classifications, and some still prefer evolutionary classifications (see e.g. Ann. Missouri Bot Gard. 100(1-2). 2014 - somewhat dated, c.f. Schmidt-Lebuhn 2010; Grímsson et al. 2017c and Bomfleur et al. 2017: fear of "ancestors"?). There classificatory principles differ substantially from those followed here, e.g. the recognition of paraphyletic taxa is allowed. Evolutionary classifications in general try and combine phylogeny and morphological gaps, although that is no easy thing to do - it is akin to combining chalk and cheese, the result being of little use either for blackboard work or for eating. For attempts to make this impossible task seem more objective, see Stuessy and König (2008) and in particular Willner et al. (2014), although from the comments made throughout this site it is clear that using such classifications as a source of apomorphies, as in Willner et al. (2011), is a burden they cannot bear. Some (e.g. Thorne 1976) suggested that the sizes of gaps between groups at the same rank should be similar, but any principle like this is inherently flawed (see also above). The basic problem with paraphyletic groups is that users do not generally expect a family to be nested inside another family, a genus within another genus, etc. This flouts the basic principles of classifications in general as evident in folk taxonomies and the general expectations of language. Thus when I was working on Ericaceae in the early 1970s it was thought that they were very uncommon in Australia, although it was noted that the related Epacridaceae were very common there, and this caused some bemusement; why were there so few Ericaceae in Australia? Given that Epacridaceae are now nested well within Ericaceae (see Epacridoideae), the largely language-driven biogeographic problem has vanished.
Systematists of all persuasions may invoke ideas of "nature" and of "natural groups" when delimiting groups (e.g. Stuessy & König 2008; Stuessy 2010). Unfortunately, the ultimate meaning of "natural" has long been "a group of the kind [usually unspecified] that I think should be recognised", so any classification competing with yours becomes ex ipso facto unnatural (e.g. Bather 1927). Its use is not helpful since one has first to find out what its user means; in general, the invocation of "nature" or "natural" in such contexts tends to preclude sensible discussion. I prefer not to use the word.
For summaries of commonly used systems, see Brummitt (1992) and Mabberley (2008). New and largely independent systems still come out every couple of years or so, e.g. Takhtajan (1999, 2009), Doweld (2001), Wu et al. (2002), Goldberg (2003), Shipunov (2005 et seq.), Thorne (2007; very elaborate), Heywood et al. (2007), Reveal (2012: also elaborate, no justification), Willner et al. (2014), and on it goes. However, even those who allow or promote the recognition of paraphyletic groups (e.g. Grant 2003; Thorne 2007; Heywood et al. 2007) may find a system recognizing only monophyletic groups of some interest - it can provide a rather different understanding of evolution.
The bottom line is that with a stable hypothesis of phylogeny, a classification based on that phylogeny can (must) be stable, too. We can then get on with our work, that is, testing the structure of phylogenies we have, elucidating phylogenies in areas where relationships remain unclear, studying the evolution of morphology, understanding eco-physiological changes, describing species, etc.. In this context, the spread of the Angiosperm Phylogeny Group system (see below) and its widespread use in technical literature, also floras (e.g. van der Meijden 2005), dictionaries (Mabberley 2008), textbooks (e.g. Judd et al. 2016), more general and popular literature (e.g. Souza & Lorenzi 2012 - the third edition; Spears 2006; Hilger et al. 2010), and in herbaria (Haston et al. 2007, 2009) is gratifying. Returning to Godfray and Knapp's (2004) users of classifications who want a stable, informative and accessible classification that enables easy identification - essentially, they want cake with everything - these pages attempt to satisfy as many of their needs as possible, but without compromising the primary - if sometimes tempered - classificatory principle, that of the monophyly of the groups recognized.
On this classification in particular.
It would be impossible even to think about a higher-level classification such as this without the advances in our understanding of relationships made by the phylogenetic analyses of molecular data carried out over the last twenty five or so years. These are coupled with morphological studies that are evaluated in this new phylogenetic context. For the dramatic changes in this area, see, for instance, the pessimistic attitude about orders in Davis and Heywood (1963: 107-108); "The most unsatisfactory taxon in Angiosperm classification", they were "indefinable", their circumscription was not fixed, etc.. Families, they thought, were likely to be the largest "natural" unit recognizable in flowering plants. Art Cronquist's magnum opus, his An Integrated System of Classification of Flowering Plants, came out in 1981 just about the time that molecular studies were becoming common. Of the 63 families in his Liliopsida or monocots, there have since been additions to and/or deletions from 17% of them, while of his 19 orders, only one, Zingiberales, is recognized today with the same circumscription. In his Magnoliopsida or dicots there have been additions to or deletions from 43% of his 242 families, while none of his 33 orders has the same circumscription today. Families like Liliaceae, Smilacaceae and Grossulariaceae have suffered the most, genera that he included in these families now being found in 10, 7, and 7 other places respectively. Obviously, although he was interested in evolution, how he formed groups and recognised relationships (evolutionary trends were important to him) were quite different from how this is done today, and he was well aware that families like those mentioned ended up as the resting places for genera of uncertain affinities because they were already so heterogenous that adding other genera made little difference to the family description. It is interesting that his groupings have fared rather better in monocots than in "dicots".
Here I very largely follow the most recent version of the Angiosperm Phylogeny Group classification (A.P.G. IV 2016). Any differences are not to be interpreted as differences in principle, simply that new phylogenies continue to be published and that this site is designed to provide an overview of current ideas of higher-level relationships of all seed plants. The Angiosperm Phylogeny Group classification is based on relationships evident in the numerous molecular studies that began to appear in the late 1980s, much of it based on analysis of sequences of chloroplast markers (see A.P.G. 1999 for the principles underlying the classification), and the major outlines of the trees used by A.P.G. II (2003) or even A.P.G. I (A.P.G. 1999) have not had to be changed. There have been only minor changes in the composition of the orders, even if the odd genus or even family is turning out to be seriously misplaced - examples are Hydatellaceae (from monocots-Poales to Nymphaeales: Saarela et al. 2007), Guamatelaceae (from Rosaceae to Crossosomatales: Oh & Potter 2006), and Perrottetia and Bhesa (from Celastraceae to Huerteales and Malpighiales respectively: Zhang & Simmons 2006). The main changes have been clarification of the relationships of unplaced campanulid families (Winkworth et al. 2008) and of genera placed in the old Icacinaceae (see Stull et al. 2015 and euasterids). However, relationships within many of the larger orders are still unclear in part, and as nuclear genes become used more often, there may even be changes in the very scaffolding of the A.P.G. tree (for intimations of what may be in store, see Sun et al. 2014 and rosid relationships). Recent confusion has been introduced by the results of a survey - largely of taxonomists, it seems - that would suggest changes to family level circumscriptions in the absence of changes of phylogenetic relationships (Christenhusz et al. 2015; c.f. Cole 2015), which would seem to fly in the face of common sense, at least if stability of names is of any interest - and if it is not, then surely for some the Phylocode may seem a better way to go.
Phylogenies are shown as tree-like representations, although as mentioned above some phylogenies are more or less reticulating. Most trees have been more or less ladderized, which means showing the smaller (in terms of numbers of terminals) sister taxon first at every node, and the sequence in the text follows that along the top of these trees. (Since phylogenetic trees are like mobiles, the only fixed points being the nodes, there are innumerable other ways to construct a sequence - e.g. Hawthorne & Hughes 2008.) The result is that the trees here tend to be pectinate. When reading a book or following a herbarium sequence, pectination, interpreted carefully, has its value (see Haston et al. 2007, 2009). As one reads the terminals of a pectinate tree left to right, adjacent terminals will tend to be separated by a minimum number of apomorphies. Nymphaeales and Austrobaileyales are here adjacent on the tree, but they could be separated by hundreds of families in another arrangement without offending any relationships; if adjacent in a book or herbarium, then it is relatively easy to relate their apomorphic characters, but if separated by hundreds of pages, or two floors in a large building, then it is less easy to get anything immediate from the sequence. (Since all orders on this site are preceded by the apomorphies of all the nodes immediately below them in the embryophyte phylogeny, and because of the linkages that have been built in, this particular problem does not arise; it is possible to read off all the apomormorphies within the embryophytes of all terminals.)
The best herbarium sequence based on particular phylogeny would maximize the number of taxa that are successive branches of the tree and are also consecutive in the sequence. Since specimens are generally filed under families, the outline of a new family sequence for arranging herbaria can now be suggested (Haston et al. 2007, 2009: note that numbers of species in a terminal ultimately governs the sequence; Waern et al. 2013). This will further help teaching and learning about plants. There are other cognitive and perceptual issues to think about when drawing trees (e.g. Novick & Catley 2007; Novick et al. 2012); thus wind-blown trees with all the lines oblique, as in the trees here, may unfortunately be less immediately comprehensible than erect trees with all lines either vertical or horizontal.
In cases where the Angiosperm Phylogeny Group (2003) suggested alternatives as to the limits of families, e.g. Papaveraceae in the broad sense or Papaveraceae, Pteridophyllaceae and Fumariaceae, Proteaceae in the broad sense or Proteaceae and Platanaceae, the choices made in A.P.G. III (2009) largely follow common usage, as in textbooks like Judd et al. (2007) and Simpson (2006), and particularly in the successive editions of Mabberley's The Plant Book, and Mabberley (2008) in particular was an attempt to reflect a consensus, the result of taking the opinions of botanists at several meetings. A.P.G. III (2009) thus dispensed with alternative classifications, and reasons are given for the choices that are made there. For many the existence of alternative classifications simply confused, so reaching an agreement over which names/groupings to use when alternatives are possible seemed desirable.
A.P.G. IV (2016) and this site continue to attempt to reflect consensus and stability, in particular, they are guided by the important principle, that if relationships do not change, then names should not. Major textbooks and other general treatments remain largely in line (e.g. Judd et al. 2015; Soltis et al. 2017). Some names or authors of names may be incorrect according to the latest findings of bibliographic sleuths, but I wait until activities in this largely inconsequential area stop before making changes - indeed, these are often negated by subsequent findings. For a very useful compilation of suprageneric names, see Reveal (2000 onwards), for a welcome proposal to stabilize familial and generic (and specific) names, see G. F. Smith et al. (2017), and for superordinal names, see Chase and Reveal (2009).
As mentioned above, higher-level relationships in general, and the composition of orders in particular, long presented something of a challenge to systematists (e.g. Davis & Heywood 1963), and the limits of orders have been far more labile than those of families. The composition of clades like Apiales, Crossosomatales and Pandanales is decidedly unexpected, however, these higher level clades are generally accepted even in works with different classificatory philosophies (a good example is Heywood et al. 2007). However, within clades like Malpighiales, Asparagales, etc., attempts to find distinctive characters have largely failed (but see Endress & Matthews 2006a). But as with families, groupings suggested by molecular studies may be supported by morphological and/or chemical characters. Thus Crossosomatales are characterized by the anatomy of their seed coats, while distinctive chemical similarities between Pittosporaceae and Apiaceae/Araliaceae have long been known (Hegnauer 1969b for references). As our knowledge of morphology and chemistry improves we can hope for gradual improvements in clade characterisations and the detection of apomorphies at all levels.
The discovery of the relationships of parasitic and aquatic groups in particular have presented a challenge to systematists. Morphologically, some of these plants are so highly modified that interpretation of the plant body in conventional terms is difficult or even impossible. Thus parasitic groups such as Rafflesiaceae are hard to place since both the vegetative body and the flowers are changed almost beyond recognition (see Naumann et al. 2013: comprehensive study of parasitic taxa and their evolution). The same is true, although to a lesser extent, in many plants pollinated by wind, etc.; a variety of characters may be affected, and wind-pollinated plants have independently acquired similar modifications. Morphological change is also extensive in mycoheterotrophic/sapromycotrophic (Leake 1994) and carnivorous groups. Again, in the latter there are limited number of morphologies (e.g. pitcher leaves, sticky glandular hairs) involved in the carnivorous habit, and there may even be convergence at the molecular level, as in the convergent evolution of digestive enzymes (Fukushima et al. 2017), while mycoheterotrophic monocots from three orders clustered together in an analysis by Lam et al. (2018) because of long branch attraction in the molecular data.
In molecular analyses of parasitic groups, plastid gene sequences may be difficult or impossible to obtain, the chloroplast DNA in particular experiencing extensive gene loss. Gene loss seems to have occured in parallel (independently in the different groups), yet a core of functional genes remains, probably because of functional constraints (Delannoy et al. 2011). In parasitic plants the rate of molecular change in general is often high (e.g. Duff & Nickrent 1997; Nickrent et al. 1998; Caddick et al. 2002a; G. Petersen et al. 2006b; Barkman et al. 2007; Bromham et al. 2103), and the inclusion of parasitic taxa in molecular analyses can cause conniptions (e.g. Nickrent et al. 2004; Davis et al. 2004; Chase et al. 2006; G. Petersen et al. 2006b), not least because there can be horizontal transmission of genes (e.g. Davis & Wurdack 2005: Vitaceae to Rafflesiaceae; Barkman et al. 2007: the mitochondrial atp1 gene commonly moves) including the the invasive cox1 intron (Barkman et al. 2007).
Progress is being made: placements for Rafflesiaceae, Mitrastemonaceae, Cytinaceae and Cynomoriaceae have been suggested (Barkman et al. 2004; Davis & Wurdack 2004; Nickrent et al. 2004; Davis et al. 2007; Bellot et al. 2016), and although it seems likely that Burmanniaceae s.l. are polyphyletic, the parts are all to be located in Dioscoreales (Merckx et al. 2006, 2009a, 2010a). Similarly, in the aquatic habitat neither vessels in particular nor much xylem in general is needed; leaves are highly modified; and water-mediated pollination, if adopted, may well be associated with major changes in floral morphology. Here, too, molecular studies suggest that aquatic groups with hitherto problematic relationships may find homes. Thus Podostemaceae are close to Calophyllaceae and Hypericaceae (Malpighiales: e.g. Kita & Kato 2001; Ruhfel et al. 2011, 2013), Hydatellaceae, which used to be in Poales, are part of Nymphaeales (Saarela et al. 2007), and Hydrostachyaceae have a similar relationship with Cornales (Xiang et al. 2002; Schäferhoff et al. 2010; Xiang et al. 2011), although they have been placed in Lamiales (Burleigh et al. 2009). In the first two cases in particular there are morphological and chemical features that support the movse. Very different placements of Ceratophyllaceae have beeen suggested over the years, unfortunately, its relationships still remain uncertain (e.g. Doyle et al. 2015; Kvacek et al. 2016 and references). If Ceratophyllaceae are sister to eudicots (e.g. Moore et al. 2007), they are so derived that there is no morphological evidence of which I am aware to support this relationship. Needless to say, as relationships for such wind-pollinated, parasitic, etc., taxa, are clarified, it changes one's understanding of evolution, both of the groups themselves (e.g. Hardy & Cook 2012)and the clades in which they find new homes.
In many cases the "new" family limits of the Angiosperm Phylogeny Group (see A.P.G. 1999, 2003, 2009, 2016) are not really controversial, although changes from the limits commonly accepted only a decade ago are sometimes quite dramatic (e.g. Wagenitz 1997). Thus the split of the old Saxifragaceae s.l. is necessitated by its extreme polyphyly, as also with Icacinaceae and Cornaceae. Although such groups had long been considered unsatisfactory, there had been no compelling evidence allowing the user to prefer one circumscription over another; now there is. It is generally accepted that the limits of Lamiaceae and Verbenaceae have to be redrawn, and the content of the two has changed considerably, and they are now easier to identify than before. Hence the decision to recognise the recircumscribed families is not difficult, and if they were not kept separate, much of Lamiales would collapse into a single family. The same is true for Salicaceae and Achariaceae (Malpighiales), two previously small families that have received the bulk of the old Flacourtiaceae. Clade and hence taxon limits do remain difficult in some Malpighiales, but even there clade support is strengthening (Xi et al. 2012b). Euphorbiaceae have presented problems - in the past, one could recognise Euphorbiaceae s.l. quite easily because of their explosively-dehiscent fruits with a distinctive persistent columella. Putranjivaceae, one quite unrelated segregate, are quite distinct, as is confirmed by chemical evidence (presence of glucosinolates/mustard oils), but there are four families that make up the bulk of the old Euphorbiaceae that remain in the same part of the Malpighiales tree (e.g. Wurdack et al. 2004; Davis et al. 2005). However, there is no molecular evidence warranting combining just these Euphorbiaceae segregates, and if they were to come together, the clades in this part of the tree suggest novel groupings not recognised before. Thus Rafflesiaceae appear to be embedded within the single-ovulate ex-Euphorbiaceae (Davis et al. 2007), and there is little enthusiasm for reducing the iconic Rafflesiaceae to synonymy. In the sister clade are biovulate ex-Euphorbiaceae - and the well-known Linaceae. Relationships in core Caryophyllales, especially around Phytolaccaceae (perhaps somewhat surprisingly) and Molluginaceae (less surprisingly) were incompletely understood and refashioning of taxon limits has continued as relationships become more apparent (see e.g. Nyffeler & Eggli 2010; Christin et al. 2011a; Brockington et al. 2011; Y. Yang et al. 2015; Christenhusz et al. 2014). The old Icacinaceae have been broken up (see in particular Stull et al. 2015), however, clade limits in the old Olacaceae, mostly hemiparasites, remain unclear (Nickrent et al. 2010 and references).
Many families in these pages are polythetic at the morphological level, that is, they lack unique features characterizing ("defining") all and only members of that family. They can be recognised phenetically only by the unique combinations of characters that they possess. This is the result of evolution; the synapomorphy characterizing a lineage may be lost or modified beyond easy recognition in some of its members, or the synapomorphy may appear to be identical to a feature that has evolved in parallel in quite unrelated plants. (Taxa in evolutionary classifications are also often polythetic.) Indeed, some families recognized here now include substantial variation as phenetically distinct derived groups are placed in their proper phylogenetic position - examples are the erstwhile Empetraceae, a wind-pollinated group, now included in Ericaceae, and the various derived, small-flowered aquatic and wind-pollinated groups that are included in the overwhelmingly large-flowered and animal-pollinated Plantaginaceae.
In the past, problematic taxa tended to be assigned to groups using a few or even only a single character weighted particularly strongly on a priori grounds. The result was the recognition of conglomerate taxa such as Amentiferae (wind-pollinated), Rafflesiales (parasites), Nepenthales (carnivores), etc. (Cronquist 1981), all of which have been found be highly polyphyletic, and/or the segregation of families like the wind-pollinated Plantaginaceae s. str. (now much expanded) and Leitneriaceae (now in Simaroubaceae). Given the ever-increasing amount and complexity of morphological knowledge and the difficulty of carrying out complex multivariate analyses in one's head, a solution in the days before computers was to emphasize just one or a few characters thought to be evolutionarily important and delimit groups accordingly. Thus the Parietales included a number of families all of which had parietal placentation - and these families are now widely dispersed on the tree.
Indeed, we used to assume that features like highly scalariform vessel perforation plates or complete absence of vessels, or a flower with an androecium that had many stamens, a superior ovary, or separate petals, were necessarily "primitive", and conversely vessels with simple perforation plates, an androecium with few stamens, inferior ovary, or petals that were connnate were almost necessarily apomorphic or "advanced" (but c. f. Stebbins 1951). Such assumptions are incorrect (e.g. Soltis et al. 2005b); for the evolution of vessels in asterids, highly parallel/with reversals, see Lens et al. (2016). Carpels may become secondarily free; carpels may fail to close, the seeds then developing outside the confines of the carpel, as in some Aspagaraceae-Nolinoideae, Violaceae, Berberidaceae, Malvaceae-Sterculioideae, etc.; in Peliosanthes teta, perhaps the only species in Peliosanthes (Asparagaceae-Nolinoideae) the ovary varies from superior to inferior (Jessop 1976: species limits here need investigation!), as in Saxifragaceae, Apiales (esp. Pittosporaceae), Asterales (see Menyanthaceae), Poales, etc.; many-seeded carpels can evolve from few-seeded carpels (Razafimandimbison et al. 2008, but c.f. in part Beaulieu & Donoghue 2013); monoecy may be derived from dioecy (Schaefer & Renner 2010 and references) - and on it goes. Classic studies such as those by Babcock (e.g. 1947) on Crepis that assumed that evolution - in this case of the karyotype in particular - was unidirectional have needed comprehensive re-evaluation (Enke & Gemeinholzer 2008). Most if not all characters have reversed and/or evolved in parallel, even at the level of amino acid substitution, as in the independent acquisition of the phosphoenolpyruvate carboxylase (pepC) gene in C4 photosynthesis in grasses (Christin et al. 2007b; see also Bläsing et al. 2000; Anke et al. 2004; Reimann et al. 2004; Maia et al. 2012; Langel et al. 2010; Irmer et al. 2015; Fukushima et al. 2017; Later et al. 2018; etc.).
Given the suggested relationships of some parasitic and aquatic groups, it can be very difficult to understand how they have evolved from their more morphologically conventional relatives, however, it may well be that our preconceptions as to likely or possible evolutionary change are at fault. With Podostemaceae sister to Hypericaceae, I look forward to seeing hypotheses to explain how the dramatic changes in the vegetative body that have made Podostemaceae so problematic for generations of systematists took place. That conventional wisdom has trouble in understanding or explaining how the morphologies of such groups can be related is largely a problem with conventional wisdom.
Similar problems affect generic circumscriptions. Thus emphasising the importance of floral characters that reflect pollination syndromes when drawing generic boundaries has all too often led to taxa that are highly unsatisfactory phylogenetically (see e.g. Acanthaceae, Bignoniaceae, Campanulaceae, Ericaceae, Melastomataceae, Orchidaceae, etc.) while over-reliance on characters of fruit and seed (see particularly Brassicaceae and Apiaceae) has also led to questionable generic limits. Again, the more general problem has been caused by the use of one or a very few characters that have been weighted a priori to structure classifications (see also García et al. 2009). Working out limits for genera is rightly becoming a major preoccupation as some stability is now evident in higher level phylogenies and taxa. Here the discussions about changes to generic limits in the German flora - which of course have implications far beyond Germany - that have been assembled in Kadereit et al. (2016) make fascinating reading. Again, there are tensions between those who are happy with narrowly-drawn genera, other prefer their genera to be more broadly drawn; some of the changes have already been made, some are definitely needed, but remain to be carried out, and there are yet others where the basic phylogenetic evidence is still incomplete. In some cases piecemeal changes are being proposed based on a local perspective, while in others waiting for the bigger picture to materialize seems the sensible thing to do.
In conclusion. Some families as delimited here may not be easy to recognize, but remember that detecting relationships - use whatever characters you can, even if they are not obvious - and naming a plant - focus on obvious characters that may not reflect relationships - are quite different problems. Taxa, although "natural", may not be readily recognizable, indeed, it was in exactly this context that Lamarck described the basic principles of writing dichotomous keys in 1778. Of course, Lamarck's idea of nature was very different from ours - there was some kind of continuum of form between living organisms, with no major gaps anywhere - and this meant that his genera (for example) might well not be sharply distinct from each other. Perhaps the best way of identifying plants at the family level is by well-made multiple access keys, as in Watson and Dallwitz (1992a onwards: family limits there may differ substantially from those adopted here). Such keys free users from the constraints of dichotomous keys in which particular characters are needed at each step of the identification process before there can be further progress. Instead, whatever characters are evident on a specimen can be used in whatever order is convenient; when linked to illustrations, glossaries, etc., their power is enormous (see Dallwitz et al. 2000  for the principles underlying their construction and use). Nevertheless, dichotomous keys such as those of Hutchinson (1973), Franz Thonner (Geesink et al. 1981) and Cullen (2006) have their uses, although even in the last-named, despite its date of publication, the family limits accepted do not reflect much in the way of recent findings. Furthermore, as Mary Barkworth (pers. comm.) pointed out, the structuring of information in keys may mean that their users can acquire general knowledge about plants - although of course taxa coming out adjacent in keys may well not be at all related, a fact that is all too frequently forgotten. Identification aids are proliferating (e.g. Utteridge & Bramley 2014), but it is easy to make mistakes if you try to go too fast!
(When identifying large numbers of plants, even more efficient than either style of identification, and certainly lots more fun, is sight identification. However, unless one has a photographic memory, one has to build up a good general knowledge of comparative plant morphology, and it is on this the ability to make accurate identifications depends. When faced with an unknown plant, I always look for leaf teeth and stipules and dots of various kinds in the lamina and check leaf insertion; smelling crushed leaves can also be helpful. The short paragraphs added after most families may help in confirming familial identifications. In this context, nodal anatomy can usually be checked using a razor and a hand lens.)
For the record, and for the little that it is worth, there are 4 orders and 13 families of gymnosperms characterised on these pages, and together they include some 82 genera and 947 species. For angiosperms, comparable figures are 57 orders, 446 families, 13,208 genera, and 261,750 species (of which monocots include 11 orders, 89 families, 2,759 genera and 52,760 species). Note, however, that higher mathematics was never my strong point, and anyway these are pretty meaningless figures; even for species, which some might concede smacked slightly more of reality than other taxa, estimates range as high as 422,000 (Govaerts 2001), and perhaps 10-20% more are currently undescribed (Joppa et al. 2010). Indeed, estimates of numbers of species in groups that one would have thought were well known vary widely, two examples being the daffodil genus, Narcissus (estimates in the last five years range from 15-81 species: Marques et al. 2017), and the bee orchid genus, Ophrys (16-252 species: e.g. Bateman et al. 2011a; Vereecken et al. 2011; Sedeek et al. 2014; Alibertis 2015: photographs!, see also below). Recent general estimates also highlight the issue: Christenhusz and Byng (2016) estimate that there are 295,383 species of flowering plants, while 369,434 species is the number in Nic Lughadha et al. (2016). Furthermore, Mallet (2013: p. 690) noted that "species counts over large areas of space and time [i.e., those in these pages!] represent only a sketchy measure of biodiversity, a measure that owes more to taxonomic and metphysical fashion than to science". Metaphysics aside, numbers of genera and species of course change daily. Nevertheless, as emphasized here, families are useful in teaching, we as a community can ensure that their limits remain largely stable, and by concentrating on relatively few of the larger families one can gain some familiarity with much of the world's flora. Generic names are even more used, and again, our goal should be to ensure stability in generic circumscriptions.
ON FORMING CLADE CHARACTERIZATIONS AND THINKING ABOUT APOMORPHIES
[Back to Top]
The organization of the information throughout is hierarchical, that is, character states constant or almost constant in higher groupings are not mentioned at lower levels (see also Stevens et al. 2004b; Kellogg 2015). This is in line with a phylogeny- or tree-based system, yet it has, perhaps ironically, long been seen as being an advantage of many so-called natural systems, even those that owe nothing to evolutionary ideas (e.g. Cesalpino 1583; Jussieu 1789). However, there is much to do to make this style of presentation fully effective. In particular, whether character states more or less constant in a group are synapomorphies for it often waits for further clarification of relationships both within the group and between that group and its immediate relatives. For example, most Annonaceae have stamens with distinctive prolongations of the connective, but Anaxagorea is sister to the rest of the family and such connectives may not be a synapomorphy of Annonaceae, nor may indehiscent fruits and the absence of staminodes (e.g. Scharaschkin & Doyle 2005, 2006). As more is found out about phylogenies, such examples multiply, so the dismemberment of the Icacinaceae s.l. (Stull et al. 2015) has meant that the characterisation of the large euasterid clade has been clarified, which has important implications when we think about euasterid diversification.
The character hierarchy was built up by first drawing up lengthy descriptions of families and then fitting the characters in the descriptions to molecular-based trees with rather conservative topologies. Many of the states of some characters at the base of the angiosperm tree are fairly obvious, hence the fairly lengthy characterisation (apomorphies and plesiomorphies definitely mixed!) for the angiosperms as a whole (but c.f. Sauquet et al. 2017). For some characters, I have worked up the tree, placing them as high as the evidence suggested. Otherwise, features in common to each clade, whether order, families within an order, or groups of orders, are those that are as far as is known common to all the family characterizations in that clade; they may also be synapomorphies (but see below), and are placed at the lowest level in the tree for which there is information available. For some features I have used both approaches, but confusion should be minimal. As relationships and our knowledge of the variation within characters improve, the top-down and bottom-up approaches will merge.
Trees based on morphology alone often have little support, and they may silently lead the user astray (e.g. Zhang et al. 1992). Nevertheless, the validity of an approach that fits morphological variation to a molecular tree may be questioned (but see e.g. Doyle & Endress 2011 and earlier papers). However, I think it rather unlikely that branches that are well supported in molecular analyses will be overturned by morphological data. Analysis of morphological data alone provides support for many of the clades evident in molecule-only analyses, and in conjunction with molecular data may lead to increased support for clades (e.g. Hufford 1992; Nandi et al. 1998: here adding morphological data reduces support for a number of critical clades, too, a not-uncommon phenomenon; Doyle & Endress 2000), however, in none of these papers is the use of morphology without ambiguity. Thus I have been wary of putting much weight on clades that have only morphological support, but molecule-based clades with low bootstrap or jacknife values support values (esp. below 70%) or posterior probabilies (below 0.95) are also treated sceptically - indeed, that we still rely so much on variation in the chloroplast genome alone to construct phylogenies is worrisome, although things are changing. Other reasons for prefering to fit morphological variation to a tree are mentioned below.
Morphological and molecular data are very rarely in irreconcilably strong conflict. However, examples that tend in that direction are the relative positions of the Monimiaceae and Hernandiaceae (Laurales), and there have also been arguments about the position of Hanguanaceae (Commelinales [see below; now very likely] or Zingiberales?), and of Triplostegia (is it in Caprifoliaceae-Dipsacacoideae or -Valerianoideae? - see Dipsacoideae). Although trees based on analyses of morphological data alone rarely command much support, Weins et al. (2010 and references) is an example of the integration of morphological and molecular data, and of fossils and extant organisms.
Indeed, the general congruence between morphological and molecular data is impressive and heartening, and many clades can be characterised morphologically. It seemed in 1998 that there were no unambiguous morphological synapomorphies for angiosperm orders (K. Bremer 2000), and this is still true if by "unambiguous" is meant "non-homoplasious". However, many orders can be characterised and have synapomorphies (see below), even if these are homoplasious and morphologically indistinguishable (at least at the current level of morphological and developmental observation) from synapomorphies elsewhere on the tree.
Identifying apomorphies is important because understanding the evolution of characters is one of our major goals. For this, several preconditions must be met. One needs to have an accurate, robust phylogeny, one has to have examined the right taxa both from the point of view of morphology and molecules, one has to have coded the characters correctly (i.e., delimited states appropriately), one has to have used the right model of evolution when fitting the variation to the tree, and finally, and little discussed here (but see e.g. the notes on the diversification and evolution of angiosperms), one has to establish the right temporal context and to factor in other relevant aspects of the environment (see e.g. Omland 1999; Stevens 2006b).
On each order page the features associated with each node leading to the order in question are summarised, and putative synapomorphies for the order, families, etc., will be found in the page itself (I began to indicate synapomorphies only in xii.2011), although it is clear that one has to be very cautious when talking about character evolution. If the synapomorphies here are compared with those in particular studies cited (e.g. Turgeon et al. 2001; Bremer et al. 2001; Endress 2001; Albach et al. 2001a; Judd et al. 2003; Judd & Olmstead 2004; D. Soltis et al. 2005b; Zhang et al. 2006), differences may be found. Although such studies have been integrated into the characterisations as far as possible, there are several reasons why there may be these differences.
- Firstly, I may not have found all the information about a particular character, there may be disagreement over its interpretation (e.g. Austrobaileyales), or I have added unpublished information (e.g. Primulaceae and their relatives, nodal anatomy, etc.; Diapensiaceae, leaf ptyxis; and Peridiscaceae, Centroplacaceae, etc.).
- Secondly, the sampling of nearly all molecular studies is incomplete, although it is fast improving. Unfortunately, the sampling of the morphological and chemical characters whose evolution we are interested in understanding is also often very poor, yet it is not improving fast. For many anatomical, chemical and embryological characters that are confidently said to characterise families and other groups, we all too often have no idea if those characters are applicable to the whole clade, or just to a subgroup within that clade. Thus Albach et al. (2001a, see also D. Soltis et al. 2005b) assign possession of iridoids to the base of the asterid I + II clades. However, this feature is placed higher up the tree here, partly because of topological uncertainties, but partly because in Lamiales (for example), the first four clades that are successively sister to the remaining Lamiales either lack iridoids or (most Oleaceae) have iridoids different from those found in the other members of the clade. Similar problems arise when thinking of the evolution of ellagic acid in Ericales (Stevens 2006b). (Much older literature, especially that from ca 1870-1920, remains invaluable, since it includes surveys, often never improved on since, of various aspects of plant anatomy and morphology for particular groups, and details there can be optimized on the tree.)
- Thirdly, I am fitting characters to conservative trees that may have several polytomies. This makes optimisation of characters, that is, the assigment of character state change to a particular place on the tree, difficult (e.g. Madison & Madison 2002). In nearly all studies of the evolution of characters (D. Soltis et al. 2005b is a good example), the distributions of those characters are optimised on more or less fully resolved trees, and the construction of supertrees may yield much more detailed hypotheses of relationships; for literature on supertrees, see e.g. Cotton and Wilkinson (2007, 2008), Buerki et al. (2010d) and literature, also see the supermatrix approach. Of course, some nodes even on fully resolved trees and/or supertrees may have little support, and character optimisations will then carry correspondingly little conviction.
And then there are the small matters of character state delimitation and ancestral state reconstruction/character optimisation...
Here there are some general issues to think about first. Sattler (e.g. 1992) has long insisted that our thoughts as it were get trapped inside the terms we use, while Rutishauser and Sattler (1985), Kirchoff et al. (2008), and others outline different ways of thinking about morphology. Here the approach advocated i.a. by Prusinkiewicz and Barbier de Reuille (2010) as they think about plant form - the result of the activity of self-organizing proceses of the plant rather than being under immediate genetic control - very helpful as an antidote; throw in processes like heterochrony and heterotopy (e.g. Baum & Donoghue 2002), and our appreciation of morphology can change dramatically. Mathews and Kramer (2012) also help us to think about evolution of form and how novelties might develop over the course of evolutionary history. The most important thing to remember when using botanical terms is that just because some aspect of an organism is dignified by a sesquipedalian word, this by no means signifies that the term refers to an interesting part of "reality" (see also Kirchoff et al. 2008 and references). As Hesse et al. (2009b: p. 27) noted when writing about pollen morphology "Nature itself neither needs categories nor has any knowledge of them" and "categories are artificial and always delimited by an individual or collective convention". Humans make and define botanical terms, and we use them to facilitate communication, although all too often we take them to be "real" - and then they can be as much an impediment to our understanding as anything else. The terms we use, whether in palynology to describe the shape or surface of pollen grains (e.g. Punt et al. 2006; Hesse et al. 2009b), or in general morphology to describe leaf shape (e.g. see Stearn 1992), may well function quite effectively in communication/description while at the same time being inappropriate for use in character optimization - indeed, the caveats in Hesse et al. (2009b), with sections headed "Inherence of Misrepresentation" and "Controversial or Fuzzy Terms", should make one pause for thought.
The choice of characters and delimititation of their states is a first step to take as we attempt to flesh out phylogenies. For a still useful introduction to the general issue of characters and what they might signify, see the essays in Wagner (2001), and for different ways of thinking about characters, see Kirchoff et al. (2008). Characters and their states ("anthers" and "introrse"/"extrorse"/"latrorse") as they are generally used come from a careful study of morphology, with similar structures on different organisms being compared. Here Remane's three main criteria of "homology"/similarity, special properties, position, and intermediates, are very useful (see Remane 1952; Kaplan 1997: 1 ch. 1; c.f. Endress 2011c, p. 122 - "too limited"), as also when aligning DNA sequences; for how to relate genotype and phenotype using the concept of persistent homology, a somewhat related issue, see e.g. M. Li et al. (2018 and references) and Larson (2018). However, Remane's criteria may not yield an unambiguous answer as to what a structure "is", even given a solid phylogeny to provide context for the subsequent interpretation of the structure in question. As Endress (2005c) observed, a number of features - position, function, development, shape, anatomy, histology, gene activity, and relationships to other taxa that clearly have petals - can be used to distinguish a petal (for example) from other floral structures. And if a petal lacks one of these points of similarity, is it thereby not a petal?, or if it has just one, is it a petal? So what about the outer members of the perianth of Nymphaea which may be part green and part coloured on the outside (Warner et al. 2009)?
At the genetic level the situation is equally complex. Genes do not necessarily do particular things, rather, expression may be a continuous variable. The expression of one gene can be affected by the expression of others, genes are parts of networks, and can be co-opted to act in a variety of different contexts. Thus Maturen et al. (2005) found that floral organ genes (B- and C-class) were expressed in the large, white inflorescence bracts of Cornus (see also Costa et al. 2005), while A-class genes can be expressed in the inflorescence, or even in leaves (Prenner et al. 2011: Euphorbia and Asteraceae). In the inverted repeat loss clade (Fabaceae-Faboideae) FLO/LFY genes, normally floral meristem identity genes, are also expressed in the leaves - which have subtly different morphologies than those of other Fabaceae (Champagne et al. 2007; Wang et al. 2008; Peng et al. 2011; Townsley & Sinha 2012). In Solanum there are similarities at the genetic level between shoot branching and leaf dissection, and development of the abscission zone of the fruit, made up of arrested meristematic cells, is also involved in this regulatory network (Busch et al. 2011; Périlleux et al. 2014).
I have thought a bit about the delimitation of characters and their states (Stevens 2000 for literature), but not nearly enough. One serious problem is that we commonly describe the variation we see, whether in plant architecture, leaf venation, pollen, ovule, nucellus, endosperm, embryo and many other organ systems, as "types" - the solanad type of embryogenesis, Allium type of embryo sac, etc.. Such terms often represent the categorization of simultaneous variation of many features, but these features often vary more or less continuously, and/or are complex, and/or the various elements that make them up may vary independently; all this variation effectively gets "lost" when we use these terms. But worse, we tend to think that evolution is a matter of one "type" of endosperm development (for example) evolving into another. Even when looking at just a single character, we find that many character states refer to a part of a continuum of variation of a particular feature or character such as [character] ovule curvature, [states] ovule anatropous vs campylotropous vs ... (and anther opening mentioned above). Many of these states have been used in the botanical literature for well over a century, but all too often they are the result of the more or less arbitrary division of a continuum of variation (see Gift & Stevens 1997 for how this works in practice), as with ovule curvature itself. Although such character states may seem to communicate "information", whether they are necessarily suitable for either phylogenetic analysis or understanding evolution (see Linder & Bouchenak-Khelladi 2017) is decidedly questionable. Thus when interpreting the literature (e.g. Stevens 2000, 2006b) - and this site, too - one should bear such problems in mind. Of course, as character states change, so may our ideas of evolution (e.g. Lamb Frye & Kron 2003; Hibbett 2004).
Furthermore, understanding even current literature can be difficult because the terms that we use often have no fixed definitions - the term "panicle" can be understood only if you know how the author of the publication in which you find the term was using it (see also Endress 2010b). As Rickett (1954: 2, emphasis in original) noted, "To be uncertain whether "glabrous" means "free from hairs and roughness" or only "free from hairs" is as bad as if π [pi] should stand sometimes for the ratio of a circle to its diameter and sometimes for something else; or as if Cu meant sometimes "copper" and sometimes "brass". Yet this is the state of affairs in botany today". Well over a half century later, this is still the state of affairs. In this context, an ontology that is generally accepted by botanists is essential (see also Vogt et al. 2010); the Plant Ontology Consortium is a resource here, but see Kirchoff et al. (2008) for caveats.
The delimitation of states and characters does not necessarily become easier with increasing knowledge. Thus Buzgo et al. (2004) and Matthews and Endress (2005) show how hard it can be to distinguish between e.g. prophylls and other floral structures, while Penet et al. (2005) find that not all monosulcate pollen in monocots has the same developmental pathway, and suggest that therefore such pollen may not have the same ancestral state. There are all intermediates between vessels and tracheids (Schneider & Carlquist 2009; Carlquist & Schneider 2010; Carlquist 2012a), between fully superior and fully inferior ovaries (Soltis & Hufford 2002), floral phyllotaxis may be very labile, especially in magnoliids and members of the ANITA grade (Staedler & Endress 2009), leaves are not always easily distinguishable from other structures (Rutishauser 1999) - and on it goes. But this is in part a level or universe issue. As a comparison increases in extent, what one thought were differences between structures may disappear - and whether this is important, interesting or simply irrelevant will depend on the problem at hand.
For these and other reasons, character states, the 0's and 1's of a morphological data matrix, are all too often some degrees removed from the actual structures observed on individual specimens, furthermore, all too often there is no clear link between these states and specimens/species on which the structures were observed. Specimens are prepared, observations/measurements made, and these somehow become converted to statements about species and then of more inclusive groups (Stevens 1996). Even for this site, it would be have been ideal if observations and literature could have been directly linked at the level of individual species and specimens in a database - this was obvious even as the possibility of APweb was being discussed in the late 1990s. This would allow problems like wrongly assigned taxa to be dealt with easily - remove a taxon from a family, and all information linked to that taxon would be removed at the same time, perfect when dealing with developing phylogenies. Fortunately, this is an area where are major changes underway, changes spurred by the advent of molecular data and what very soon became a mandatory requirement for DNA sequences used in publications to be deposited in the publicly accessible databases like GenBank (founded in 1982) - publication depends on sequences being made available. Resources like the Dryad Digital Repository (it started in 2008) make the various kinds of data that underpin systematic/evolutionary publications accessible. However, as Drew et al. (2013) noted, alignments and trees were very often not deposited, despite encouragements to do so, and this has made earlier studies of less value than they might have been. There has been nothing that has been functioning as a Genbank for morphological data, and it is often impossible to evaluate whether states in a particular study have been delimited satisfactorily; as in much of taxonomy as a whole there are few data in the sense of observations that can be linked to specimens and that can be reanalyzed when necessary. Data matrices can be deposited in places like MorphoBank (O'Leary & Kaufman 2011), and, more generally, in Dryad, and so you can, for example, access the matrix used by Sauquet et al. (2017) in their work on the ancestral angiosperm flower. More extensive data matrices are being developed (e.g. Sauquet 2016: PROTEUS, not yet open access); Sauquet and Magallón (2018) rightly emphasize the importance of a standardized, fully documented open-access database for plant morphology. Ultimately, fitting basic variation such as measurements, not character states, to a tree allows considerable flexibility in understanding morphology in the context of local phylogenies (see also Stevens 2000; Endress 2005c, but c.f. Weins et al. 2010 for problems here), and this approach has also been used at higher levels, as in a study looking at the evolution of pollen size and shape in Myrtales (Kriebel et al. 2017; see also Parins-Fukuchi 2017).
A good example of the kinds of problems faced are in the description of endosperm morphology and an understanding of its evolution, particularly in monocots. Tobe and Kadokawa (2010) discuss endosperm morphology in terms of cellular and helobial types (Stenar 1925 for early literature; Swamy & Parameswaran 1962). However, this distinction, commonly employed in the literature, does not capture the subtleties of endosperm development (see especially Floyd et al. 1999; Floyd & Friedman 2000, 2001 for alternatives; Rudall et al. 2009b). The initial division of the endosperm is highly asymmetric in Araceae, with subsequent cell divisions initially occurring only in the micropylar chamber; the family is considered to have cellular endosperm (Tobe & Kadokawa 2010; see also Maheshwari & Khanna 1957). The chalazal cells here are sometimes massive (e.g. Paremeswaran 1959). Initial asymmetry in endosperm development characterizes helobial endosperm development, so Araceae could also be characterized as having a form of helobial endosperm; Acoraceae also show similar asymmetry (Buell 1938). There is also variation in whether the micropylar (e.g. Nymphaeales) or both micropylar and chalazal (Amborella: most from the latter) domains give rise to the bulk of the endosperm. The evolutionary history of nuclear endosperm development is also unclear (Olsen 2004; see e.g. Zou et al. 2001 for subtleties in its development), and there is variation in e.g. nuclear endosperm development, as within Poaceae (Leroux et al. 2014). For further discussion about helobial endosperm, see e.g. Kaplan (1981).
The beginning of a solution to such problems is to decompose these "types" into their underlying independent variables. Thus Blackmore et al. (2009) were able to decompose the pollen "types" used in Asteraceae into no fewer than 52 separate characters, while Dunning et al. (2017) found that an example of a reversal from C4 to C3 photosynthesis did not hold up when the independant variables that make up photosynthetic "types" were analysed (see also Floyd et al. 1999; Floyd & Friedman 2000; Herendeen & Miller 2000; Acosta et al. 2009; Y. Yang et al. 2014; Washburn et al. 2015; et al. 2015, etc.). Character states should be delimited only after analysis of the variation of that character at the level of the study being carried out. Thus delimiting states for a local study in the context of the variation shown by a character across all angiosperms may well be inappropriate since gaps in the variation shown by a particular feature in one clade may be obscured by variation in a completely unrelated clade - the universe problem.
After we have decided on characters and states, there is then the question of where character state changes should be placed on the tree, of ancestral state reconstruction/character optimisation. As already mentioned, topological uncertainties are important here: If Tofieldiaceae are sister to the rest of Alismatales (e.g. Ross et al. 2015), then where the change from "cellular" to "helobial" endosperm should be placed on the tree is unclear. Either one gain (apomorphy for order) and one loss (Araceae), or two gains (Tofieldiaceae, and above Araceae). But the problem is broader, since helobial-type development also occurs in the ANITA grade (e.g. Rudall et al. 2009b). Similarly, is triploid endosperm a synapomorphy for all angiosperms or only for the [[magnoliid + Chloranthaceae] [other angiosperms]] clade (Friedman 2001a, b, 2006; Baroux et al. 2002)? Friedman et al. (2003a, esp. b) and Friedman and Williams (2003, 2004) incline towards the latter hypothesis - see especially Williams and Friedman (2004) and Friedman and Ryerson (2009). If Nymphaeales are sister to Amborellales (but see above), then diploid endosperm may be the ancestral condition, with triploid endosperm evolving in parallel in Amborella and above the ANITA grade - and a supporting argument here might be that Amborella has an embryo sac unique in the angiosperms. Simple parsimony, whether ACCTRAN or DELTRAN, helps little here since there are no outgroups (e.g. Friedman & Floyd 2001; Ronse De Craene et al. 2003).
Indeed, using either Bayesian or maximum likelihood analyses, making apparently reasonable suggestions about weighting gains over losses (or vice versa), or more complex assumptions along similar lines, or just using the rather simple models of evolution explicit in ACCTRAN or DELTRAN to place the character on the tree (e.g. Donoghue & Ackerley 1996; Cunningham et al. 1998; Omland 1997, 1999; Ree & Donoghue 1999; Polly 2001; Webster & Purvis 2001; Ronquist 2004; Crisp & Cook 2005; Remizowa et al. 2010b; Sokoloff et al. 2013d; O'Meara 2012; Gascuel & Steel 2014; Wortley et al. 2015; Sauquet et al. 2017; Parkins-Fukuchi 2017) may greatly affect the position of synapomorphies on trees, and hence our ideas of evolution. Sannier et al. (2007) give a good example concerning where on a tree one might peg changes in microsporogenesis in palms (see also Sannier et al. 2009), while Syme and Oakley (2012) note that some methods allow reversals much more easily than others. Pedersen et al. (2007) discuss the sometimes very substantial effect of node support on the posterior probabilities of ancestral character states. But, as Wortley et al. (2015: p. 195) noted rather sadly about optimization, "In general, characters that were unambiguously optimized in all contingencies were either those for which little data were available... or those known to be variable only within a single group, ...". If not reporting on other studies, here I think in terms of parsimony optimization, not always as explicit as it might be, but I have tried to indicate uncertainties as to where particularly important characters change on the tree.
Thus simple parsimony is just one of the ways to optimise the positions of character state changes on a tree, and it may not be the best. Different analyses of the same data may result in state changes ending up in different places on the tree, and/or there may be very different numbers of state changes, and/or changes are interpreted as probabilities given what what we know about the evolution of the character, branch lengths of the tree, topological uncertainties of that tree, and the like. Recent examples are studies of pollen morphology (Wright & Hillis 2014; Wortley et al. 2015; Lu et al. 2015) and of the construction of the ancestral angiosperm flower (Sauquet et al. 2017), where parsimony, maximum likelihood and Bayesian analyses are all compared. Sauquet et al. (2017: Supplementary Methods) capture the issues well, e.g. "MP... has the attractive property that no assumption is made on the relationship between change and branch length; ... Conversely, ML and rjMCMC results are conditional on the assumption that rates of morphological change are constant through times and across lineages." Despite this apparent attractive property of MP, the behaviour of ML and rjMCMC methods was seen as an "advantage" (ibid.). However, molecular clocks are somewhat out of favour these days, and the reasons for invoking a morphological clock, as in ML and rjMCMC methods, are unclear. The bottom line here is that every method has its problems, and extreme care is needed when using them.
The take-home message?: We make the botanical terms we use, not "nature". For me, seeing a plant is forgetting the names its parts are called (with apologies to Weschler 2009); different botanical terms used for what seems to be the same structure are flags that there may be variation out there, but necessarily little more. As Hesse et al. (2009b: p. 27) observed, "Nature itself neither needs categories nor has any knowledge of categories". More particularly, as Olsen (2004: p. S215) noted, "In spite of recent progress in understanding angiosperm phylogeny, all of the main questions regarding the evolutionary history of the nuclear endosperm remain unresolved." - and this is still the case.
There are six final points to remember about the characterisations below, and descriptions in general:
1. I have been much more generally comparative in the ultimate characterisations than is perhaps strictly necessary. That is, the user will be able to find information on the variation of most characters mentioned in the character list as it occurs in all families. An example is leaf ptyxis, which seems to characterise only a few or the terminal groups recognised here, although I have tried to include information about it more generally.
2. More conventional descriptions of a family, for example, as in Cronquist (1981), include a mixture of plesiomorphies and apomorphies at various levels, including apomorphies of groups of genera, or even individual genera, within the family. Although it is difficult to disentangle the importance of the features listed (but typographic conventions, as in Judd et al. 2002, 2015, help), they better convey a description of the organism as a whole. Nevertheless, by summarizing the relevant part of the character state hierarchy before each ordinal characterization, I perhaps give something of the same effect.
3. Some of the features in the characterisations simply describe the extent of variation within the clade, e.g., "stomata anomocytic or paracytic". This is particularly common when relationships within a family, or between a family and its immediate relatives, are unclear. Features so described will not often turn out to be synapomorphies, although elements of the variation they encompass may.
4. There may seem to be little "biology" in the lists of characters at the beginning of each page here. In this context, I am reminded of Stebbins's review of Cronquist (1981) which read in its entirety "The only material of even peripheral interest to the general evolutionist consists of short commentaries on family relationships placed at the end of the description [sic] of many of the families" (Stebbins 1982: p. 628). However, there may be more of interest if you go to the appropriate place in the hierarchy, even if we know little about the functional or adaptive significance of many synapomorphies. Apart from some wind-pollinated, aquatic and parasitic taxa, it is usually difficult to characterise larger groups ecologically, although clades like Ericaceae are exceptions. Indeed, much of the "biology" in more conventional descriptions comes from mention of the pollination biology, for example, of particular genera and other small groups within a family, and users can add this emphasis here as they focus on the taxa that grow locally. However, information on more general aspects of ecophysiology, divergence and/or diversification, pollination and fruit dispersal, and associations with particular groups of herbivores, bacteria and fungi, etc., is being added as it becomes available.
5. As we find out more about variation we find fewer and fewer features that are constant throughout a group. Most unqualified statements of presence and absence should properly be qualified as "usually present" or "usually absent" if one is thinking of the characterisations as encompassing the total variation within a clade. Thus Pistia, alone among monocots as so far known, has sieve tube plastids with starch grains, not protein crystals, although the latter is indeed an apomorphy for monocots. Most, if not all, characters that are high-level apomorphies reverse and/or occur in parallel several times on the tree.
6. Fossils may have character combinations unknown in extant taxa, as may be seen in the discussions of Fagaceae, Platanaceae, Iteaceae and Calycanthaceae, for example. How this will affect characters at interior nodes remains to be seen. Moreover, where exactly on the tree a particular fossil is to be placed can be difficult to work out - see Nymphaeaceae, Calycanthaceae, Paracryphiaceae, etc., fossils such as Archaefructus and Silvianthemum posing particular problems.
The bottom line is that our morphological, anatomical and chemical knowledge of many critical taxa is woefully incomplete, and much basic - and unfortunately perhaps unfashionable - work must be carried to clarify the distribution of morphological, anatomical and chemical characters. For excellent examples of what can to be done, see the work of P. K. Endress, W. E. Friedman, P. Leins and C. Erbar, L.-P. Ronse de Craene and H. Tobe and all their collaborators (various aspects of floral morphology and development) and S. R. Jensen and collaborators (iridoids). Really remarkable discoveries continue to be made, for instance, the realization that the "typical" 8-ncleate embryo sac is not found in basal angiosperms (e.g. Friedman 2001a), and the description of the embryology of Cardiopteris, related to holly, that shows it to be without parallel in any other flowering plant (Tobe 2016). Indeed, filling in the gaps in our knowledge of more mundane things like nodal anatomy, cell and tissue distributions, stomatal morphology, seed coat anatomy, and the like can be quite easy (see also below) and will clarify our understanding of evolution. Much effort must continue to be spent in summarizing characters of well-established clades at all levels, providing features by which they may be recognized, and signaling synapomorphies. However, given our current understanding of both phylogenies and morphology, how to relate the two is not that simple, and different assumptions about character evolution may greatly affect the position of synapomorphies on trees. Nevertheless, it is a relatively easy matter to update notes such as these and to incorporate data on characters that have never before been considered in the context of a tree.
SUMMARY OF THE SYSTEM
[Back to Top]
Below is a summary of the relationships within orders of all the families of seed plants; there are direct links to the pages where the families are discussed. Families, etc., accepted in A.P.G. IV (2016) and the most recent edition of The Plant Book (Mabberley 2017) are very largely the same as those below, and any variation in the circumscription of families recognized by the botanical community will - I hope - eventually disappear.
Square brackets - [...] - enclose clades; the plus sign - + - designates terminal sister taxa; a comma - , - denotes part of a polytomy.
[Cycadaceae + Zamiaceae]Ginkgoales
[[Araucariaceae + Podocarpaceae] [Sciadopityaceae [Taxaceae + Cupressaceae]]]Pinales
[Ephedraceae [Gnetaceae + Welwitschiaceae]]ANGIOSPERMS/FLOWERING PLANTS
Evolution and diversification of the angiospermsAmborellales
[Hydatellaceae [Cabombaceae + Nymphaeaceae]]Austrobaileyales
[Austrobaileyaceae [Trimeniaceae + Schisandraceae]]Mesangiosperms
[Myristicaceae [Magnoliaceae [[Himantandraceae + Degeneriaceae] [Eupomatiaceae + Annonaceae]]]]Laurales
[Calycanthaceae [[Siparunaceae [Gomortegaceae + Atherospermataceae]] [Lauraceae [Monimiaceae + Hernandiaceae]]]]Canellales
[Canellaceae + Winteraceae]Piperales
[Aristolochiaceae [Piperaceae + Saururaceae]]
[Araceae [Tofieldiaceae [[Alismataceae [Hydrocharitaceae + Butomaceae]] [Scheuchzeriaceae [Aponogetonaceae [Juncaginaceae [Maundiaceae [[Posidoniaceae [Ruppiaceae + Cymodoceaceae]] [Zosteraceae + Potamogetonaceae]]]]]]]]]Petrosaviales
Petrosaviaceae [?Here. ?Asparagales]Dioscoreales
[Nartheciaceae [[Taccaceae + Thismiaceae (limits?)] [Burmanniaceae + Dioscoreaceae]]]Pandanales
[Velloziaceae [Stemonaceae [Triuridaceae [Pandanaceae + Cyclanthaceae]]]]Liliales
[[Corsiaceae + Campynemataceae] [[Petermanniaceae [Colchicaceae + Alstroemeriaceae]] [Melanthiaceae [[Liliaceae + Smilacaceae] [Philesiaceae + Rhipogonaceae]]]]]Asparagales
[Orchidaceae [[Boryaceae [Blandfordiaceae [Asteliaceae [Lanariaceae + Hypoxidaceae]]]] [[Ixioliriaceae + Tecophilaeaceae] [Doryanthaceae [Iridaceae [Xeronemataceae [Asphodelaceae [Amaryllidaceae + Asparagaceae]]]]]]]]COMMELINIDS
[Dasypogonaceae, Arecaceae]Poales [to be reorganized]
[[Typhaceae + Bromeliaceae] [Rapateaceae[Mayacaceae [Eriocaulaceae + Xyridaceae] [Thurniaceae [Juncaceae + Cyperaceae]] [Restionaceae [Flagellariaceae [Joinvilleaceae [Ecdeiocoleaceae + Poaceae]]]]]]]]Commelinales
[[Philydraceae [Commelinaceae + Hanguanaceae]] [Haemodoraceae + Pontederiaceae]]Zingiberales
[[Musaceae [Heliconiaceae [Strelitziaceae + Lowiaceae]]] [[Cannaceae + Marantaceae] [Costaceae +Zingiberaceae]]]
[Eupteleaceae [Papaveraceae [[Lardizabalaceae + Circaeasteraceae] [Menispermaceae [Berberidaceae + Ranunculaceae]]]]]Proteales
[Sabiaceae [Nelumbonaceae [Platanaceae + Proteaceae]]]Trochodendrales
[Gunneraceae + Myrothamnaceae]PENTAPETALAE
[Peridiscaceae [[Paeoniaceae [Altingiaceae [Hamamelidaceae [Cercidiphyllaceae + Daphniphyllaceae]]]] [[Crassulaceae [Aphanopetalaceae [Tetracarpaeaceae [Penthoraceae + Haloragaceae]]]] [Iteaceae [Grossulariaceae + Saxifragaceae]]]]], Cynomoriaceae unplacedROSIDS
[Krameriaceae + Zygophyllaceae]Oxalidales
[Huaceae [[Connaraceae + Oxalidaceae] [Cunoniaceae [Elaeocarpaceae [Brunelliaceae + Cephalotaceae]]]]]Celastrales
[Lepidobotryaceae + Celastraceae]Malpighiales
[[Ctenolophonaceae [Erythroxylaceae + Rhizophoraceae]], Irvingiaceae, Pandaceae, [Ochnaceae [[Bonnetiaceae + Clusiaceae] [Calophyllaceae [Hypericaceae + Podostemaceae]]]]] [[[Lophopyxidaceae, Putranjivaceae], Caryocaraceae, Centroplacaceae, [Elatinaceae + Malpighiaceae], [Balanopaceae [[Trigoniaceae + Dichapetalaceae] [Chrysobalanaceae + Euphroniaceae]]]] [[Humiriaceae [[Achariaceae [Goupiaceae + Violaceae] [Passifloraceae [Lacistemataceae + Salicaceae]]]]] [[Peraceae [Rafflesiaceae + Euphorbiaceae]] [[Phyllanthaceae + Picrodendraceae] [Linaceae + Ixonanthaceae]]]]]N-FIXING CLADE
[Quillajaceae [Fabaceae [Polygalaceae + Surianaceae]]]Rosales
[Rosaceae [[Rhamnaceae [Elaeagnaceae [Barbeyaceae + Dirachmaceae]]] [Ulmaceae [Cannabaceae [Moraceae + Urticaceae]]]]]Cucurbitales
[Anisophylleaceae [[Corynocarpaceae + Coriariaceae] [Cucurbitaceae [Tetramelaceae [Datiscaceae + Begoniaceae]]]], Apodanthaceae unplacedFagales
[Nothofagaceae [Fagaceae [[Myricaceae + Juglandaceae] [Casuarinaceae [Ticodendraceae + Betulaceae]]]]]MALVIDS/ROSID II
[Geraniaceae + Francoaceae]Myrtales
[[Combretaceae [Onagraceae + Lythraceae]] [[Vochysiaceae + Myrtaceae] [Melastomataceae [Crypteroniaceae [Alzateaceae + Penaeaceae]]]]]Crossosomatales
[[Staphyleaceae [Guamatelaceae [Crossosomataceae + Stachyuraceae]]] [Aphloiaceae [Geissolomataceae + Strasburgeriaceae]]]Picramniales
[Biebersteiniaceae, Nitrariaceae, [[Kirkiaceae [Anacardiaceae + Burseraceae]] [Sapindaceae [Meliaceae [Simaroubaceae + Rutaceae]]]]]Huerteales
[[Gerrardinaceae + Petenaeaceae] [Tapisciaceae + Dipentodontaceae]]Malvales
[Neuradaceae [Thymelaeaceae [Sphaerosepalaceae, Bixaceae, [Cistaceae [Sarcolaenaceae + Dipterocarpaceae]], [Cytinaceae + Muntingiaceae], Malvaceae]]]Brassicales
[[Akaniaceae + Tropaeolaceae] [[Moringaceae + Caricaceae] Setchellanthaceae[ [Limnanthaceae [[Koeberliniaceae [Bataceae + Salvadoraceae]] [Emblingiaceae [[Pentadiplandraceae [Gyrostemonaceae + Resedaceae]], Tovariaceae, [Capparaceae [Cleomaceae + Brassicaceae]]]]]]]]]]
[Erythropalaceae [Strombosiaceae [Coulaceae [Ximeniaceae, Aptandraceae, Olacaceae [Octoknemaceae [[Loranthaceae [Misodendraceae + Schoepfiaceae]] [Opiliaceae + Santalaceae]]]]]]], Balanophoraceae unplacedBerberidopsidales
[Aextoxicaceae + Berberidopsidaceae]Caryophyllales
[[[Droseraceae + Nepenthaceae] [Drosophyllaceae [Ancistrocladaceae + Dioncophyllaceae]]] [[Frankeniaceae + Tamaricaceae] [Polygonaceae + Plumbaginaceae]]] [Rhabdodendraceae [Simmondsiaceae [[Asteropeiaceae + Physenaceae] [Macarthuriaceae [Microteaceae [[Caryophyllaceae [Achatocarpaceae + Amaranthaceae]] [Stegnospermataceae [Limeaceae [[Lophiocarpaceae [Kewaceae [Barbeuiaceae [Aizoaceae [Gisekiaceae [[Sarcobataceae + Phytolaccaceae] [Petiveriaceae + Nyctaginaceae]]]]]]] [Molluginaceae [Montiaceae [[Halophytaceae [Didiereaceae + Basellaceae]] [Talinaceae [Portulacaceae [Anacampserotaceae + Cactaceae]]]]]]]]]]]]]]]]ASTERIDS
[[Cornaceae [Grubbiaceae + Curtisiaceae]] [Nyssaceae [Hydrostachyaceae [Hydrangeaceae + Loasaceae]]]]Ericales
[[Balsaminaceae [Marcgraviaceae + Tetrameristaceae]] [[Polemoniaceae + Fouquieriaceae], Lecythidaceae, [[Sladeniaceae + Pentaphylacaceae], [Sapotaceae [Ebenaceae + Primulaceae]], [Mitrastemonaceae, Theaceae, [Symplocaceae [Styracaceae + Diapensiaceae]], [[Sarraceniaceae [Roridulaceae + Actinidiaceae]] [Clethraceae [Cyrillaceae + Ericaceae]]]]]]]EUASTERIDS
[Oncothecaceae + Icacinaceae]Metteniusales
[Garryaceae + Eucommiaceae]Gentianales
[Rubiaceae [[Loganiaceae + Gelsemiaceae] [Gentianaceae + Apocynaceae]]]Vahliales
[[Montiniaceae [Sphenocleaceae + Hydroleaceae]] [Convolvulaceae + Solanaceae]]Boraginales
[[Codonaceae [Wellstediaceae + Boraginaceae]] [Hydrophyllaceae, Nama, etc. [Heliotropiaceae [Cordiaceae + Ehretiaceae]]]Lamiales
[Plocospermataceae [[Carlemanniaceae + Oleaceae] [Tetrachondraceae [[Peltanthera [Calceolariaceae + Gesneriaceae]] [Plantaginaceae [Scrophulariaceae [Stilbaceae [[Byblidaceae + Linderniaceae] [[Thomandersiaceae + Verbenaceae], Pedaliaceae, Schlegeliaceae, Martyniaceae, Bignoniaceae, Acanthaceae, Lentibulariaceae] [Lamiaceae [Mazaceae [Phrymaceae [Paulowniaceae + Orobanchaceae]]]]]]]]]]]]]CAMPANULID/ASTERID II
[[Cardiopteridaceae + Stemonuraceae] [Aquifoliaceae [Helwingiaceae + Phyllonomaceae]]]Escalloniales
[[Rousseaceae + Campanulaceae] [Pentaphragmataceae [Stylidiaceae [[Alseuosmiaceae [Phellinaceae + Argophyllaceae]] [Menyanthaceae [Goodeniaceae [Calyceraceae + Asteraceae]]]]]]]Bruniales
[Bruniaceae + Columelliaceae]Apiales
[Pennantiaceae [Torricelliaceae [Griseliniaceae [Pittosporaceae [Araliaceae [Myodocarpaceae + Apiaceae]]]]]]Paracryphiales
[Viburnaceae + Caprifoliaceae]
ON SOME POORLY-KNOWN TAXA THAT ARE IN URGENT NEED OF STUDY
[Back to Top]
Although a vast amount of morphological s.l. information is being developed from question-driven research on model organisms like Zea, Arabidopsis and Antirrhinum, it is often difficult to know how far the results can be extrapolated. Indeed, I have long been impressed (or depressed) about several aspects about the general knowledge of morphology.
— How little we really know when we discuss evolution - for most features, we extrapolate from a quite small data base.
— How little morphological s.l. knowledge is added in most phylogenetic studies.
— How much we are dependent on general surveys from the 1880s to the 1930s in particular (for cytology and plant chemistry the surveys are later, but even here our knowledge base is increasing very slowly), although for aspects of floral morphology and development, for example, a few authors have added substantial amounts of data over the last thirty years. Fortunately, much of this older literature is now on line, for example, see the Botanicus Digital Library, with approaching 2,500,000 pages scanned, while in the Biodiversity Heritage Library there are about 46,000,000 pages [as of 13.iv.2015].
— How easy it is to add new information about many organ systems.
A more comprehensive understanding of plant morphology is needed to enable broad evolutionary questions to be answered with a greater degree of confidence. There is a particular need for targeted surveys of the morphological variation of particular groups. Given limited time and resources, one can also profitably focus on small clades an understanding of the morphology of which will have the greatest effect on our understanding of character evolution. These include taxa along the spines of branches, a small clade sister to the rest of a very speciose clade, etc.. Add to that a judicious sampling of more speciose clades (see also Endress 2011) carefully targeted in the context of the phylogeny of the group of interest; untargeted surveys, or surveys of groups whose phylogeny is unclear, may yield less of interest than one might hope (c. f. Schönenberger & von Balthazar 2006). Neither phylogeny without good comparative morphological knowledge nor comparative morphology without a good phylogeny is of much use.
To help in building up a more extensive information base that will facilitate our thinking about the larger patterns of flowering plant evolution, the list below includes a number of quite small taxa whose phylogenetic position is such that information on embryo sac development, chemistry, seed anatomy, etc., is particularly important for our understanding of the evolution of those features. The taxa listed are reasonable targets for comprehensive studies and are eminently suitable for Masters theses. Of course, given the current academic climate, I cannot say that such studies will be highly regarded by tenure and promotion committees (bless their hearts, wherever they might be), but they should nevertheless be integral to all phylogenetic studies that are concerned with character evolution and integral parts of grant proposals.
The taxa listed below come from all over the world except the northern Temperate and Arctic zones. Tropical Africa (even excluding the Cape region and Madagascar), Central and tropical South America, and Australia-New Zealand are particularly well represented, East Asia and Indo-Malesia somewhat less so, then the Cape region, Madagascar, and the southern U.S.A./northern Mexico, and finally New Caledonia, temperate South America, and Europe. Although some of these genera like Triceratella (Commelinaceae) are very rare, others are quite common, or are even in cultivation (Rehmannia, Aextoxicon, Portulaca, etc.). Such a list does not pretend to be exhaustive, indeed, there are some genera that are quite well known, but for which important information is lacking - thus I know of no reports of fertilization from Nothofagus (Nothofagaceae, sister to rest of Fagales). Given our general level of knowledge, there is no shortage of plants to look at!
In addition to the taxa listed, many others, especially in Geraniales, practically the whole of Phytolaccaceae and Molluginaceae, Irvingiaceae, Kirkiaceae, Acanthaceae-Nelsonioideae, Escalloniaceae, Bruniaceae, etc., could well have been included, and as phylogenies in Lamiales, etc., become better resolved, other clades in urgent need of study will become apparent. Many holoparasites need developmental and embryological studies.
Mention of “basal” in the comments below is relative; it means basal in a tree relative to other more speciose clades, or a small clade sister to a very speciose clade; there are no implications that members of such clades are “primitive” or plesiomorphic.
Achatocarpaceae: Caryophyllales. Achatocarpus (Mexico to Argentina) Phaulothamnus (S.W. U.S.A., N. Mexico): 10 and 1 spp. respectively. Sister to Amaranthaceae s.l.
Aextoxicaceae: Berberidopsidales. Aextoxicon (Chile): 1 sp. One of two families in Berberidopsidales, along spine below the asterids, see also Berberidopsidaceae.
Aizoaceae: Caryophyllales. Acrosanthes (Western Cape, South Africa): 6 spp. The only genus in its subfamily.
Akaniaceae: Brassicales. Akania (E. Australia), Bretschneidera (S.E. Asia): 1 and 1 spp. respectively. With Tropaeolaceae, sister to all other Brassicales.
Alseuosmiaceae: Asterales. Alseuosmia (New Zealand), Crispiloba (N.E. Australia), Periomphale (New Caledonia), Platyspermation (New Caledonia), Wittsteinia (S.E. Australia, New Guinea): 5, 1, 1, 1, and 2 spp. respectively.
Anacampserotaceae: Caryophyllales. Anacampseros, Avonia, Grahamia (Africa, S.W. U.S.A., Central and South America, Australia, very scattered): 32 spp. in whole family. Sister to Cactaceae.
Aphanopetalaceae: Saxifragales. Aphanopetalum (W. and E. Australia): 2 spp.
Aphloiaceae: Crossosomatales. Aphloia (E. Africa, Madagascar, Seychelles): 1-several spp. Sister to one major clade in the order.
Asteraceae: Asterales. Corymbium (South Africa); 7 spp. Sister to Asteroideae.
Asteropeiaceae: Caryophyllales. Asteropeia (Madagascar): 8 spp. One family of two in clade sister to core Caryophyllales, see also Physenaceae.
Barbeuiaceae: Caryophyllales. Barbeiua (Madagascar): 1 sp. Sister to a very speciose part of Caryophyllales.
Barbeyaceae: Rosales. Barbeya (Horn of Africa, Arabia): 1 sp. Near Rhamnaceae.
Berberidopsidaceae: Berberidopsidales. Streptothamnus (E. Australia): 1 sp., florally very different from Berberidopsis. One of two families in Berberidopsidales, along spine below the asterids, see also Aextoxicaceae.
Bixaceae: Malvales. Diegodendron (Madagascar): 1 sp. Isolated in Bixaceae.
Brassicaceae: Brassicales. Aethionema (Mediterranean to Afghanistan): 70 spp. Sister to all other Brassicaceae.
Buxaceae: Buxales. Didmyeles (Madagascar): 2 spp. Sister to rest of family.
Carlemanniaceae: Lamiales - sister to Oleaceae, together sister to the rest of the order, minus one species! Carlemannia, Silvianthus (both S.E. Asia to W. Malesia): 3 and 1 spp. respectively.
Campanulaceae: Asterales. Cyphioideae: Cyphia (Africa): 60 spp. Cyphocarpoideae: Cyphocarpus (Chile): 2 spp. Nemacladoideae: Nemocladus (S.W. North America): 12 spp. Basal or nearly so in the family.
Campynemataceae: Liliales. Campynema (Tasmania), Campynemanthe (New Caledonia, somewhat known): 1 and 3 spp. respectively.
Caprifoliaceae: Dipsacales. Zabelia (East Asia, Afghanistan to Japan): 6 spp. Crucial position in middle of Caprifoliaceae.
Caryocaraceae: Malpighiales. Anthodiscus (tropical South America), Caryocar (Central America, tropical South America): 8 and 15 spp. respectively.
Centroplacaceae: Malpighiales. Centroplacus (West Africa), Bhesa (Indomalesia): 1 and 5 spp. respectively.
Codonaceae: Boraginales. Codon (South Africa): 2 spp. Sister to clade including Boraginaceae.
Combretaceae: Myrtales. Strephonema (West Africa): 3 spp. Sister to rest of family.
Commelinaceae: Commelinales. Cartonema (N. and S.W. Australia, New Guinea) and probably Triceratella (Zimbabwe): 11 and 1 spp. respectively. In clade sister to rest of family.
Convolvulaceae: Solanales. Humbertia (Madagascar): 1 sp. Sister to rest of family.
Cordiaceae: Boraginales. Hoplestigma (West Africa). 1 spp. Strange floral morphology.
Ctenolophonaceaee: Malpighiales. Ctenolophon (tropical West Africa, Malesia): 3 spp.
Cucurbitaceae: Cucurbitales. Indofevillea (N. E. India, Assam, Tibet, Myanmar): 2 spp. Sister to all other Cucurbitoideae.
Curtisiaceae: Cornales. Curtisia (Southern Africa): 1 sp.
Cyrillaceae: Ericales. Cliftonia (S.E. U.S.A), Cyrilla (S. U.S.A. to N. South America): 1 and 1 spp. respectively. Sister to Ericaceae.
Dasypogonaceae: Arecales. Baxteria (S.W. Australia), Calectasia (S.E. Australia), Dasypogon (S.W. Australia), Kingia (S.W. Australia): 1, 11, 3, and 1 spp. respectively. ?Sister to Arecaceae.
Dipentodontaceae: Huerteales. Dipentodon (E. Asia), Perrottetia (Central and W. South America, S.E. Asia to Australia): 1 and 15 spp. respectively. All Huerteales poorly known.
Dirachmaceae: Rosales. Dirachma (Horn of Africa): 2 spp. Near Rhamnaceae.
Euphroniaceae: Malpighiales. Euphronia (N. South America): 3 spp.
Fabaceae: Fabales. Duparquetia (tropical West Africa): 1 sp. Near basal in family.
Geissolomataceae: Crossosomatales. Geissoloma (South Africa): 1 sp.
Gelsemiaceae: Gentianales. Pteleocarpa (W. Malesia): 1 sp. Distinctly odd.
Geraniaceae: Geraniales. Hypseocharis (Andean South America): 1-3 spp. Sister to rest of family.
Gerrardinaceae: Huerteales. Gerrardina (S. and E. Africa): 2 spp.; they look rather different. All Huerteales are poorly known.
Giseckiaceae: Caryophyllales. Giseckia (Africa to E. Asia): 5 spp. Core Caryophyllales.
Goupiaceae: Malpighiales. Goupia (Central and N.E. South America): 2 spp. Probably sister to Violaceae.
Griseliniaceae: Apiales. Griselinia (New Zealand and S. South America): 6 spp.
Grubbiaceae: Cornales. Grubbia (South Africa, the Cape): 3 sp.
Guamatelaceae: Crossosomatales. Guamatela (Guatemala): 1 sp.
Huaceae: Oxalidales. Afrostyrax, Hua (both tropical W. Africa): 1 and 2 spp. respectively. Perhaps sister all other Oxalidales.
Icacinaceae: Icacinales. Includes Casimirella (Paraguay, 1 sp.), Cassinopsis (Africa, Madagascar, 6 spp.), Hosiea (China, Japan, 2 spp.), Mappia (tropical America, 5 spp.), Merrillodendron (Philippines, etc., 1 sp.), Nothapodytes (East Asia to Malesia, 4 spp.), Sarcostigma (S.E. Asia, W. Malesia, 2 spp.). Basically the whole family...
Kewaceae: Caryophyllales. Kewa (ex Hypertelis) (Africa, Saint Helena): 8 spp. Near Aizoaceae, core Caryophyllales.
Lacistemataceae: Malpighiales. Lacistema, Lozania (both Central and South America): 11 and 3 spp. respectively. Sister to Salicaceae.
Leycthidaceae: Ericales. Foetidia (centred on Madagascar): 17 spp. Napoleonoideae, sister to rest of family - Crateranthus, Napoleona (both tropical West Africa): 3 and 8 spp. respectively.
Lepidobotryaceae: Celastrales. Lepidobotrys (tropical W. Africa), Ruptiliocarpon (C. America, W. South America): 1 and 1 spp. respectively. Sister to Celastraceae s.l.
Limeaceae: Caryophyllales. Limeum (Africa to Pakistan, esp. South Africa), Macarthuria (S.E./S.W. Australia): 26 and 10 spp. respectively. Clade along spine of core Caryophyllales.
Lophiocarpaceae: Caryophyllales. Lophiocarpus (South Africa), Corbichonia (Africa to Asia): 4 and 2 spp. respectively. Sister to speciose clade in core Caryophyllales.
Lophopyxidaceae: Malpighiales. Lophopyxis (Malesia to W. Pacific): 1 sp.
Mayacaceae: Poales. Mayaca (West Africa, tropical America): 4 spp.
Metteniusaceae: Metteniusales. Includes Apodytes (Old World tropics, 15 spp.), Calatola (Mexico to Ecuador, 7 spp.), Dendrobangia (tropical South America, 2 spp.), Emmotum (tropical South America, 12 spp.), Metteniusa (Costa Rica to NW South America, 7 spp.), Platea (Malesia, 5 spp.), Rhaphiostylis (tropical W. Africa, 6 spp.). Again, basically the whole family.
Microteaceae: Caryophyllales. Microtea (Central and South America, Antilles): 9 spp. Perhaps sister to all other core Caryophyllales minus Macarthuriaceae.
Montiniaceae: Solanales. Grevea (E. Africa, Madagascar), Kaliphora (Madagascar), Montinia (S. Africa): 3, 1, and 1 spp. respectively.
Muntingiaceae: Malvales. Dicraspidia (Central to N.W. South America), Muntingia (tropical America, somewhat known): 1 and 1 sp. respectively. Neotessmannia (Peru): 1 sp. Does this belong?
Neuradaceae: Malvales. Grielium (South Africa), Neurada (E. Mediterranean to India), Neuradopsis (S.W. Africa): 5, 1 and 3 spp. respectively. ?Sister to all other Malvales.
Nitrariaceae: Sapindales. Nitraria (Mediterranean to C. Asia, S. North America), Peganum, Tetradiclis (both E. Europe to C. Asia): 12, 6 and 1 spp. respectively.
Nyctaginaceae: Caryophyllales. Caribea litoralis (Cuba). Sister to rest of family.
Oncothecaceae: Icacinales. Oncotheca (New Caledonia): 2 spp.
Orobanchaceae: Lamiales. Rehmannia (East Asia), Trianeophora (EastAsia): 6 and 3 spp. respectively. Sister to rest of family.
Passifloraceae: Malpighiales. Pibiria (Guyana): 1 sp. Sister to Turneroideae.
Paracryphiaceae: Paracryphiales. Paracryphia (New Caledonia), Quintinia (Philippines to New Zealand), Sphenostemon (Celebes to N.E. Australia and New Caledonia); 1, 25, and 10 spp. respectively. Sister to Apiales.
Pennantiaceae: Apiales. Pennantia (E. Australia to New Zealand): 4 spp. Sister to all other Apiales.
Pentadiplandraceae: Brassicales. Pentadiplandra (tropical W. Africa): 1 sp. Core Brassicales.
Pentaphragmataceae: Asterales. Pentaphragma (S.E. Asia-Malesia): 30 spp. Sister to major clade in order.
Peridiscaceae: Saxifragales. Medusandra, Soyauxia (both tropical W. Africa), Peridiscus (Amazonia), Whittonia (N. South America): 2, 7, 1, and 1 spp. respectively. Sister to all other Saxifragales.
Petenaeaceae: Huerteales. Petenaea (C. America): 1 sp. Order poorly known.
Phellinaceae: Asterales. Phelline (New Caledonia): 12 spp.
Phyllonomaceae: Aquifoliales. Phyllonoma (Mexico to W. South America): 4 spp.
Picramniaceae: Picramniales. Alvaradoa (Florida, Central America, Bahama, esp. the Greater Antilles, Bolivia to Argentina), Nothotalisia (Panama and N.W. South America), Picramnia (Florida, Central and South America): 5, 3, and 41 spp. respectively. Along spine of malvids.
Poaceae: Poales. Pharoideae: Leptaspis (Old World tropics), Pharus (New World, Florida to Argentina): 4 and 8 spp. respectively. Puelioideae: Guaduella, Puelia (both tropical W. Africa): 6 and 5 spp. respectively. Clades along basal spine of family.
Physenaceae: Caryophyllales. Physena (Madagascar): 2 spp. One family of two in clade sister to core Caryophyllales, see also Asteropeiaceae.
Plocospermataceae: Lamiales. Plocospermum (Central America): 1 sp. Sister to all other Lamiales.
Picrodendraceae: Malpighiales. Podocalyx (Amazonian South America): 1 sp. Sister to rest of family.
Portulacaceae: Caryophyllales. Portulaca (pan(sub)tropical): 40-100 spp. Once-removed from sister to Cactaceae.
Primulaceae: Ericales. Maesa (Old World Tropics): 160 spp. Sister to rest of family.
Quillajaceae: Fabales. Quillaja (temperate South America): ca 3 spp.
Resedaceae: Forchhammeria (S.W. North America), Stixis (E. Himalayas to W. Malesia), Neothorelia (Laos), Tirania (Vietnam): 10, 7, 1 and 1 spp. respectively. Also Borthwickia (S.W. Yunnan, China, and adjacent Burma): 1 sp.
Rhizophoraceae: Malpighiales. Paradrypetes (tropical South America): 2 spp. Very odd genus.
Rousseaceae: Asterales. One clade includes Roussea (Mauritius): 1 sp. The other clade includes Abrophyllum (E. Australia), Carpodetus (New Guinea to New Zealand), Cuttsia (E. Australia): 1, 2, and 1 spp. respectively. Sister to Campanulaceae.
Sapindaceae: Sapindales. Xanthoceras (China): 1 sp. Possibly sister to rest of family
Sarcobataceae: Caryophyllales. Sarcobatus (S.W. North America): 2 spp. Core Caryophyllales, position unclear.
Setchellanthaceae: Brassicales. Setchellanthus (S.W. U.S.A., Mexico): 1 sp. Along spine of Brassicales.
Simmondsiaceae: Caryophyllales. Simmondsia (S.W. U.S.A., Mexico): 1 sp. Along spine just below core Caryophyllales.
Sladeniaceae: Ericales. Ficalhoa (East Africa), Sladenia (S. E. Asia): 1 and ?2 spp. respectively.
Stegnospermataceae: Caryophyllales. Stegnospermum (Central America, Antilles): 3 spp. Along spine of core Caryophyllales.
Surianaceae: Fabales. Cadelia (N.E. Australia), Guilfoylia (E. Australia), Recchia (Mexico), Stylobasium (W. and N. Australia), Suriana (pantropical, a little known): 1, 1, 3, 2 and 1 spp. respectively. Position in order unclear.
Tapisciaceae: Huerteales. Huertea (Central and South America), Tapiscia (E. Asia): 4 and 1 spp. respectively. Whole order poorly known
Tetracarpaeaceae: Saxifragales. Tetracarpaea (Tasmania): 1 sp.
Tetrachondraceae: Lamiales. Polypremum (New World), Tetrachondra (New Zealand, Chile): 1 and 2 spp. respectively. Sister to very speciose monosymmetric Core Lamiales.
Ticodendraceae: Fagales. Ticodendron (Central America): 1 sp.
Torricelliaceae: Apiales. Aralidium (W. Malesia), Melanophylla (Madagascar), Torricellia (Southeast Asia): 1, 7, and 2 spp. respectively. Sister to all Apiales minus Pennantiaceae.
Vahliaceae: unplaced asterid I. Vahlia (Africa, Madagascar, Indian subcontinent): 8 spp.
Wellstediaceae: Boraginales. Wellstedia (N.E. Africa, South West Africa): 6 spp. Sister to Boraginaceae.
Xeronemataceae: Asparagales. Xeronema (New Zealand, New Caledonia): 1 sp. Along spine of Asparagales.
ON THE ORGANIZATION AND DESIGN OF THE SITE
[Back to Top]
This website is best viewed using the most recent version of your preferred internet browser. You can tell when each page was last updated if you look at the top of each page, or, for the individual order pages, immediately below mention of the order or other major clade first mentioned on that page - thus on the Arecales page the update indication is below the characterization of the commelinid group.
The left pane is designed as a quick entry into different parts of the seed plant phylogeny. The default pane includes a reference list of accepted orders as well as a listing of abbreviations, etc., used in the body of the text. There are also direct links to the developing essays on seed plant evolution and angiosperm evolution and diversification. This left pane can be replaced by a list of the main characters that are described in detail elsewhere - see the "Characters" option in the top pane of the website.
The top pane is designed as a menu:
- Home will always return the user to this page.
- Tree gets you to the Main Tree that displays a phylogeny of the orders. Clicking once on the name of a terminal taxon will take you to a characterization of that taxon. Clicking on a node will take you to that particular node. Clicking on one of the tree icons next to an ordinal name will usually take you a tree showing relationships within that order. Not all orders have such icons; some are too small. Clicking on "Unplaced Taxa" will send the user to those few families and genera that lack any convincing evidence as to their likely immediate relationships.
- Orders provides an alphabetical listing of all accepted ordinal names and also all synonyms. Clicking on a letter at the top of the page will take you to the beginning of names that start with that particular letter.
- Families is an alphabetical listing of all accepted family names and their synonyms. Clicking on a letter at the top of the page will take you to the beginning of names that start with that particular letter. Clicking on a link to a family will take you to its characterization.
- Characters provides a textual description of characters used to support the recognition and/or monophyly of a particular taxon. General references that are important sources of data for this project are also included. At the beginning of this page is a notice "click here"; this will display a list of the major characters included in the left pane. One can return to the Orders list by choosing the option "Back to Orders" at the beginning of the character list.
- References provide an alphabetical listing of the references cited; it has been broken down into six interlinked pages for ease of loading. Clicking on a letter at the top or the bottom of the page will take you to the part of the reference list where surnames with that letter begin. Articles with two authors are listed alphabetically by name of second author and then by date, articles with three or more authors are cited last (e.g. as "Chase [et al. 1993]") and are listed by date. You can also search on individual authors' names.
- "Search" takes you to a page where you may enter keywords or terms that you are interested in finding within the Angiosperm Phylogeny Website. The search engine is unfortunately not designed to search for wildcards, so if your search is not successful, please try variations of your search terms. [Para. to be changed.]
- Links lists some other websites or web utilities that are relevant to general classification, floras (a few), image resources, and phylogeny. These listings are very incomplete. There are also some links to other websites in the Bibliography and on individual ordinal pages.
- The Glossary is made up of three separate pages. Clicking on a letter at the top or the bottom of the page will take you to the first definition starting with that particular letter. Within the Glossary, there are cross references to characters at the same level. Thus when reading the definition of paracytic stomata, you will find direct links to other types of stomata such as anomocytic, cyclocytic, etc. There are also links to definitions at other levels, e.g. to stomata in general, where the basic features of stomata are described, and also to variants of individual stomatal types such as brachyparacytic stomata. We have also built in some synonymy, e.g., Rubiaceous stomata are a synonym of paracytic stomata.
The names of the images of plants may not be up to date - sorry. We did not chose the most spectacular photographs we could find, rather, I tried to select those that showed at least some features on the plant that help characterize the group to which it belongs. All photographs are copyrighted as is evident from the characterizations or the images or the external websites themselves. For photographs of individual species and genera, or simply for more images of families, use Google, but beware of the identifications there!
THE INTERPRETATION OF THE TREES, TEXT, ABBREVIATIONS, ETC.
[Back to Top]
In the trees in this site I have emphasized mostly nodes with substantial support (e.g. ³80% bootstrap support) that appear after analysis of data from more than one gene; posterior probabilities for a particular node are usually substantially higher than its bootstrap or jacknife values. In a few cases I have been somewhat less cautious, but I have always tried to make it clear where I am treading on thin ice. There are references (not exhaustive) to papers giving support for the relationships suggested here, and these papers may have more resolved trees than those shown, albeit details of these topologies may on occasion have little support. Chase et al (1993, 2000a), Olmstead et al. (1992, 1993, 2000), Olmstead and Graham (2000a, b), Savolainen et al. (2000a, b), D. Soltis et al. (1997, 1998, 2000, 2003, 2005b, 2017), P. Soltis et al. (2000), B. Bremer et al. (2002), Hilu et al. (2003), Jansen et al. (2007), Saarela et al. (2007), Moore et al. (2007, 2010), etc., are invaluable sources for the developing big picture of angiosperm relationships.
If one printed out all the trees in this site and stuck them all together, it might seem as if one had some kind of supertree, however, it is clear from the description of my modus operandi that these trees are hardly a formal supertree. Readers who are interested in the topology of the most parsimonious tree, or relationships suggested by poorly supported nodes, etc., should consult the original literature. PhyloMatic is a another resource to be used.
For an introduction to the interpretation of phylogenetic trees, see Baum and Smith (2012). Krell and Cranston (2004) and Crisp and Cook (2005) and many others have emphasized how careful one must be when interpreting and talking about ladderised trees (see above) in particular and phylogenetic trees in general; the use of the adjective "basal" is especially dangerous (see also D. Soltis et al. 2005b). However, when describing relationships in the context of a branching phylogenetic tree, the use of terms like "high", "low", "basal", etc., to describe aspects of the branching pattern might seem perfectly appropriate, but it is difficult to talk about branching points of a phylogenetic tree; there is no trunk, and no "branch" - one member of a pair of sister clades - can be basal to the other. Although on occasion, I do use the term "basal", this is in the context of a ladderized tree (see above) and refers to the branch in which there has subsequently been less diversification. Thus Equisetum, Amborella, Acorus, Enkianthus, Humbertia, and many other clades can all be called "basal" from this point of view - but note that this has only the topological connotations I have just mentioned, there is certainly no implication that these taxa lack apomorphies or are "primitive" in any way. Indeed, the terms "primitive" and "advanced" are heavily loaded and I have tried not to use them; "plesiomorphic" and its opposite, "derived" and "apomorphic", are the terms to use when talking about individual characters.
In connection with this ladderisation of the trees, comments on particular nodes - whether subtending dichotomies or polytomies - are to be found on the page where the first branch (reading from left to right) at that node is mentioned, and there are cross-links between pages in important cases. Thus comments on the relationships of the magnoliid clade, which involves discussion of the relationships between Chloranthales, monocots, Ceratophyllaceae and eudicots as well, are to be found on the Chloranthales page alone, however, the discussion there is cross-linked to the other pages. However, comments on relationships within the magnoliid group itself are to be found on the Magnoliales page alone.
The information at the beginning of each page is to be read as if you were following along from the base of the embryophyte tree to the branch in which you are interested. Thus each page starts off with a characterisation of the common ancestor of all embryophytes followed by characterisations of all nodes leading up to your branch. Of course, features mentioned more basally in the tree (earlier on that page) may change, perhaps even reverse. Thus at the node [AUSTROBAILEYALES [[CHLORANTHALES + MAGNOLIIDS] [MONOCOTS [CERATOPHYLLALES + EUDICOTS]]]] you will find "ethereal oils +" - this part of the tree seems to be where that feature evolved. However, the apomorphies of [CERATOPHYLLALES + EUDICOTS] and of all monocots minus Acorales include "ethereal oils 0". Ethereal oils have also subsequently been reacquired several times, as in Zingiberaceae and within Lamiaceae.
The families and orders are those recognized by the Angiosperm Phylogeny Group (A.P.G. IV 2016), any differences reflecting subsequent work (see individual pages). Families are grouped within orders as far as possible according to their phylogenetic relationships. Trees showing relationships within many orders and a few of the larger families are included; they may have been cobbled together from more than one study, and I may indicate the level of support for the nodes on the trees. Subfamilies or tribes are numbered sequentially within each family. Although their characterisations may be only imperfectly worked out, they help clarify which characters really are potential family-level apomorphies and which characterise only parts of the family, speciose though those parts may be. The importance of having infra-familial variation pegged to its correct level is evident in, for example, Annonaceae, Convolvulaceae and Fabaceae. Paraphyletic groups are clearly flagged as being such, e.g. "Caesalpinioideae" (paraphyletic) vs Faboideae (monophyletic) - although there is now a particular circumscription of Caesalpinioideae which is monophyletic that can be used (see the Legume Phylogeny Working Group 2017). Further details of relationships, etc., can be found in the papers cited, which should always be consulted.
For the authorities of the names of subfamilies, families, orders, etc., I have relied largely on Reveal's listings (Reveal 2001 onwards), Hoogland and Reveal (2005) and Thorne (with Reveal) (2007). However, the authors of some names as given here may be incorrect since bibliographic work that affects the names of authorities proceeds apace. Synonymy is as complete as I can get it at the familial level and above.
Features are mentioned in the characterisations roughly following the order in the discussion of the characters on the "Characters" page. Possible apomorphies are indicated, but it will be abundantly clear that fixing characters to nodes is no easy task. For some characters (an example is leaf ptyxis) there is little easily-accessible basic information and the terms used to describe it in the literature are not used consistently. It is not always recorded in the characterisations, although its coverage is slowly improving; the same is true for the pattern of petiole vasculature, presence of pericyclic fibers in the stem, etc.; I had initially not considered such characters to be particularly useful (and this indeed may be true of petiole anatomy). Gaps in data are often indicated, but for many features confidently included in a characterisation, e.g. micropyle type, it should not be forgotten that this may be known for very few taxa. The contraction P stands for perianth, T for tepals, K for calyx, C for corolla, A for androecium, and G for gynoecium. "#<" means equal to or more than, "<#" means equal to or fewer than. "Many" means that there are more than fifteen or so parts. Parentheses in characterisations of clades above the level of family denote characters that are scattered throughout that clade, being found in several, but not all, terminal taxa but in no obvious pattern. Examples are septal nectaries and cuticle waxes in monocots, N-fixing in part of the eurosid I clade, and iridoids in asterids. Parentheses in characterizations of terminal taxa refer (as in conventional descriptions) to uncommon features. Square brackets enclose explanations or glosses of the feature described. When a generic name is placed in square brackets after mention of a feature, this emphasized that the feature is known from/I know of it from only that one genus. However, square brackets surrounding carpel number means that the carpels are connate, at least as to their ovule-bearing parts. A line under the number of carpels means that the ovary is superior. A fuller list of abbreviations, etc., used may be found under "Abbreviations" on the top of the left pane.
Following familial and subfamilial characterisations are two figures, the approximate number of accepted genera and species in the group. I mention most genera with 50 or more species and estimate total numbers of species and genera in families; much of this information is taken from Mabberley's excellent The Plant Book (Mabberley 2008) and from accounts in Kubitzki (1993 onwards). A list of included genera and their synonyms can usually be found by clicking on "list". Coverage of the synonymy is variable, and keeping up-to-date on new generic names is impossible. In general, the patient user will be able to answer most questions about the authorities of generic names, commonly-used synonyms, etc.. There are also some links to images.
General distribution is indicated, and there are some hundreds of distribution maps; these are amended as more information becomes available, and there are yet others in process. These give only overall indications of natural - i.e. unaffected by recent human activities - distributions, and distributions on oceanic islands are usually very incomplete. In many cases the original outline maps on which many maps here are based have had to been amended, for instance, as substantial holes in the distributions of many apparently widely-ranging families have become apparent - indeed, it has proved surprisingly difficult to find/make accurate maps. Major resources include Hultén and Fries (1986: see also Hultén's earlier work), van Steenis (1963), van Steenis and van Balgooy (1966), van Balgooy (1975, 1984, 1993), Meusel and his collaborators (Meusel et al. 1965, 1978, 1992), and Quian and Ricklefs (2004: see electronic appendix, corrections of the often rather vague maps in Heywood ); invaluable maps for tropical Africa are appearing, see Lebrun and Stork (2003 onwards). Some maps in Heywood et al. (2007) have been used as a last resort. There are various online resouces, including the excellent FloraBase, for the flora of West Australia, and Australia's Virtual Herbarium, for the flora of Australia as a whole; BONAP's Taxonomic Data Center for the north American flora, also provides a useful check on the distributions of plants given in general maps (and also much else). Maps generated by searches on GBIF and Tropicos can be helpful, but beware of records from cultivated plants and aliens. I have included the distributions of fossils when they clarify the distributions of extant taxa. Sources for the maps are indicated.
Following most families and a very few orders are brief paragraphs with features that may help in their recognition; the terms used here may not be perfectly "correct" botanically. Some of these features are taken from LaFrankie (2011), while Utteridge and Bramley (2014) suggest how the commoner tropical families can best be recognized.
There are a number of sections with subheadings following the family accounts (and also at other nodes when relevant), and most of these were introduced starting in late 2008. Age refers to crown-group ages, and these are placed at the appropriate node. In several cases there are substantially different age estimates for the same event, so treat these dates with particular caution. Evolution includes discussion on diversification and biogeography (Diversification & Distribution), comments on mycorrhizae and photosynthetic variants, etc. (Ecology & Physiology), information about pollination and disseminule distribution (Pollination Biology & Seed Dispersal), herbivory and galling (Plant/Animal Interactions), fungal and bacterial associations in general (Bacterial/Fungal Associations), interesting Vegetative Variation, Genes & Genomes, and any plants with particular Economic Importance. Parts of these sections have also found their way to the section on the evolution and diversification of the angiosperms.
In Chemistry, Morphology, etc. interesting variation within the family is summarized and there are references to sources of information that have been used but are not mentioned elsewhere in the family notes; there are additional general references in "Characters". Remember that many of the older "family" and "order" references include information about other groups some of which would not now be considered at all close. Phylogeny includes summaries of major works bearing on our current ideas of phylogentic relationships in the family. Classification includes references to the infrafamilial classification followed here, and there is often some discussion about generic limits. Earlier ideas of relationships (see Previous Relationships) includes largely some mention of some suggestions of Cronquist (1981) and Takhtajan (1997), but only because their work is still used. Finding out who was "first" in suggesting a particular relationship is no goal of these pages, the more so since what is often more interesting is not that a particular suggestion was made, but exactly why it was made. Trivia needs no explanation. The "Bibliography" may still have holes.
In the "Glossary" there are definitions of the terms commonly used in the site and some other terms that may be encountered, representative chemical formulae, etc.. Definitions as far as possible follow current usage, rather than etymology or original definitions (see, e.g. Rickett 1954-56). Building this glossary has forced me to confront head-on such problems as the plethora of terms used to describe inflorescences (in part resolvable, I think, providing one does not attempt too much detail) and fruits (more difficult). Botany needs a common language, and for this the botanical community as a whole needs to agree on definitions. A goal of botanical terminology must be to simplify the terms we use and to try and reduce the number of synonyms in common use.
- WARNING TO ALL USERS!!
[Back to Top]
All clades are hypotheses of relationships, and as hypotheses they may be overturned. Even though I have for the most part been conservative, changes in our ideas of relationships, and hence in the clades we talk about, are still likely in places. Relationships of a few taxa are still largely unknown or only partly known, and there will turn out to be a few seriously misplaced genera. Thus some changes are to be expected, but change is neither a defect of cladistics nor a necessary consequence of the use of molecular data.
A very important issue for all morphological studies, particularly at this level, is the documentation of variation within a character and the justification of the states that are used to describe this variation. As I often mention, the states used have frequently been inadequately justified, explicit criteria for their delimitation not having been presented. Character states that lack justification may or may not compromise phylogenetic analyses, but there is certainly little reason to talk about their evolution (Stevens 2000 for literature; see also above). To quote Heywood (1973: p. 311) slightly out of context, "Systematics can be likened to a mincing machine into which data of all sorts is fed and processed to form a series of sausages ... The basic recipe for the construction of these "sausages" is usually secret, yet it is such encapsulated pieces of information that we have to work with and communicate with." In general, character states as conventionally delimited may not be as helpful to us as we might like or assume.
For many characters in the characterizations our basic knowledge is often incomplete, and certainly sampling within families is often inadequate. However, I have included less information about chemical, wood anatomical, palynological, and embryological characters than I might. This is because for chemical characters (for example), my basic knowledge needs improving, while for some of the other more cryptic characters the sampling may be poor. But equally importantly, these notes began as teaching notes, and I judged that there was more than enough for the student or even the teacher to be going on with. An ideal system would include much more detail, the user ignoring anything irrelevant for the purpose at hand.
That I mention some features as not being known is in many cases simply an expression of my ignorance.
This site is updated on a more or less continuous basis. I teach plant families at least every other year and that is a particularly good time for finding areas that need attention.
I am always more than grateful if you can let me know of any references in the text that lack citations, errors of omission or commission, etc. - firstname.lastname@example.org should find me.
Spelling is erratic and somewhat mid-Atlantic; grammar is little better. All errors are mine.
HISTORY OF THE SITE
[Back to Top]
These notes began as I taught Biology 103, the basic plant families course, at Harvard, taking over from Carroll E. Wood, Jr. (an impossible task). They became more wide-ranging when I twice taught OTS "Tropical Plant Systematics", and when I attempted to deal with all families in a one-semester, graduate-level course at Harvard. This last was a failure; we did not get to the monocots. However, the time and energy spent in assembling material for each class from all corners of the herbarium convinced me like nothing else could have that the modified Englerian system the herbarium then followed was didactically a disaster. Finally, I have long been interested in sight identification of herbarium material; unidentified specimens have to be brought into the system somehow. I have done this intermittently, but as frequently as I could, over the years, often with students and others, and this has been invaluable both in reviewing characters and also in learning more.
The first version of the website (version 2) that was made generally known included literature that appeared as late as early August, 2001; the Introduction to this version is dated the 13th of July 2001. Version 2 benefited from teaching again in Costa Rica. This time I was with InBio personnel, and I learned perhaps more than I taught; a course given at the University of Campinas, in Brasil, was similarly stimulating.
[Back to Top]
What happens with these pages depends on how useful they are found to be, or can be made to be. Their goal is to help in teaching, although I find them also a useful research tool in that they help direct one's attention to interesting characters and to taxa that are poorly understood. They are not intended to compete with other web sites that depict the tree of life, etc.
Filling in gaps in the literature continues to improve family characterisations; three character systems in particular, chemistry, wood anatomy and palynology, clearly need more attention. We will continue to build in more links to photographs and to other web sites, particularly those that focus on families. Adding good diagrams showing the basic floral morphology of each family would be very helpful (see Ronse DeCraene 2010 for recent work on floral diagrams; Eichler 1875-1878 remains invaluable), and simple floral formulae are being added, although complex formulae can be difficult to understand (see Prenner et al. 2010; Simpson 2010; Nuraliev et al. 2019 for other symbols that might be useful).
[Back to Top]
Last, but certainly not least, I thank Kobinah Abdul-Salim, Stuart Davies, Diane Ferguson, Rick Ree, George Weiblen and Barbara Whitlock for providing the impetus to start this. Students at courses at Harvard University, Massachusetts, InBio, Costa Rica, the University of Campinas, Brasil, and the University of Missouri, St Louis and Missouri Botanical Garden have worked with successive versions of the hierarchy, uncovering flaws as they did do; to them my thanks for their enthusiasm and tolerance. Librarians at Harvard University and the Missouri Botanical Garden have been unfailingly helpful in finding sometimes obscure references. A few sections have been read by colleagues who are thanked individually in the text. Many people have sent in comments and/or noticed mistakes; my thanks to you all, although I have not mentioned you individually unless you have caught particularly egregious errors! Mark Olson, Beth Owen and Bob Magill helped to set up the site. Finally, to say that I am extremely grateful to Hilary Davis for building the site, adding the links to the Gentry photographs of the Missouri Botanical Garden, and generally helping things along, is an understatement.
P. F. Stevens, 13 (Friday) July 2001 - University of Missouri, St Louis, and Missouri Botanical Garden.
I inexcusably forgot to thank Dr Barry Hammel (Costa Rica) and Drs Maria Amaral and Volker Bittrich (Campinas, Brasil) for their invitations to teach/learn from InBio personnel and University of Campinas staff and students respectively. For version 3, many have been kind in either giving us their photographs or allowing us to make direct links to their photographs: Atlas of Florida Vascular Plants, Arizona State University Herbarium, Australian National Botanic Gardens, David Boufford - Biodiversity of the Hengduan Mountains China, Mark Brand - UConn Plant Database, CalPhotos, Connecticut Botanical Society, Michael Dillon - Andean Botanical Information System, Michael A. Dirr, Murray Fagg - Australian National Botanic Gardens, Gardenweek.org, Christine Howells - Australian Plants Society Tasmania, Kelly Irvin - International Bulb Society, Don Les, John Maunder - Provincial Museum of Newfoundland and Labrador, Kate McCombs - Christchurch City Council New Zealand, Andrew Morgan - University of Tasmania, Clinton Morse - University of Connecticut Greenhouse, Mount Tomah Botanic Garden Australia, Lytton J. Musselman, Dan Nickrent - Parasitic Plant Connection, Plant of the Week - Smithsonian Institution, James Reveal, George Schatz - Madagascar Conspectus Website, Thomas Schöpke, Tim Stephens - University of California Santa Cruz, Kurt Stüber, Jim Mann Taylor, Texas A&M Bioinformatics Working Group Vascular Plant Gallery, Brian Walters - Australian Society for Growing Native Australian Plants, Len Webb Ecological Images Collection, Western Australia Seagrass Website, Steven J. Wolf, University of Wisconsin Plant Systematics Collection. Gerald Carr and Heinz Schneider have been particularly generous in making available many magnificent photographs; these are but some that can be found at the sites that the two maintain (Vascular Plant Families Image Gallery and Botanical Image Database, respectively). Dave Boufford has caught many errors throughout the site, as has Paul von Rijckevorsel. I am again grateful to students at the University of Missouri at St Louis, Washington University, and St Louis University, and to staff and visitors at the Missouri Botanical Garden. And once again, my sincere thanks to Hilary Davis, to whom the improvements of the site are due.
P. F. Stevens, 15 May 2002 - University of Missouri, St Louis, and Missouri Botanical Garden.
Once again, the normal list of suspects is to be thanked. Many people have written in with comments that, whether apparently trivial or cosmic, I have tried to take into account. I remain grateful to students at the University of Missouri at St Louis, Washington University, and St Louis University, and to staff and visitors at the Missouri Botanical Garden. Hilary Davis continues to improve the site.
P. F. Stevens, 2 May 2003 - University of Missouri, St Louis, and Missouri Botanical Garden.
Version 4 was archived on May 1. I again thank students at the University of Missouri at St Louis, Washington University, and St Louis University, and all who have caught mistakes, suggested additions, etc. Hilary Davis has worked hard on the site, especially the glossary, and she plans further improvements.
P. F. Stevens, 5 May 2004 - University of Missouri, St Louis, and Missouri Botanical Garden.
Version 5 is being archived today, Sunday May 22. All those who have caught mistakes and made suggestions are gratefully thanked, and Hilary Davis is particularly thanked for developing the protocol for making the maps. She has also developed a protocol for integrating illustrations of critical characters with the text that we will try and implement this next year.
P. F. Stevens, 22 May 2005 - University of Missouri, St Louis, and Missouri Botanical Garden.
Version 6 was archived on Sunday May 21. All those who have caught mistakes and made suggestions are gratefully thanked, and Andrew Ford has been particularly helpful. The library staff at the Missouri Botanical Garden have patiently dealt with a positive barrage of queries this last year, and I thank Victoria McMichael and Mary Stiffler in particular for their tolerance. Hilary Davies continues to help developing the site.
P. F. Stevens, 22 May 2006 - University of Missouri, St Louis, and Missouri Botanical Garden.
Version 7 was archived on Sunday, June 3. I am particularly grateful to students in my plant families course for making this last term so stimulating, and spurring the development of a new section of the site, the "Student" section, which Hilary Davis put up just prior to archiving. Again, the library staff have been swamped with requests, and again corrections and additions have been suggested; I thank all who have been involved in this project, no matter how apparently peripherally.
P. F. Stevens, 8 June 2007 - University of Missouri, St Louis, and Missouri Botanical Garden.
Version 8 was archived on Friday, May 30th. The "Student" section has occupied my energies for much of this year. I thank Felipe Zapata for bringing literature to my attention, making suggestions, etc., Hilary Davis for continuing to persevere with this project, and all who have suggested additions and corrections.
P. F. Stevens, Monday, May 26th, 2008 - University of Missouri, St Louis, and Missouri Botanical Garden.
Version 9 was archived on Friday, June 19th. In connection with suggestions made by Amy Zanne and Cam Webb, I have been working at making the discussion more rational, introducing a number of subheadings which I hope will make it easier to navigate. Felipe Zapata and Hilary Davis have continued to help with this project, and I thank all of you who have sent me additions and corrections.
P. F. Stevens, Sunday, June 21st, 2009 - University of Missouri, St Louis, and Missouri Botanical Garden.
I have just realised that I omitted to say that version 10 was archived on on July 7th; the year has been very hectic. I am not as far along with the reorganization of the site as I wanted to be, but hope springs eternal. Cam Webb, Amy Zanne, Felipe Zapata and Hilary Davis continue with the project, and again I thank all of you who have sent me additions and corrections; the "small" errors often point out larger problems.
P. F. Stevens, Saturday, November 20th(!), 2010 - University of Missouri, St Louis, and Missouri Botanical Garden.
Version 11 was archived on May 21st. Unfortunately, this year was no better than the last... Thanks Cam, Amy, Felipe and Hilary for continuing to directly or indirectly prod, and to all for corrections; trees will have to be updated in the very near future!
P. F. Stevens, Saturday, May 28, 2011 - University of Missouri, St Louis, and Missouri Botanical Garden.
Version 12 was archived on July 25th. The whole site is in the process of being reworked, and this has exposed the consequences of adding bits of information sporadically over a long period of time... In addition, with the help of Cam the apomorphies - many of which have been added this past year - are being given xml tags, and these will shortly be extended to the discussion, the nodes, etc. Thanks to Amy, Felipe and Hilary for continuing help, the staff of the library at the Garden who have deat with even more of a barrage of requests than usual, and to Volker Bittrich and to all the rest of you for comments and corrections.
P. F. Stevens, Tuesday, August 1, 2012 - University of Missouri, St Louis, and Missouri Botanical Garden.
Version 12 was archived on September 26th. The reworking of the site has continued - apomorphies have now been flagged throughout the site, and dates for clades, which were in a considerable mess, are being cleaned up. Given the reorganisation of the site over the last couple of years, the "Student" section has been removed, as has the "Apomorphies" section (it is now redundant) and the "Statistics" section (that was simply one more area to keep up to date, and anyhow, I had largely ignored it).
P. F. Stevens, Tuesday, September 28, 2013 - University of Missouri, St Louis, and Missouri Botanical Garden.
Getting there.... Dates for clades are now largely cleaned up, but most must be seriously inaccurate. The next push must be to clean up the trees. Readers have made many useful comments, and Felipe Zapata in particular has helped with literature.
P. F. Stevens, July 4, 2017 - University of Missouri, St Louis, and Missouri Botanical Garden.
[Back to Top]
Return to Research Home