Forgotten forests - issues and prospects in biome mapping using Seasonally Dry Tropical Forests as a case study
© Särkinen et al; licensee BioMed Central Ltd. 2011
Received: 6 September 2011
Accepted: 24 November 2011
Published: 24 November 2011
South America is one of the most species-diverse continents in the world. Within South America, diversity is not evenly distributed at either local or continental scales, and this has led to the recognition of various areas with unique species assemblages. Several schemes currently exist that divide the continental-level diversity into large species assemblages referred to as biomes. Here we review five currently available biome maps for South America: the WWF Ecoregions, the Americas Basemap, the Land Cover Map of South America, Morrone's Biogeographic regions of Latin America, and the Ecological Systems Map. The comparison is performed through a case study on the Seasonally Dry Tropical Forest (SDTF) biome using herbarium data of habitat specialist species.
Current biome maps of South America perform poorly in depicting SDTF distribution. The poor performance of the maps can be attributed to two main factors: (1) poor spatial resolution, and (2) poor biome delimitation. Poor spatial resolution strongly limits the use of some of the maps in GIS applications, especially for areas with heterogeneous landscapes such as the Andes. Whilst the Land Cover Map did not suffer from poor spatial resolution, it showed poor delimitation of biomes. The results highlight that delimiting structurally heterogeneous vegetation is difficult based on remotely sensed data alone. A new refined working map of the South American SDTF biome is proposed, derived using the Biome Distribution Modelling (BDM) approach, in which georeferenced herbarium data are used in conjunction with bioclimatic data.
Georeferenced specimen data potentially play an important role in biome mapping. Our study shows that herbarium data could be used as a way of ground-truthing biome maps in silico. The results also illustrate that herbarium data can be used to model vegetation maps through predictive modelling. The BDM approach is a promising new method in biome mapping, and could be particularly useful for mapping poorly known, fragmented, or degraded vegetation. We wish to highlight that biome delimitation is not an exact science, and that transparency is needed on how biomes are used as study units in macroevolutionary and ecological research.
South America is one of the world's most diverse continents, housing around 90,000-110,000 species of seed plants, c. 37% of the world's total [1–3]. Taxonomic diversity, however, is not evenly distributed within the continent; on a broad scale, the Amazon rain forest is home to completely different species from those of the mountain tops of the Andes, and areas differ on a finer scale in their species richness and endemism. Understanding such diversity gradients, and the processes that shape and maintain them, remains a focal question in ecology and evolutionary biology.
Studies aiming to understand diversity gradients rely on schemes depicting the distribution of this diversity. At the continental scale, species diversity is divided into major units referred to as biomes (also known as vegetation zones, phytogeographic regions, phytochoria, etc). For example, Africa is divided into 22 biomes based on floristic similarity, climatic factors, and vegetation structure. The African biome map was originally developed by White [5, 6] and later revised and digitised using remote sensing data. White's biome delimitation has been widely accepted among ecologists, conservationists and evolutionary biologists, and the stability of the African biome map has enhanced collaboration across research fields (e.g. [8–10]).
General types of biome maps
Hierarchical representation of regions
Representation of ecological affinities
Representation of evolutionary communities
Examples of maps
Species composition, richness & endemism
WWF Ecoregions 
Evolutionary biology & conservation science
Land cover maps
Remote sensing data
Land Cover Map of South America 
Climate, elevation, and species composition
Ecological Systems Map 
All research fields
The gap between the fields of ecology and evolution is closing (e.g. [18, 19]). As a result, there is a growing need for a common frame of reference with which to test hypotheses that bridge the fields. Currently, biomes are used as study units in many macroevolutionary and ecological studies (e.g. [20–24]), without critical analysis of the biome maps used and their limitations. It is clear that a thorough discussion is needed on what biomes are and how they should be delimited. Defining a stable biome map scheme for South America that could be used in both macroevolutionary and macroecological research would increase transparency and stimulate dialogue between the research fields.
The concept of biomes was originally developed by Alexander von Humboldt, who first noted the dynamic relationship between vegetation composition and structure, climate, and geography. Humboldt argued that vegetation had a central role in the understanding of landscape-level processes, and first articulated the idea that biomes can be seen as evolutionary theatres for the lineages they contain. The modern development of Humboldt's original concept, where biomes are seen as biological meta-communities, comes largely from recent molecular phylogenetic studies in which a strong pattern of phylogenetic niche conservatism (PNC) is seen across plant lineages at the global level. The expectation that related species tend to occupy similar environments [26, 27] is the basis for PNC. The PNC concept potentially has a strong role in explaining historical species assembly, as it governs the composition of regional floras and species pools from which communities are assembled over time. This means that biomes, and their dynamic history through time, can have a strong effect on the evolution of the lineages they host (e.g. [14, 28]).
This study was started with the aim of exploring ways to derive better biome maps for South America that could be used in biological research. In this paper we focus on exploring how herbarium data can be used as an aid in biome mapping. First, we review a set of available digital biome maps of South America to discuss their strengths and weaknesses. We focus on five recently proposed biome maps which are available in digital format: the Land Cover Map (LCM), the WWF Ecoregion map (ECO), the Americas Basemap (AB), the Latin American Biogeography Scheme (LAB), and the Ecological Systems Map (ESM). The comparison is performed through a case study of the Seasonally Dry Tropical Forests (SDTF), a relatively poorly known biome with a strongly fragmented distribution across South America. Georeferenced herbarium records of species endemic to the biome are used to ground-truth the biome maps in silico. In the second part of the study we propose the Biome Distribution Modelling (BDM) approach to biome mapping. Climatic and elevation data are used in conjunction with herbarium specimen data of habitat specialist species to derive a new high-resolution biome map for SDTF in South America based on predictive modelling.
Seasonally Dry Tropical Forests
The SDTF, or BTES (Bosques Tropicales Estacionalmente Secos) or FED/FES (Florestas Estacionais Deciduais e Semideciduais), is a relatively recently identified biome, which was first defined based on floristic similarity and high endemism at both generic and species level [32, 33]. The ecology and biology of neotropical SDTF have been recently reviewed [34–36], but in short, SDTFs are found in areas with low annual rainfall (less than 1,100 mm/year) and a dry season at least 5-6 months long during which rainfall remains below 100 mm [37, 38]. The flora is dominated by species in the angiosperm families Leguminosae, Cactaceae, and Bignoniaceae, and species show morphological adaptations to the long dry season, during which most of them are deciduous.
Definitions of South American dry biomes
Annual rainfall (mm/year)
Length of dry season (months)
Dominant plant families
Physiognomy of vegetation
Notes on flora
Natural fire cycles
Seasonally dry tropical forests
Leguminosae, Bignoniaceae, Euphorbiaceae, Cactaceae, Bromeliaceae
Open to closed canopy forest
Adaptations to drought, scarcity of perennial grasses
Fertile, well drained, shallow soils. pH 6-7
Leguminosae, Myrtaceae, Vochysiaceae, Poaceae, Cyperaceae
Open to wooded grasslands
Fire adaptations in most plants, dominance of C4 grasses
Poor, Al rich, well drained, deep soils. pH very acid (~5)
c. 5 (variable)
Leguminosae (esp. Mimosoideae), Anacardiaceae, Cactaceae, Poaceae, Bromeliaceae
Open to closed canopy forest, interspersed with occasional savannas
Frost and salinity tolerant species with temperate affinities
Saline. Sometimes very alkaline in depth (up to pH 8-9)
Regular, rarely snow
The fragmented distribution and the structural variation of the vegetation make the SDTF biome a perfect case study for exploring how herbarium data could be used as an aid in biome mapping. Firstly, fragmented biomes are generally underrepresented in biome maps, as small areas often remain undetected in continental-scale maps, especially if spatial resolution is poor. The issue of how best to map small but significant biome fragments has not thus far been discussed in detail, although the need for this has been highlighted by conservation agencies. Similarly, biomes that show structural variation pose a challenge because many remote sensing applications cannot readily pick up the differences between structurally similar vegetation types. Validation methods such as ground-truthing are required for remote sensing maps, but solutions for continental-scale studies (e.g. biome mapping) are sparse.
Delimitation of SDTF on Available Biome Maps
Performance of biome maps for SDTF
Percentage of specimens
Narrow endemics only
Narrow endemics only
Narrow endemics only
Latin American Biogeography Scheme (LAB)
Americas Basemap (AB)
WWF Ecoregions (ECO)
Land Cover Map (LCM)
Ecological Systems Map (ESM)
Biome maps of South America
Primary data used
No. of classes
No. of classes
No. of classes
No. of classes
Latin American Biogeography Scheme
Geography, and secondarily species composition and endemism
Species composition and endemism
Species endemism, but landform used as primary data in areas lacking widely used biogeographic maps e.g. South America
Land cover map
Remote sensing and elevation
Land cover classes
Ecological Systems Map
Climate, elevation, geology, land cover, and landform data
A species-by-species breakdown of the results shows a consistent trend across species, with similar percentages of specimens falling within and outside SDTF (Additional file 2, Tables S7 and S8). A consistent percentage of specimens falls in either neighbouring biomes or other dry biomes (Additional file 2, Tables S7 and S8). This indicates that the results of the map comparison are not due to single species dominating the dataset, but due to a consistent trend across species. Similarly, analysis of the smaller data set, in which only narrowly restricted species were included, shows a pattern consistent with the wider analysis, where less than half of specimens fall in SDTF (Table 3). The secondary analysis shows slightly smaller fractions of specimens mapping under Cerrado and Chaco (Table 3).
Despite its high spatial resolution, the LCM performed poorly in the map comparison, recovering only 14.6% of specimens under SDTF (Table 3). The poor performance is, however, an artefact of the LCM including anthropogenic habitats. Nearly half of the specimens fall into agricultural land (48.2%) under categories such as 'Mosaic agriculture and degraded forest' (Additional file 1, Table S1). Although strict comparison of the LCM and the other maps is difficult, the results indicate that a large fraction of SDTF in South America is affected by human disturbance and is severely fragmented. If the agricultural areas are considered as SDTF, the LCM becomes the best performing map with 62.7% of specimens mapping under the correct biome (Table 3). Specimens falling outside SDTF map under the Cerrado and Chaco biomes, similar to the other maps (Table 3). This indicates that the poor performance of the LCM is not because it misidentifies the biome, but due to the severe human-induced fragmentation of the SDTF biome.
Biome Distribution Modelling
Comparison of Current Biome Maps
The baseline data of the current biome maps of South America reviewed here vary considerably, including data on species composition, endemism, climate, elevation, and vegetation structure. Hence, biome delimitations are expected to vary between the maps. The expectation reflects the fact that biomes are complex empirical realities that are hard to organise into fixed categories, an issue discussed in depth in previous publications (e.g. [31, 47]). Despite this, there is a growing need to review how biomes are defined in biology. Macroecological and evolutionary research is developing into fields investigating ecological and evolutionary aspects of biomes, such as productivity gradients, extinction risk, and forest die-back due to climate change across biomes. Such studies should be based on biomes defined as biologically meaningful units, i.e. large evolutionary meta-communities that are not only ecologically similar but share evolutionary lineages (species, genera, families, and orders). Ways of deriving such evolutionary biome delimitations using community phylogenetics have been explored in a recent study [Oliveira-Filho AT, Pennington RT, Rotella J, Lavin M: Exploring evolutionarily meaningful vegetation definitions in the tropics: a community phylogenetic approach, submitted].
With this in mind, we performed a detailed comparison of the five biome maps using the SDTF biome as an example. SDTF is a poorly known biome with a strongly fragmented distribution across South America, and hence it works as a perfect case study for exploring issues in biome mapping. SDTF has been confused in the past with other South American dry biomes, the Chaco and savannas, especially the Brazilian Cerrado, and hence we expected to see major differences between maps. We used georeferenced specimen data of SDTF habitat specialist species to ground-truth the biome maps and to test how the maps differed in depicting SDTF distribution.
The results showed poor performance of all maps in depicting known fragments of SDTF based on herbarium records of habitat specialist species. Less than half of specimens were mapped under the SDTF biome in all of the maps. Large proportions of specimens were mapped under other biomes, mainly the Chaco and Cerrado, or under neighbouring biomes in the Andes. Our first step was to fully explore the potential underlying causes of the poor performance. The mismatch between the species distribution data and the biome maps raised the question of whether georeferenced herbarium specimens can be validly used as surrogates for biome distribution. Here we consider two important questions in relation to herbarium data: (1) georeferencing errors, and (2) species' ecological lability and habitat preferences.
Georeferencing errors are common in databases such as GBIF, and rigorous cleaning is required before any analysis can be done (see Methods). Most of the modern herbarium specimens do not present an issue, as these have been georeferenced in the field with GPS and have relatively accurate coordinate data. Excluding obvious typing errors, these modern collections can be considered as high quality data. Specimens without coordinate data, however, are being georeferenced after the actual collection event based on the locality description on the specimen label. This is where errors can take place. Whether the georeferencing is done manually or with automated software, both methods come with errors. The beauty of herbarium data is, however, that each specimen has duplicates, commonly as many as five, which are deposited in other herbaria. As these specimens become georeferenced, they provide independent, repeated samples which can be used to detect errors. Hence we consider the role of georeferencing errors in relation to herbarium data in general as a manageable source of error that can be controlled with rigorous cleaning. In our dataset, duplicate georeferenced specimens allowed efficient cleaning of our datasets, with c. 390 records deleted as a result.
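The duplicate-based check described above can be illustrated with a short Python sketch. It is not the cleaning procedure used in this study, only one way such a check might be set up: the record fields (`collector`, `number`, `lat`, `lon`), the 10 km tolerance, and the example records are all hypothetical.

```python
import math
from collections import defaultdict


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two points in decimal degrees."""
    radius_km = 6371.0  # mean Earth radius
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    return 2 * radius_km * math.asin(math.sqrt(a))


def flag_conflicting_duplicates(records, tolerance_km=10.0):
    """Group duplicates by collection event and report events whose
    coordinates disagree by more than tolerance_km (hypothetical cut-off)."""
    events = defaultdict(list)
    for rec in records:
        events[(rec["collector"], rec["number"])].append(rec)

    flagged = {}
    for event, dups in events.items():
        worst = 0.0
        # Compare every pair of duplicates from the same collection event.
        for i in range(len(dups)):
            for j in range(i + 1, len(dups)):
                d = haversine_km(dups[i]["lat"], dups[i]["lon"],
                                 dups[j]["lat"], dups[j]["lon"])
                worst = max(worst, d)
        if worst > tolerance_km:
            flagged[event] = worst
    return flagged


# Hypothetical records: the second event contains a latitude sign error.
records = [
    {"collector": "Collector A", "number": "101", "lat": -6.50, "lon": -78.50},
    {"collector": "Collector A", "number": "101", "lat": -6.51, "lon": -78.50},
    {"collector": "Collector B", "number": "7", "lat": -6.50, "lon": -78.50},
    {"collector": "Collector B", "number": "7", "lat": 6.50, "lon": -78.50},
]
flagged = flag_conflicting_duplicates(records)
```

Flagged collection events would still need to be resolved against the label data, since the sketch only reports disagreement and cannot tell which duplicate carries the correct coordinates.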
The role of potential "weedy" species (species with a broader ecological preference that spans the SDTF limits) was investigated by re-analysing the maps using smaller data sets of specimen records from narrowly restricted endemics only. The narrowly restricted endemics occur in a single or a small set of SDTF nuclei only, rather than across the biome, and can hence be considered as strict habitat specialists. The results from the second analysis supported the wider analysis, indicating that the choice of species did not affect our results.
Excluding the possibility of large georeferencing errors and weedy species, our data from the ground-truthing analysis indicate two major issues with the current biome maps. First we consider the effect of poor spatial resolution. All maps, with the exception of the LCM, showed a breakdown of resolution at regional scale. Such poor spatial resolution strongly limits the use of such maps in GIS applications. This is particularly the case for areas with high elevational heterogeneity where the landscape is naturally fragmented. Our example of the Marañón Valley in the northern Peruvian Andes illustrates how the current maps oversimplify the complex landscape, mainly due to their poor resolution. The only map in our analysis which succeeded in depicting the heterogeneous landscape, showing smaller SDTF forest nuclei in the Andes, was the LCM, a map based on remotely sensed data.
Secondly, we consider the role of poor delimitation of biomes in the current maps. Whilst the LCM does not seem to suffer from lack of spatial resolution at regional scale, it suffers greatly from poor delimitation of the SDTF biome. Small fragments of SDTF depicted in the Andes are labelled under categories such as 'Shrub savannah' and 'Montane forests'. This is not surprising considering how difficult it is to distinguish between dry vegetation types with remotely sensed data alone. The poor delimitation of the SDTF and other dry biomes in the LCM suggests that there is a particular need to use ground-truthing or other validation methods in remote sensing, especially for dry biomes.
Refining SDTF Distribution
So what is the best current estimate of the SDTF distribution? We used the BDM approach to generate a refined distribution map of SDTF, where climatic and elevation data were used in conjunction with the herbarium specimen data to model the biome distribution. The modelled SDTF map strongly agrees with previously published maps [33, 45] but is higher in spatial detail. Whilst the current South American biome maps failed to accurately depict small SDTF fragments such as the Andean forest nuclei, the modelled distribution gives a more realistic representation of the biome in South America. The model performance was good, close to excellent, which gives support to the idea that modelling ecologically similar species under a single model might be a justifiable approach. A previous study of the North American mouse species Peromyscus polionotus and its 15 subspecies concluded that modelling ecologically coherent units (i.e. subspecies in their case) resulted in better distribution models compared to models where ecologically divergent subspecies were combined into a single data set. Similar studies should be done to explore model performance when mapping multiple species using the BDM approach.
The availability of a more accurate distribution map for the South American SDTF will hopefully highlight the importance and diversity of the ecosystem, and is a prerequisite for conservation planning and management. For example, despite the small size of the Andean SDTF fragments, depicting their distribution is of great importance, as these forest nuclei host unique biota comparable to the diversity found in the Galápagos Islands (e.g. Marañón Valley, Northern Peru [46, 51]). Furthermore, our ground-truthing analysis of the LCM showed that a large percentage of SDTF areas are highly degraded due to agriculture. Given that 54.2% of the remaining SDTF are estimated to be in South America based on the recent global overview of the SDTF conservation status, our results paint a dire picture of the status of these forgotten forests, where approximately half of the forest area has been degraded by agriculture. The remaining areas are becoming smaller and smaller, and hence harder to detect and depict in large-scale maps.
Use of Herbarium Specimen Data in Biome Mapping
What can we learn from this case study? Our analysis shows just how difficult it is to map highly discontinuous and fragmented vegetation like SDTF over large spatial scales. Fragmented biomes are underrepresented in biome maps in general, as smaller fragments are easily missed especially if spatial resolution is poor. Anthropogenic fragmentation poses additional challenges: vegetation cover is becoming increasingly fragmented due to human disturbance, and habitat degradation is leading to changes in vegetation structure even in biomes previously deemed structurally homogeneous. Both of these factors lead to difficulties of mapping biomes based on vegetation structure data alone (i.e. remote sensing).
This is where herbarium data from habitat specialist species could help, given that plants act as indicators of the environment as a whole. With an increasing number of georeferenced specimens available through online databases (e.g. over 1.8 million specimens available for Brazil through CRIA speciesLink alone), we argue that specimen data can contribute to the growing need for feasible validation tools for remote sensing maps (e.g. [53, 54]). Whilst ground-truthing in the field over continental scales is not feasible, it can be done in silico by downloading and cleaning herbarium data in a relatively short time over large spatial scales. Lack of validation tools has been highlighted in recent reviews as a major area requiring further research [55, 56]. Herbarium specimens are currently used in modelling species distributions and in estimating species diversity [57, 58], but no studies to our knowledge have explored the use of georeferenced specimen data as a validation tool, despite the millions of specimens available online.
In silico ground-truthing would be particularly useful for biome maps of highly environmentally heterogeneous areas such as the Andes. Current continental-scale biome maps present a depauperate picture of Andean diversity, lumping much of it into single meaningless units such as 'Montane vegetation of dry forest and open woodland'. Strongly seasonal biomes, such as SDTF, are another special case where herbarium data can provide help. Remote sensing images are often inadequate for distinguishing seasonal forests, as they can appear like humid forests during the wet season, but as shrubland during the long dry season. Highly degraded biomes and habitats provide yet another case where herbarium data could be used to study habitat loss, as specimen data accumulated over time can provide an estimate of the original distribution of vegetation cover based on plants collected before land clearance. Lastly, human-induced disturbance and habitat degradation cause issues in remote sensing, and herbarium data could be used as an aid in distinguishing between degraded savanna and degraded dry forest, which is currently not feasible with remotely sensed data alone.
Another use of herbarium data in biome mapping is the BDM approach presented in this paper. The BDM approach has previously been used to map the historical distribution of biomes using past climate conditions in combination with herbarium data [59–61], whilst our focus was to use the approach to model current biome distribution. The advantage of the BDM approach over other mapping methods is that it combines high spatial resolution environmental data with floristic data in the form of georeferenced herbarium specimens. The approach results in maps with extremely high spatial resolution (1 km × 1 km) and requires less ground-truthing, as maps are modelled based on floristic data. In the case of SDTF, the BDM approach produced a much improved biome map with relatively small effort. The new map can be considered as a working hypothesis of the SDTF distribution in South America, and as more data are added to the model, the distribution of the biome can be easily refined.
Current biome maps of South America perform poorly in depicting known fragments of SDTF, as identified from herbarium records of habitat specialist species. The poor performance of the maps can be attributed to two main factors: (1) the poor spatial resolution of the biome maps, and (2) their poor delimitation of SDTF. Georeferenced herbarium data could provide a validation tool for enhancing biome maps in general. Map schemes that rely fully on remotely sensed data could gain from the use of herbarium specimens in particular, as ground-truthing across continents with plot data is currently not feasible. The lack of studies incorporating herbarium specimens has likely been due to inadequate specimen data across species distributions, especially for tropical taxa, but the situation is rapidly improving as more information is collected and digitized, potentially leading to its use not only in validating biome maps, but also in constructing them. An alternative approach is presented where herbarium specimens are used in conjunction with environmental data to model current biome distributions. Incorporating herbarium data in biome mapping using either of the above approaches should be encouraged, especially in projects focusing on poorly known, fragmented and/or structurally heterogeneous biomes. We highlight that special attention should be given to specimen identification. Specimen determinations by taxonomic experts should be used as a means of quality control. Taxonomic sources should also be consulted in the choice of species used.
SDTF Habitat Specialist Species Occurrence Data Set
List of SDTF specialist species
Prado & Gibbs
Linares-Palomino et al.
No. of specimens included
Amburana cearensis (Fr.All.) A.C.Smith
Anadenanthera colubrina (Vell.) Brenan
Aspidosperma polyneuron Müll. Arg.
Aspidosperma pyrifolium Mart.
Balfourodendron riedelianum (Engl.) Engl.
Blanchetiodendron blanchetii (Benth.) Barneby & J.W. Grimes
Chloroleucon tenuiflorum (Benth.) Barneby & J.W. Grimes
Combretum leprosum Mart.
Cordia americana (L.) Gottschling & J.S. Mill.
Cordia incognita Gottschling & J.S. Mill.
Cyathostegia matthewsii (Benth.) Schery
Diatenopteryx sorbifolia Radlk.
Enterolobium contortisiliquum (Vell.) Morong
Geoffroea spinosa Jacq.
Machaerium aculeatum Raddi
Machaerium condensatum Kuhlm. & Hoehne
Machaerium ruddianum Mendonça Filho & A. M. G. Azevedo
Mimosa arenosa (Willd.) Poir.
Myracrodruon urundeuva Fr.All.
Nicotiana glutinosa L.
Parapiptadenia blanchetii (Benth.) Vaz & Lima
Parapiptadenia zehntneri (Harms) M. P. M. de Lima & H. C. de Lima
Peltogyne pauciflora Benth.
Peltophorum dubium (Spreng.) Taub.
Phytolacca dioica L.
Piptadenia viridiflora (Kunth) Benth.
Pityrocarpa moniliformis (Benth.) Luckow & R. W. Jobson
Pouteria gardneriana (A. DC.) Radlk.
Pterogyne nitens Tul.
Ruprechtia laxiflora Meissn.
Schinopsis brasiliensis Engl.
Sideroxylon obtusifolium (Roem. & Schult.) T.D.Penn.
Solanum amotapense Svenson
Solanum chmielewskii (C.M.Rick et al.) D.M.Spooner et al.
Solanum confertiseriatum Bitter
Solanum corumbense S.Moore
Solanum daphnophyllum Bitter
Solanum gnaphalocarpon Vell.
Solanum granuloso-leprosum Dunal
Solanum hibernum Bohs
Solanum huaylasense Peralta
Solanum hutchisonii (J.F.Macbr.) Bohs
Solanum iltisii K.E.Roe
Solanum neorickii D.M.Spooner et al.
Solanum plowmanii S.Knapp
Solanum smithii S.Knapp
Solanum stuckertii Bitter
Ximenia americana L.
Zanthoxylum fagara (L.) Sarg.
The combined distribution map of the specimens, both the full and partial dataset (Figure 4) was drawn using ArcMap 10, and was observed to match the SDTF distribution depicted in previous publications [33, 45]. Although no obvious gaps in the distribution data can be identified, the dataset had only a few specimens from northern South America (Figure 4). Most data points were for Brazil, Argentina, Paraguay and Bolivia.
Comparison of Biome Maps
The maps included in the study are freely available online or can be requested from the corresponding authors. Attribute tables of biome maps were used to obtain data on their hierarchical divisions using ArcMap 10. The number of divisions was recorded for South America only, as this was the largest area common to all the maps. The Caribbean and the islands off the coast of South America were excluded. Urban and barren areas (e.g. water, ice, and snow) were omitted from each biome map prior to calculations.
Map comparison was performed in ArcMap 10 using the full and partial herbarium specimen data sets as a way of ground-truthing. The ArcToolbox option of Spatial Join was used to join the distribution data with the biome map layer. Once the joined data file was created, the number of specimens falling into each biome category was calculated using the enquiry tool. Specimens that fell under categories 'Shrublands' and 'Deserts and xeric shrublands' were included under SDTF (Additional file 1). Because maps LAB and ESM did not distinguish SDTF under a single category, areas 'Caatinga', 'Arid Ecuador', 'Tumbes-Piura', and 'Monte' in LAB, and 'Caatinga', 'Dry Meso-America', and 'Caribbean' in ESM, were regarded as SDTF (Additional file 1).
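The logic of the spatial join and the subsequent tally can be sketched in plain Python. The analysis itself was run in ArcMap 10; the code below is only an illustration of the same idea using a standard ray-casting point-in-polygon test, and the polygon coordinates, class names, and points are hypothetical.

```python
from collections import Counter


def point_in_polygon(x, y, polygon):
    """Ray-casting test. polygon is a list of (x, y) vertices in order."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Does this edge straddle the horizontal line through the point?
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def percent_by_biome(points, biome_polygons):
    """Assign each specimen point to the first biome polygon containing it
    and return the percentage of specimens per biome class."""
    counts = Counter()
    for x, y in points:
        label = "Unclassified"
        for name, polygon in biome_polygons.items():
            if point_in_polygon(x, y, polygon):
                label = name
                break
        counts[label] += 1
    total = sum(counts.values())
    return {name: 100.0 * n / total for name, n in counts.items()}


# Hypothetical biome polygons (two adjacent squares) and specimen points.
biome_polygons = {
    "SDTF": [(0, 0), (2, 0), (2, 2), (0, 2)],
    "Cerrado": [(2, 0), (4, 0), (4, 2), (2, 2)],
}
points = [(0.5, 0.5), (1.5, 1.0), (2.5, 1.0), (3.5, 0.5), (5.0, 5.0)]
percentages = percent_by_biome(points, biome_polygons)
```

Merging several map classes into SDTF, as was done for LAB and ESM, would amount to relabelling those classes before the percentages are computed.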
Distribution Modelling of SDTF
A new map for the SDTF was constructed using an approach referred to here as biome distribution modelling (BDM). BDM is based on species distribution modelling, where environmental variables are used in conjunction with species occurrence data to model species distributions. Instead of modelling a single species distribution, BDM uses a composite data set of habitat specialist species to model the distribution of the whole biome.
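The pooling step that distinguishes BDM from single-species modelling can be sketched as follows. This is an illustration under stated assumptions rather than the study's workflow: the species labels and coordinates are hypothetical, and thinning to one presence per 30 arc-second grid cell is an assumption added here, since the text does not say whether records were thinned before modelling.

```python
import math

CELL_DEG = 30.0 / 3600.0  # 30 arc-seconds expressed in decimal degrees


def pool_to_cells(records, cell_deg=CELL_DEG):
    """Discard species labels and keep one presence per occupied grid cell.

    records is an iterable of (species, lat, lon) tuples; the result is a
    set of (row, col) grid-cell indices treated as biome presences.
    """
    cells = set()
    for _species, lat, lon in records:
        row = math.floor(lat / cell_deg)
        col = math.floor(lon / cell_deg)
        cells.add((row, col))
    return cells


# Hypothetical records from two specialist species; the first two fall in
# the same grid cell and therefore count as a single biome presence.
records = [
    ("Species A", -6.5001, -78.5001),
    ("Species B", -6.5002, -78.5002),
    ("Species A", -6.6040, -78.5040),
]
biome_presences = pool_to_cells(records)
```

The resulting set of cells plays the role that a single species' occurrence records play in an ordinary species distribution model.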
BDM was performed using the maximum entropy model as implemented in the Maxent software [64, 65], as the model has been shown to perform well against other presence-only models. The model uses the principle of maximum entropy density estimation to generate a probability distribution based on presence-only data [64, 65]. A single model was constructed for the South American SDTF using the complete herbarium specimen data set of 6,300 records. Input environmental variables included 19 bioclimatic variables and elevation data from WorldClim at 30 arc-second spatial resolution (c. 1 km², http://www.worldclim.org/bioclim). The layers were downloaded in tiles, including tiles 23-24, 33-34, and 43-44. The entire set of 19 climatic variables was used to avoid any a priori assumptions of correlations among the variables.
Maxent 3.3.2 (http://www.cs.princeton.edu/) was run with default settings: a convergence threshold of 10⁻⁵, a maximum of 500 iterations, and regularisation = 1. The distribution data set was partitioned so that 30% of the records were omitted from model building and used as a test data set (1,025 specimens). Model output was evaluated using the area under the curve (AUC) of the receiver operating characteristic (ROC) plot; an AUC of 1 indicates optimal performance, whilst an AUC of 0.5 indicates performance no better than random. Ten replicate runs of the model, each with a random seed, were used to derive the mean and standard deviation (SD) of the AUC scores. The importance of the input environmental variables in model building was measured using a jackknife test, which compares gains between models run with and without each environmental variable and so measures the relative importance of each variable to the final model.
The resulting distribution is given as logistic values, where values near 0 indicate low probability and values near 1 indicate high probability of presence. The map was generated by visualising all areas with a logistic value > 0.5. Omission rates at this threshold were 36% for the training and 37% for the test data set. The map is available from the authors on request.
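The evaluation steps described above (random 70/30 partition, AUC, and omission at the 0.5 threshold) can be sketched in plain Python. This is an illustrative sketch under stated assumptions, not the Maxent implementation: the function names are hypothetical, the scores are made up, and AUC is computed here as the probability that a presence record outscores a background point, which is assumed to match how presence-only models are usually scored.

```python
import random
from statistics import mean, stdev


def partition(records, test_fraction=0.3, seed=None):
    """Randomly hold out a fraction of the records as test data."""
    rng = random.Random(seed)
    shuffled = list(records)
    rng.shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]  # (train, test)


def auc(presence_scores, background_scores):
    """Probability that a presence outscores a background point (ties count half)."""
    wins = 0.0
    for p in presence_scores:
        for b in background_scores:
            if p > b:
                wins += 1.0
            elif p == b:
                wins += 0.5
    return wins / (len(presence_scores) * len(background_scores))


def omission_rate(presence_scores, threshold=0.5):
    """Fraction of presence records falling outside the area mapped at `threshold`."""
    missed = sum(1 for score in presence_scores if score <= threshold)
    return missed / len(presence_scores)


# Made-up logistic scores, for illustration only.
train, test = partition(range(10), test_fraction=0.3, seed=1)
example_auc = auc([0.9, 0.8], [0.1, 0.2])
example_omission = omission_rate([0.6, 0.4, 0.5, 0.9])
```

Repeating the partition and model fit with different seeds gives a list of AUC values, from which `mean` and `stdev` yield the mean and SD across replicate runs.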
We thank Sandra Knapp who made invaluable comments on early versions of the manuscript, and Nadia Bystriakova for advice on Maxent. We thank NatureServe (collaboration with the Centro de Datos para la Conservación of the Universidad Nacional Agraria La Molina, the Instituto de Investigación de la Amazonía Peruana, Gonzalo Navarro, and Wanderley Ferreira) who provided the Ecological Systems Data. The study was funded by the National Science Foundation (NSF) grant "PBI Solanum - a world treatment" DEB-0316614 [TS], and the Universidad Nacional de Rosario and Consejo Nacional de Investigaciones Científicas y Técnicas, Argentina [DEP].
- Raven PH: Tropical floristics tomorrow. Taxon. 1988, 37: 549-560.
- Prance GT: A comparison of the efficacy of higher taxa and species numbers in the assessment of biodiversity in the Neotropics. Phil Trans R Soc B London. 1994, 345: 89-99.
- Thomas WW: Conservation and monographic research on the flora of Tropical America. Biodivers Conserv. 1999, 8: 1007-1015.
- Myers N, Mittermeier RA, Mittermeier CG, da Fonseca GAB, Kent J: Biodiversity hotspots for conservation priorities. Nature. 2000, 403: 853-858.
- White F: The vegetation of Africa: a descriptive memoir to accompany the Unesco/AETFAT/UNSO vegetation map of Africa. 1983, Paris, Unesco.
- White F: The AETFAT chorological classification of Africa: history, methods and applications. Bull Jard Bot Nat Belg. 1993, 62: 225-281.
- Kindt R, Osino D, Orwa C, Nzisa A, van Breugel P, Graudal L, Lilleso JPB, Kehlenbeck K, Dietz J, Nyabenge M, Jamnadass R, Neufeld H: Useful tree species for Africa: interactive vegetation maps and species composition tables based on the Vegetation Map of Africa. 2011, Nairobi, World Agroforestry Centre.
- Cardillo M: Phylogenetic structure of mammal assemblages at large geographical scales: linking phylogenetic community ecology with macroecology. Philos T R Soc B. 2011, 366: 2545-2553.
- Fernández MH, Vrba ES: Macroevolutionary processes and biomic specialization: testing the resource-use hypothesis. Evol Ecol. 2005, 19: 199-219.
- Kelt DA, Meyer MD: Body size frequency distributions in African mammals are bimodal at all spatial scales. Global Ecol Biogeogr. 2009, 18: 19-29.
- ter Steege H, Pitman NCA, Phillips OL, Chave J, Sabatier D, Duque A, Molino JF, Prevost MF, Spichiger R, Castellanos H, von Hildebrand P, Vasquez R: Continental-scale patterns of canopy tree composition and function across Amazonia. Nature. 2006, 443: 444-447.
- Engelbrecht BMJ, Comita LS, Condit R, Kursar TA, Tyree MT, Turner BL, Hubbell SP: Drought sensitivity shapes species distribution patterns in tropical forests. Nature. 2007, 447: 80-82.
- Carnaval AC, Hickerson MJ, Haddad CFB, Rodrigues MT, Moritz C: Stability predicts genetic diversity in the Brazilian Atlantic Forest hotspot. Science. 2009, 323: 785-789.
- Hoorn C, Wesselingh FP, ter Steege H, Bermudez MA, Mora A, Sevink J, Sanmartin I, Sanchez-Meseguer A, Anderson CL, Figueiredo JP, Jaramillo C, Riff D, Negri FR, Hooghiemstra H, Lundberg J, Stadler T, Sarkinen T, Antonelli A: Amazonia through time: Andean uplift, climate change, landscape evolution, and biodiversity. Science. 2010, 330: 927-931.
- Jaramillo C, Ochoa D, Contreras L, Pagani M, Carvajal-Ortiz H, Pratt LM, Krishnan S, Cardona A, Romero M, Quiroz L, Rodriguez G, Rueda MJ, de la Parra F, Moron S, Green W, Bayona G, Montes C, Quintero O, Ramirez R, Mora G, Schouten S, Bermudez H, Navarrete R, Parra F, Alvaran M, Osorno J, Crowley JL, Valencia V, Vervoort J: Effects of rapid global warming at the Paleocene-Eocene boundary on Neotropical vegetation. Science. 2010, 330: 957-961.
- Olson DM, Dinerstein E, Wikramanayake ED, Burgess ND, Powell GVN, Underwood EC, D'Amico JA, Itoya I, Strand HE, Morrison JC, Loucks CJ, Allnutt TF, Ricketts TH, Kura Y, Lamoreux JF, Wettengel WW, Hedao P, Kassem KR: Terrestrial ecoregions of the world: a new map of life on earth. BioScience. 2001, 51: 933-938.
- Eva HD, Belward AS, de Miranda EE, di Bella CM, Gond V, Huber O, Jones S, Sgrenzaroli M, Fritz S: A land cover map of South America. Glob Change Biol. 2004, 10: 731-744.
- Cavender-Bares J, Wilczek A: Integrating micro- and macroevolutionary processes in community ecology. Ecology. 2003, 84: 592-597.
- McInnes L, Baker WJ, Barraclough TG, Dasmahapatra KK, Goswami A, Harmon LJ, Morlon H, Purvis A, Rosindell J, Thomas GH, Turvey ST, Phillimore AB: Integrating ecology into macroevolutionary research. Biol Lett. 2011.
- Crisp MD, Arroyo MTK, Cook LG, Gandolfo MA, Jordan GJ, McGlone MS, Weston PH, Westoby M, Wilf P, Linder PH: Phylogenetic biome conservatism on a global scale. Nature. 2009, 458: 754-756.
- He K, Zhang JT: Testing the correlation between beta diversity and differences in productivity among global ecoregions, biomes, and biogeographical realms. Ecol Inform. 2009, 4: 93-98.
- Fritz SA, Bininda-Emonds ORP, Purvis A: Geographical variation in predictors of mammalian extinction risk: big is bad, but only in the tropics. Ecol Lett. 2009, 12: 538-549.
- Bofarull AM, Royo AA, Fernandez MH, Ortiz-Jaureguizar E, Morales J: Influence of continental history on the ecological specialization and macroevolutionary processes in the mammalian assemblage of South America: Differences between small and large mammals. BMC Evol Biol. 2008, 8:
- Malhi Y, Aragao LEOC, Galbraith D, Huntingford C, Fisher R, Zelazowski P, Sitch S, McSweeney C, Meir P: Exploring the likelihood and mechanism of a climate-change-induced dieback of the Amazon rainforest. P Natl Acad Sci. 2009, 106: 20610-20615.
- von Humboldt A, Bonpland A: Essai sur la geographie des plantes. 1805, Paris, Levrault, Schoell & Cie.
- Wiens JJ, Graham CH: Niche conservatism: Integrating evolution, ecology, and conservation biology. Annu Rev Ecol Evol Syst. 2005, 36: 519-539.
- Donoghue MJ: A phylogenetic perspective on the distribution of plant diversity. P Natl Acad Sci. 2008, 105: 11549-11555.
- Pennington RT, Lavin M, Sarkinen T, Lewis GP, Klitgaard BB, Hughes CE: Contrasting plant diversification histories within the Andean biodiversity hotspot. P Natl Acad Sci. 2010, 107: 13783-13787.
- Daly DC, Mitchell JD: Lowland vegetation of tropical South America - an overview. Imperfect balance: Landscape transformations in the pre-Colombian Americas. Edited by: Lentz D. 2000, New York, Columbia University Press, 391-454.
- Morrone JJ: Biogeografia de America Latina y el Caribe. 2001, Zaragoza, Manuales & Tesis SEA.
- Josse C, Navarro G, Comer P, Evans R, Faber-Langendoen D, Fellows M, Kittel S, Menard S, Pyne M, Reid M, Schulz K, Snow K, Teague J: Ecological systems of Latin America and the Caribbean: A working classification of terrestrial systems. 2003, Arlington VA, NatureServe.
- Prado DE, Gibbs PE: Patterns of species distributions in the Dry Seasonal Forests of South America. Ann Mo Bot Gard. 1993, 80: 902-927.
- Prado DE: Seasonally dry forests of tropical South America: from forgotten ecosystems to a new phytogeographic unit. Ed J Bot. 2000, 57: 437-461.
- Pennington RT, Ratter JA, Lewis GP: An overview of the plant diversity, biogeography and conservation of neotropical savannas and seasonally dry forests. Neotropical savannas and seasonally dry forests: plant biodiversity, biogeography and conservation. Edited by: Pennington RT, Ratter JA, Lewis GP. 2006, Florida, CRC Press, 1-29.
- Dirzo R, Mooney H, Ceballos G, Young H: Seasonally Dry Tropical Forests: Ecology and Conservation. 2011, Island Press.
- Werneck FP, Costa GC, Colli GR, Prado DE, Sites JW: Revisiting the historical distribution of seasonally dry tropical forests: new insights based on palaeodistribution modelling and palynological evidence. Global Ecol Biogeogr. 2011, 20: 272-288.
- Murphy P, Lugo AE: Ecology of tropical dry forests. Annu Rev Ecol Syst. 1986, 17: 67-88.
- Gentry AH: Diversity and floristic composition of neotropical dry forests. Seasonally dry tropical forests. Edited by: Bullock SH, Mooney HA, Medina E. 1995, Cambridge, Cambridge University Press, 146-194.
- Werneck FP: The diversification of eastern South American open vegetation biomes: historical biogeography and perspectives. Quat Sci Rev. 2011, 30: 1630-1648.
- Furley PA, Ratter JA: Soil resources and plant communities of the central Brazilian cerrado and their development. J Biogeogr. 1988, 15: 97-108.
- Prado DE: What is the Gran Chaco vegetation in South America? I. A review. Contribution to the study of flora and vegetation of the Chaco. V. Candollea. 1993, 48: 145-172.
- Prado DE: What is the Gran Chaco vegetation in South America? II. A redefinition. Contribution to the study of flora and vegetation of the Chaco. VII. Candollea. 1993, 48: 615-629.
- Portillo-Quintero CA, Sanchez-Azofeifa GA: Extent and conservation of tropical dry forests in the Americas. Biol Conserv. 2011, 143: 144-155.
- Linares-Palomino R, Oliveira-Filho AT, Pennington RT: Neotropical seasonally dry forests: Diversity, endemism, and biogeography of woody plants. Seasonally Dry Tropical Forests: Ecology and Conservation. Edited by: Dirzo R, Mooney H, Ceballos G, Young H. 2011, Island Press, 3-21.
- Pennington RT, Prado DE, Pendry CA: Neotropical seasonally dry forests and Quaternary vegetation changes. J Biogeogr. 2000, 27: 261-273.
- Linares-Palomino R: Phytogeography and floristics of seasonally dry tropical forests in Peru. Neotropical savannas and seasonally dry forests: plant biodiversity, biogeography and conservation. Edited by: Pennington RT, Ratter JA, Lewis GP. 2006, Florida, CRC Press, 257-279.
- Oliveira-Filho AT: Classificação das fitofisionomias da América do Sul cisandina tropical e subtropical: proposta de um novo sistema - prático e flexível - ou uma injeção a mais de caos? Rodriguésia. 2009, 60: 237-258.
- Kreft H, Jetz W: A framework for delineating biogeographical regions based on species distributions. J Biogeogr. 2010, 37: 2029-2053.
- Kalacska M, Sanchez-Azofeifa GA, Rivard B, Calvo-Alvarado JC, Quesada M: Baseline assessment for environmental services payments from satellite imagery: A case study from Costa Rica and Mexico. J Environ Manag. 2008, 88: 348-359.
- Gonzalez SC, Soto-Centeno JA, Reed DL: Population distribution models: species distributions are better modelled using biologically relevant data partitions. BMC Ecol. 2011, 11: 20.
- Särkinen TE, Marcelo-Peña JL, Yomona AD, Simon MF, Pennington RT, Hughes CE: Underestimated endemic species diversity in the dry inter-Andean valley of the Río Marañón, northern Peru: An example from Mimosa (Leguminosae, Mimosoideae). Taxon. 2011, 60: 139-150.
- Miles L, Newton AC, DeFries RS, Ravilious C, May I, Blyth S, Kapos V, Gordon JE: A global overview of the conservation status of tropical dry forests. J Biogeogr. 2006, 33: 491-505.
- Foody GM: Status of land cover classification accuracy assessment. Remote Sens Environ. 2002, 80: 185-201.
- Turner W, Spector S, Gardiner N, Fladeland M, Sterling E, Steininger M: Remote sensing for biodiversity science and conservation. Trends Ecol Evol. 2003, 18: 306-314.
- Gottschalk TK, Huettmann R, Ehlers M: Thirty years of analysing and modelling avian habitat relationships using satellite imagery data: a review. Int J Remote Sensing. 2005, 26: 2631-2656.
- Gillespie TW, Foody GM, Rocchini D, Giorgi AP, Saatchi S: Measuring and modelling biodiversity from space. Prog Phys Geog. 2008, 32: 203-221.
- Raxworthy CJ, Martinez-Meyer E, Horning N, Nussbaum RA, Schneider GE, Ortega-Huerta MA, Peterson AT: Predicting distributions of known and unknown reptile species in Madagascar. Nature. 2003, 426: 837-841.
- Saatchi S, Buermann W, ter Steege H, Mori S, Smith TB: Modeling distribution of Amazonian tree species and diversity using remote sensing measurements. Remote Sens Environ. 2008, 112: 2000-2017.
- Carnaval AC, Moritz C: Historical climate modelling predicts patterns of current biodiversity in the Brazilian Atlantic forest. J Biogeogr. 2008, 35: 1187-1201.
- Graham CH, Moritz C, Williams SE: Habitat history improves prediction of biodiversity in rainforest fauna. P Natl Acad Sci. 2006, 103: 632-636.
- Werneck FP, Costa GC, Colli GR, Prado D, Sites JW: Revisiting the historical distribution of Seasonally Dry Tropical Forests: new insights based on palaeodistribution modelling and palynological evidence. Global Ecol Biogeogr. 2011, 20: 272-288.
- Tobler MW, Honorio E, Janoyec J, Reynel C: Implications of collection patterns of botanical specimens on their usefulness for conservation planning: an example of two neotropical plant families (Moraceae and Myristicaceae) in Peru. Biodivers Conserv. 2007, 16: 659-677.
- Oliveira-Filho AT: TreeAtlan 2.0, Flora arbórea da América do Sul cisandina tropical e subtropical: Um banco de dados envolvendo biogeografia, diversidade e conservação. Universidade Federal de Minas Gerais. 2010, [http://www.icb.ufmg.br/treeatlan/]
- Phillips SJ, Dudík M, Schapire RE: A maximum entropy approach to species distribution modeling. Proc Twenty-First Int Conf Machine Learning. 2004, 655-662.
- Phillips SJ, Anderson RP, Schapire RE: Maximum entropy modeling of species geographic distributions. Ecol Modelling. 2006, 190: 231-259.
- Elith J, Graham CH, Anderson RP, Dudik M, Ferrier S, Guisan A, Hijmans RJ, Huettmann F, Leathwick JR, Lehmann A, Li J, Lohmann LG, Loiselle BA, Manion G, Moritz C, Nakamura M, Nakazawa Y, Overton JMcC, Townsend Peterson A, Phillips SJ, Richardson K, Scachetti-Pereira R, Schapire RE, Soberon J, Williams S, Wisz MS, Zimmermann NE: Novel methods improve prediction of species' distributions from occurrence data. Ecography. 2006, 29: 129-151.
- Hijmans RJ, Cameron SE, Parra JL, Jones PG, Jarvis A: Very high resolution interpolated climate surfaces for global land areas. Int J Climatology. 2005, 25: 1965-1978.
- Ratter JA, Ribeiro JF, Bridgewater S: The Brazilian Cerrado vegetation and threats to its biodiversity. Ann Bot. 1997, 80: 223-230.
- Ratter JA, Bridgewater S, Ribeiro JF: Analysis of the floristic composition of the Brazilian Cerrado vegetation III: comparison of the woody vegetation of 376 areas. Ed J Bot. 2003, 60: 57-109.