Insights into biodiversity sampling strategies for freshwater microinvertebrate faunas through bioblitz campaigns and DNA barcoding
- Brandon J Laforest†1Email author,
- Amanda K Winegardner†2,
- Omar A Zaheer3, 4,
- Nicholas W Jeffery3, 4,
- Elizabeth E Boyle3, 4 and
- Sarah J Adamowicz3, 4
© Laforest et al.; licensee BioMed Central Ltd. 2013
Received: 13 December 2012
Accepted: 14 March 2013
Published: 4 April 2013
Biodiversity surveys have long depended on traditional methods of taxonomy to inform sampling protocols and to determine when a representative sample of a given species pool of interest has been obtained. Questions remain as to how to design appropriate sampling efforts to accurately estimate total biodiversity. Here we consider the biodiversity of freshwater ostracods (crustacean class Ostracoda) from the region of Churchill, Manitoba, Canada. Through an analysis of observed species richness and complementarity, accumulation curves, and richness estimators, we conduct an a posteriori analysis of five bioblitz-style collection strategies that differed in terms of total duration, number of sites, protocol flexibility to heterogeneous habitats, sorting of specimens for analysis, and primary purpose of collection. We used DNA barcoding to group specimens into molecular operational taxonomic units for comparison.
Forty-eight provisional species were identified through genetic divergences, up from the 30 species previously known and documented in literature from the Churchill region. We found differential sampling efficiency among the five strategies, with liberal sorting of specimens for molecular analysis, protocol flexibility (and particularly a focus on covering diverse microhabitats), and a taxon-specific focus to collection having strong influences on garnering more accurate species richness estimates.
Our findings have implications for the successful design of future biodiversity surveys and citizen-science collection projects, which are becoming increasingly popular and have been shown to produce reliable results for a variety of taxa despite relying on largely untrained collectors. We propose that efficiency of biodiversity surveys can be increased by non-experts deliberately selecting diverse microhabitats; by conducting two rounds of molecular analysis, with the numbers of samples processed during round two informed by the singleton prevalence during round one; and by having sub-teams (even if all non-experts) focus on select taxa. Our study also provides new insights into subarctic diversity of freshwater Ostracoda and contributes to the broader “Barcoding Biotas” campaign at Churchill. Finally, we comment on the associated implications and future research directions for community ecology analyses and biodiversity surveys through DNA barcoding, which we show here to be an efficient technique enabling rapid biodiversity quantification in understudied taxa.
KeywordsOstracoda Crustacea Barcoding biotas Sampling strategy Bioblitz Citizen science Species richness Zooplankton Accumulation curves Subarctic
One of the biggest impediments to conducting large-scale biodiversity surveys lies in the taxonomic identification of target organisms. This is especially true when dealing with microinvertebrates, where defining morphological features are often discernible only through intensive methods such as slide preparation and microscopy. One group of organisms that exemplifies this dilemma is the small-bodied crustacean class Ostracoda. Ostracods are very common in benthic freshwater communities, but also occur in marine, intertidal, or semi-terrestrial environments. They are useful model organisms for studies on various aspects of ecology and evolution [1–5], given the high prevalence of their calcified bivalve shells in the freshwater fossil record as well as their variability in breeding systems [6, 7]. In freshwater systems alone, the class Ostracoda has been conservatively estimated to number close to 2,000 described species , with 420 freshwater species recorded for North America [9, 10]. Taxonomic keys are available to the species level for North America and Europe [10–13], and many surveys describe the regional diversity of the class (e.g. [14–20]). The projected global diversity in all habitat types is estimated to be approximately 13,000 .
An infrequently discussed challenge in conducting biodiversity surveys is how to design and implement a suitable sampling strategy. While many studies have compared the efficacy of various field collection methods for capturing accurate estimates of planktonic invertebrate community structure [21–29], there has been little discussion of the idea of sampling strategy as a whole in terms of study objectives, sampling instrumentation, time commitments, adaptation of field methods in response to environmental heterogeneity, and sorting of samples prior to identification both in the field and in the laboratory. Given that the sample size of microinvertebrate community analyses is always much greater than the resources available to identify each individual organism to the appropriate taxonomic level, this sorting of organisms representing the sample community is of utmost importance. Previous studies have demonstrated the presence of cryptic species in microinvertebrates [30–32], and highlight the potential to overlook species with cryptic morphology as well as those with low abundance .
Establishing timeframes for microinvertebrate surveys can be linked to many different factors such as limited funding associated with fieldwork, appropriate weather windows for collection, and the availability of trained personnel. These limitations are especially applicable to studies conducted in remote locations, as well as areas of intense seasonality. Conducting fieldwork in these regions should be made as efficient as possible not only to limit associated costs, but to limit human interference on the natural system. Furthermore, while there is discussion in the literature on appropriate standards for comparing sampling strategies for freshwater bodies of various size and habitat diversity [23, 29], there is less discussion on the rationale behind intensive sampling. This is a key point as more scientists participate in public research, and more research projects involve an aspect of citizen science.
Citizen science involves collaboration between scientists and volunteers to gather field and observational data , and several studies have found that these types of collaborations produce reliable data that would be difficult to gather by any individual research group or scientist [35, 36]. For biodiversity studies, citizen science projects often encompass large-scale “bioblitzes” that involve collecting a large number of organisms in a short time period, often as short as a few hours (e.g. http://www.get-to-know.org/bioblitz/). Originally coined in 1996 by Susan Rudy of the U.S. National Park Service, the term bioblitz is now widely employed, with citizen-science bioblitzes recorded in countries such as Canada, New Zealand, Portugal, and Taiwan. The results of bioblitzes are typically not published in the scientific literature, despite their widespread occurrence and potential for inclusion . This may be changing, as demonstrated by the August 2012 issue of Frontiers in Ecology and the Environment, a special issue dedicated to the publishing of citizen-science research. For these sampling campaigns to remain an effective and efficient use of citizen-scientist collaborations, sampling strategies and specific objectives that may be served by these efforts should be discussed and evaluated. Here, we quantify and compare the outcomes of different student bioblitzes within the Churchill barcoding biotas campaign to measure collection effort in relation to biodiversity yield.
For animals, DNA barcoding using a region of the mitochondrial gene cytochrome c oxidase subunit I (COI) is an increasingly common method both for identifying species and for quantifying provisional species diversity [38–41], and can be used to evaluate and compare sampling strategies for biodiversity surveys. Through separating a sample of organisms into molecular operational taxonomic units (MOTUs), it is possible to calculate provisional species richness without the need for morphology-based identifications. DNA barcoding has been previously employed to build accumulation curves for understudied taxa such as parasitoid wasps (Ichneumonidae, Braconidae, Cynipidae and Diapriidae), with barcode-based accumulation curves indicating higher diversity, but the same shape, than accumulation curves built using morphospecies . While Linnean identifications are useful for community ecology studies, due to the possibility of linking with environmental data, the rapid quantification of biodiversity lends itself nicely to answering questions of species richness, species assemblage patterns, and sampling strategy comparison. As the reference library for the Barcode of Life Data Systems (BOLD)  grows, more of these unknown MOTUs will be linked to known species and allow for more sophisticated community ecology questions to be asked.
We employed DNA barcoding to compare five sampling strategies of subarctic freshwater ostracods in Churchill, Manitoba, Canada from 2007–2011, using MOTUs as surrogates for species. The present study does not test the effectiveness of DNA barcoding in recovering species boundaries for this group; rather, we use DNA barcoding as a tool to address our main study objective. We assume here that genetic patterns in the freshwater Ostracoda of Churchill mirror those of other microcrustaceans. For example, studies of the Branchiopoda of Churchill , freshwater microcrustaceans of Mexico and Guatemala , and marine zooplanktonic ostracods  have shown strong separation of described species based upon DNA barcodes.
This study presents an a posteriori analysis evaluating the success of five sampling strategies in both capturing and estimating the regional diversity of freshwater ostracods in the Churchill region, as this site was selected for an intensive “barcoding biotas” regional biodiversity survey employing DNA barcoding methods (introduced in ). Methodological differences among the sampling strategies prevent analysis into the effect of individual variables on strategy success, but still allow for broad-scale exploration of factors influencing the success of collection events at the scale of bioblitzes. The strategies differed in their primary objectives, duration of time spent sampling, number of sites sampled, and method of sorting of samples prior to analysis and deciding which samples to submit for DNA barcoding analysis. It was predicted that our comparison of these strategies would reveal differences in their effectiveness, yielding useful information for the design of future microinvertebrate surveys, with an emphasis on student or citizen bioblitzes. Previous studies [21–29] have compared sampling methods (e.g. tow nets, D-nets, hand nets) but did not use the same methods to measure or compare sampling effectiveness. By contrast, our study provides evidence that the rationale behind a sampling strategy is as important as the equipment used during bioblitzes (especially those with non-expert volunteers). We suggest that a focus on sampling diverse microhabitats is effective and that having two rounds of specimen selection for DNA barcoding will increase efficiency of molecular resource use for quantifying species diversity.
Description of study site
Summary of the methods used and numbers of genetic clusters found in each ostracod study
# of sites
Primary target of collection
Number of clusters separated by 2% divergence
Number of specimens barcoded
Number of clusters captured that only appeared in 1 or 2 sampling strategies
Time- based effort
July 7–21, 2007
July 9–17, 2008
July 7–21, 2007 & July 9–17, 2008
June 3-Aug 25, 2010
64, 100, 250
Insecta (with mixed invertebrates retained)
July 22-Aug 2, 2011
Field sampling strategies and morphological sorting
Time-based sampling (2007)
From July 7–21, 2007, we sampled nine sites including four coastal rock pool sites, three tundra ponds, and two flowing freshwater habitats, with a complete focus on ostracod biodiversity. Each site was sampled for up to one hour, or when the specimens collected were thought to accurately represent the ostracod biodiversity of the location based on broad-scale morphological classification, whichever came first. Samples were gathered using a standard home aquarium fish net with a mesh size of 100 μm and a 100 μm D-net employed using a poke-and-collect method. All specimens were live-sorted into gross morphospecies using a standard dissecting microscope (20x magnification) in a field laboratory setting based on colour, size, shape, collection date, and collection site. A total of 21 morphospecies was identified in the field, but this was considered to be a liberal number as morphospecies were numbered for each site and not harmonized regionally across sites. A minimum of one individual per morphospecies per site was included in the genetic analysis, but most cases included at least two individuals of each morphospecies from each site, for a total of 94 specimens. This method was based on the time targets associated with collection, the use of a single methodology during collection, and a reliance on field morphospecies when sorting specimens for analysis.
Rapid-blitz sampling (2008)
Over six collecting dates from July 9–17, 2008, we sampled 16 sites including eight tundra or fen ponds, one coastal rock bluff pool, three lakes or reservoirs, and four flowing habitats. A rapid-blitz sampling strategy was employed, with time varying across sites according to habitat size and complexity. At each site, a deliberate, active search was performed to sample the variety of microhabitats present, including both planktonic zones and littoral zones to a depth of approximately 0.5 m. While this strategy did employ different methods across habitats, the variability was unsystematic in that there was no set objective for the study prior to sampling, other than contributing to the general species survey at Churchill, with a focus on microcrustaceans. Therefore, no sampling protocol was developed prior to the collection. An aquarium net of 100 μm mesh size was employed, with the net run through the water column rapidly, over and through vegetation, and over the surface of the substrate. Sampling time was at least 10 minutes per site, with the least time spent at rock bluff pools and more time spent at larger habitats and those with abundant vegetation. Samples were brought back to the field laboratory alive, in water from their own habitat, to be sorted on a light box. From each site, specimens were identified by eye to gross morphospecies (including sorting by size, colour, and shape). The intent was to fill a single plate (95 specimens) for DNA barcoding, with the preliminary results used to inform a second round of sorting of the same samples. Typically, at least two individuals per gross morphospecies per site were selected when available, and sometimes up to five in cases of substantial variability within provisional morphospecies. Selected samples were placed individually into small tubes. This method employed a single method of collection and relied on field morphospecies when sorting specimens for analysis.
Liberal re-sort of previous sampling methods (2007/2008)
To test the efficiency of live sorting ostracods into morphospecies in a field laboratory setting immediately following collection, we re-sampled extra specimens from the above two collection efforts, spanning 14 collection sites. We originally did not select these specimens for genetic analysis as it was thought that they represented replicates. We sampled 190 additional individuals from the unused sample pools originally sorted in the field following their collection event (with 93 from the 2007 and 97 from the 2008 samples). Multiple replicates of morphospecies were selected (sample size limited by availability of archived samples), and broader-scale methods of classification of morphospecies were employed as much of the colour had faded from the specimens due to storage. We also avoided one conspicuous species of ostracod (large-bodied, blue morphospecies) which had been overrepresented in previous analyses. This method built upon preliminary evidence of undersampling and liberally included specimens for analysis from the other projects, representing a deviation from previously established methodologies which had placed emphasis on field morphospecies to inform sorting procedure.
Fixed-protocol field method (2010)
From early June to late August, 2010, we sampled 75 sites across five freshwater habitat types (30 tundra ponds, 30 coastal rock pools, five shallow lakes, seven creeks, and three points along the Churchill River). Each site was sampled three times throughout the spring/summer season, one sampling event for each site in June, July, and August. The sites were sampled in approximately the same order each month. The sites were sampled for the entire aquatic insect and zooplankton community; thus, ostracods were not the sole or main focus of collection. Of these 75 sites, we found ostracods in only 27, and therefore only these sites were included for further analysis in this study. For each site, we selected a sampling location of 20 metres in length parallel to the water edge/shore and sampled 1.5 metres and 5 metres from the edge along this 20 metre transect. If the habitat was too small to mark out a 20 metre transect, we sampled along a transect that covered the longest length of the habitat. Two collection methods were employed as a sampling strategy for these sites. The first collection method involved moving along the 20 metre transect with a dip net (either 100 μm or 250 μm), disturbing the substrate to acquire benthic organisms, in addition to running the dip net throughout the water column to collect any pelagic species. This walking of the transect was done twice at each site. The second collection method involved tossing a plankton tow net (64 μm) up to 10 m from shore twice from a different point each time along the transect and pulling it back towards the collector. Both of these methods were employed to the best of our abilities at all sites, regardless of habitat type. We immediately separated the ostracods into morphospecies based on site, colour, size, and shape using a light box and a standard dissecting microscope in a field laboratory setting, and preserved them in ethanol. Upon return from the field station, samples were stored at −20°C. Two individuals per morphospecies per sampling event were included in the genetic analyses, for a total of 90 specimens. This field method used a pre-defined protocol at each site without consideration of site-specific appropriateness in an attempt to increase consistency to allow for direct comparisons between sites and over time.
Flexible-protocol field method (2011)
Specimens were collected from the planktonic and benthic zones of 20 small freshwater or brackish pools located on rock bluffs along the Hudson Bay coast, and 22 tundra ponds from July 22- August 2, 2011. Sites were sampled using separate planktonic and benthic protocols, with the major focus being on microcrustaceans although other taxa were retained as well. Planktonic samples were collected using a plankton net (153 μm), with a total of three tosses encompassing as many different parts of the water body as possible (varying microhabitats, direction, position along shore). While the 153 μm plankton net was a larger mesh size than the nets used for the other strategies, similar or larger mesh sizes have proven effective in other ostracod surveys [17, 18], and this is close to the 100–150 μm range recommended for surveying Ostracoda . For each toss, the net was allowed to sink as close to the bottom as possible (without touching) and then pulled to shore at a slightly upward angle. If the water body was too shallow for a plankton net toss; then collection was conducted with a 100 μm hand net using figure-eight patterns over the accessible parts of the water body. Benthic samples were collected with a 250 μm dip net for water bodies with rocky substrates and with a 153 μm weighted plankton net for softer substrates. For dip net collection, the substrate was kicked up and the net was moved in figure-eight patterns over a 2–3 m transect perpendicular to shore. For plankton net collection, the net sank to the bottom and was pulled to shore along a 2–3 m transect. Clean planktonic samples were filtered using a 100 μm hand net. Plankton samples containing debris and all benthic samples were refrigerated in their own water. Morphospecies were sorted as per the above protocols. At least 10 individuals per gross morphospecies were selected for preservation. Preserved specimens were stored at room temperature until August 3rd 2010, at which point they were moved to −20°C freezer storage. Under a dissecting microscope (with 10-80X magnification range), two to three individuals per morphospecies per site per zone (planktonic/benthic) were selected for barcoding, for a total of 283 specimens. This field method considered which methods were appropriate for a given habitat, in that two methods were not employed for small and shallow habitats, and the project was focused on microcrustaceans.
Specimen locality data, digital photos, and sequences were uploaded to the Barcode of Life Data Systems (BOLD) database . All data are available as one dataset, entitled DS-Freshwater Ostracoda of Churchill [OSTCHU], accessible via the following permanent DOI (Will be assigned upon acceptance of the manuscript). The five different codes for the “Process IDs” (specimen identifiers assigned by BOLD to refer to the sequences for all specimens) reflect a total of 5 former projects on BOLD, each linked to one of the five collecting/sorting strategies (Table 1). DNA barcoding and sequence alignment was conducted according to standard methods , with the specifics of the protocols further described in Additional file 1: Appendix A.
For inclusion in analysis, sequences must have had >200 bp and <2% Ns. This is below the cut-off for the barcode standard for building a reference database of DNA barcodes (>500 bp and <1% Ns). However, shorter sequences can still be reliably matched to conspecifics . All analyses were based on a provisional genetic definition of species, MOTUs. We named our MOTUs using sequential numbers added onto the institutional code for the Biodiversity Institute of Ontario, University of Guelph (e.g. Podocopid BIOUG001, etc.). MOTUs were assigned using Barcode Index Numbers (BINs) from BOLD3 , accessed March 16, 2012. BINs are genetic groupings assigned by BOLD for sequences >500 bp based on a 2% initial sequence divergence that is combined with an algorithm permitting deviations from this threshold in cases of genetic distance continuity or discontinuity . We assigned our shorter sequences to these MOTUs if they clustered within a particular BIN on a neighbour-joining tree. The maximum pairwise sequence divergence within each MOTU, as well as the distance to the nearest neighbouring sequence belonging to a different MOTU, was calculated for all sequences of at least 500 bp using the Barcode Gap Analysis function in BOLD3 using the Kimura-2-parameter (K2P) distance model . K2P distances have been more prevalent in the barcoding literature and were selected to facilitate comparison across studies. While two recent papers have argued for the use of p-distances instead, results using p-distances vs. K2P are very similar [50, 51].
A neighbour-joining (NJ) tree was constructed in MEGA v. 5.0 using the K2P model for nucleotide substitutions and pairwise deletions of missing sites. One thousand bootstrap replicates were performed to assess nodal support of the clusters/MOTUs, without the assumption that this phenogram will accurately represent deeper phylogenetic relationships. Of 498 sequences recovered, 496 were used in the construction of this tree; 2 sequences were removed as they contained no overlapping sites with a number of other sequences and therefore would not allow distances to be calculated using MEGA. Clusters at the tips of the tree were collapsed for tree visualization.
We performed basic summary statistics to elucidate the sequencing success rate for the specimens submitted for genetic analyses, as well as determined the number of genetic clusters recovered by each sampling strategy. To accommodate differing sample sizes, we compared provisional species richness among strategies using accumulation curves for successfully barcoded individuals. Curves were built in R version 2.14  using the packages “picante” and “vegan” , with the curves randomized on a per-individual basis, without replacement, and with 1000 permutations. We further built curves that were rarefied by the number of sites sampled to ensure that there was no bias among studies having different numbers of sampling sites. Unless otherwise specified, sites without ostracods were excluded from their respective studies for analysis purposes. Curves based upon all sites would not be comparable because some studies were designed to be ostracod or microcrustacean focused, and therefore all or nearly all selected sites contained ostracods for those strategies. For the rarefied curves, we used the specaccum function in the package “picante”. We calculated Bray dissimilarity coefficients between the sampling strategies using normalized presence-absence data using the function decostand in the “vegan” package . We performed 1-way ANOVAs and ad hoc tests of significance using the Bonferroni control to look for significant differences in dissimilarity between sampling strategies in their resulting species composition, i.e. genetic cluster composition. We did not perform an ANOVA on count data, but on dissimilarity coefficients. Upon plotting histograms of the dissimilarity values, they were approximately normally distributed.
Species richness estimators
Even if species accumulation curves do not reach an asymptote, species richness estimators can be used to compare the richness of collections and to assess the stability of the biodiversity estimates with increasing sampling. We used the program EstimateS® version 8.2  to compute the mean of the incidence-based, non-parametric richness estimators Chao2, first-order Jackknife (Jack1), and Incidence Coverage Estimate (ICE) for each sampling strategy, as well as the standard deviation for each and 95% confidence interval for Chao2. Chao2 is expected to be a robust and conservative estimate of diversity and uses the number of singletons and doubletons (species present in one or two samples) to infer the richness of additional species present but not detected . We used the resulting indicators to generate plots showing total number of expected species with number of sites sampled, and compared the stability across the five strategies for the different richness estimators. We generated the set of richness estimators twice for the fixed-protocol and flexible-protocol strategies, once using only the total number of sites where ostracods were actually located and once using the total number of sites surveyed (refer back to Table 1).
Summary of the primers used in each project and sequencing success rates
Number of specimens
Success rate (%)
Number of specimens >200 bp with <2% Ns
OZFWC – Plates 1 and 2b
OZFWC – Plates 3 and 4b
OZFWC – Plates 5-9b
The liberal re-sort and flexible-protocol strategy captured 29 (~60%) of these clusters, while the time-based, rapid-blitz, and fixed-protocol strategies captured 18 (~38%), 18 (~38%), and 17 (~35%) MOTUs, respectively. Of the 48 genetic clusters identified in this analysis, only three (~6%) were captured by all five sampling strategies. Sixteen (~33%) genetic clusters were captured by only one of the five individual sampling strategies, and only 29 (~60%) clusters were captured by any two of the five sampling strategies. The flexible-protocol and liberal re-sort strategies proved highly efficient at capturing ‘rare’ clusters, here defined as appearing in only one or two of the five sampling strategies, but were also equally successful as the other sampling strategies at capturing common clusters, here defined as clusters appearing in three to five sampling strategies (Table 1). Of the 31 clusters appearing in only one or two sampling strategies, 14 and 15 (~45 and 48%) were identified in the liberal re-sort and flexible-protocol strategies respectively, compared to five or six (~16 or 19%) for each of the other strategies. The liberal re-sort and flexible-protocol also captured 83% of abundant clusters, compared to the remaining strategies, which captured 66-72% of abundant clusters.
Dissimilarity indices of methods
Bray Curtis dissimilarities of distance comparisons of sampling methods
Compared with rapid blitz
Compared with liberal re-sort
Compared with fixed protocol
Compared with flexible protocol
Species richness estimators
Mean MOTU richness estimates, standard deviation, and confidence intervals for each sampling strategy
Observed individuals (genetic)
Chao2 95% CI
19.6 – 46.6
29.8 – 458.2
37.8 – 135.4
14.0 – 91.2
26.4 – 90.4
8.6 – 46.0
24.6 – 97.6
Richness estimators were also generated a second time for the fixed- and flexible-protocol methods using the total number of sites surveyed, regardless of whether they contained Ostracoda. Values for the richness estimators were similar in magnitude between the two datasets, but decreased for all metrics for the fixed-protocol strategy and increased for two metrics for the flexible-protocol strategy (Table 4).
Comparisons of sampling strategies
This study presents an a posteriori assessment of the effectiveness of five freshwater ostracod collecting strategies, in order to help to inform the design of non-expert bioblitzes. We have found that the five sampling strategies differed both in the composition of the MOTUs they yielded and in total richness. We noted that only three MOTUs were found in all of the sampling strategies, and 16 MOTUs were only captured by a single strategy (Figure 1). Differences in composition were further verified by the Bray dissimilarity coefficients, which computed the dissimilarity of the different sampling strategies based on resulting MOTU composition. This analysis found low levels of similarity between the sampling methods (Table 3). This lack of similarity between sampling methods is not particularly surprising, however, considering that all strategies under sampled the total diversity. Moreover, we have confounded differences in site selection, some temporal differences between the different sampling strategies, as well as with some differences in equipment used. What is of greater interest in this study however, is the effectiveness of the different sampling strategies to capture a proportion of the total detected diversity and to serve as a predictor of total diversity in the region.
Both site-based and individual-based rarefaction curves revealed the liberal re-sort method to be superior in capturing total richness in this system. While it is not surprising that sequencing more specimens would yield more MOTUs, what was unexpected was that this pattern held for a given sampling intensity. This method was based upon field collections for two other strategies (time-based and rapid-blitz), which were either ostracod or microcrustacean focused or which aimed to include a variety of suitable microhabitats. This method involved more liberal inclusion of specimens for molecular processing and deliberately excluded one large-bodied, conspicuous (blue-coloured), and common morphospecies.
We also computed richness estimates using non-parametric methods for each of the sampling strategies. While these analyses showed that none of the employed sampling strategies captured the total richness present in the system, there was variability among the strategies in their ability to estimate richness. We conclude that the most appropriate of the five strategies for non-expert bioblitzes, in terms of both capturing and estimating diversity in this system, is the liberal re-sort because it captured the highest number of MOTUs, resulted in the second-highest Chao2 total richness estimator, and the Chao2 estimator remained stable after only 6 sites sampled. These results confirm that building upon field methods that included selecting a diversity of microhabitat and then more liberally including specimens for molecular analysis was most successful in characterizing diversity in this system.
Implications of the evaluation of sampling strategies
In the past, deficiencies in freshwater ostracod sampling techniques led to the false conclusion that these animals were rare components of aquatic habitats . While this is no longer the case, care must still be taken to use appropriate sampling strategies for ostracods. Two factors that influenced sampling effectiveness were whether morphospecies-based or liberal sorting of specimens was used, and whether microcrustaceans were the main goal of a collection. This study demonstrated that the greatest amount of ostracod diversity was uncovered in a microinvertebrate bioblitz aimed at the collection of specific taxa using an micro-habitat focused field strategy and a liberal sorting strategy, both of which allowed for higher numbers of specimens to be analyzed regardless of field morphospecies status. The two-phase method of a preliminary genetic analysis followed by a liberal resort (initiated specifically because of the detection of numerous singletons during phase 1) was an efficient way to direct resources. While we have provided partial evidence for this conclusion, we recognize that the differences among the five sampling strategies analyzed retrospectively in this paper (e.g., number of sites, season, equipment used, and PCR primers) may have affected our results. We suggest that partial evidence based upon considering available data can inform micro-invertebrate sampling design, pending further evidence from studies on other habitats and taxa. We recommend the following guidelines for designing non-expert biodiversity sampling programs that can use DNA barcoding for identification of samples: (a) Sampling strategies must include various macro and microhabitats. This study showed that non-experts were able to improve biodiversity estimators by actively selecting a variety of microhabitats. (b) A minimum of two sorting sessions should be done post collection when preparing specimens for DNA barcoding. The first session should include the dividing of organisms into gross morphospecies and selecting a representative number from each unit for DNA barcoding. A second, more liberal sorting strategy should include more replicates from the existing taxonomic units (as was the case in the intensive re-sort featured in this study); this sorting should be informed by phase one, as time and cost efficiency can be increased by deliberately excluding abundant morphospecies detected during phase one. (c) The objective of the study should be considered along with recognition of the benefits of a taxon-specific approach to collection compared to whole-community analyses. Whenever possible, sub-teams (even if all are non-experts) should focus on particular taxa within a broader whole-community analysis.
We note that our findings are applicable only to regions where the expected diversity of the system is relatively low (i.e. subarctic or temperate systems), thus facilitating reaching asymptotes in the biodiversity estimators, as we observed here. Understanding patterns of global diversity as well as alpha and beta diversity patterns would require far more sampling in tropical systems to elucidate regional biodiversity.
Richness of the ostracoda of Churchill
This study also increased the known richness of the Churchill freshwater ostracod fauna. A previous multi-year study of ostracod diversity in the Churchill region by trained aquatic ecologists, including a multitude of sites and employing similar collection methods to our sampling strategies, yielded a total of 30 species identified through traditional taxonomy . Through DNA barcoding, we have here identified 48 genetic clusters, which we interpret as provisional species due to comparison with genetic divergences in other microcrustaceans [e.g. ; this represents an increase in species richness of 56%. Nevertheless, the Churchill system is still undoubtedly under sampled for Ostracoda, as we found 10 MOTUs represented by a single individual across all strategies.
The prior underestimation of invertebrate species richness in this sub-arctic site is common to a variety of taxa. For example, the diversity of the crustacean class Branchiopoda has been characterized in Churchill using similar collecting methods as employed here, combined with DNA barcoding and morphological examination . Interestingly, the known richness increased from 25 to 42 species/MOTUs , a similar proportional increase (68%) as we report here for the Ostracoda. This enhanced knowledge of MOTU richness in the microcrustaceans of Churchill, combined with their strong clustering patterns suggestive of species-level status of MOTUs, will be valuable for future studies of this site. Churchill has long been used as a model site for freshwater zooplankton community and population genetics studies [e.g.  and is now being developed as a site for comprehensive community studies through the creation of a comprehensive DNA barcode library .
Next steps: whole-community bioblitzes?
DNA barcoding, arguably, originated with taxon-focused studies that consolidated data from many different sources . Many of the early DNA barcoding papers were focused on defining methods and showing the efficacy of DNA barcoding for different taxonomic groups (e.g. ). In this way, early DNA barcoding was often focused on questions such as “how can we reliably differentiate closely related species?” This question and these seminal works were important for establishing DNA barcoding as a key technique in ecology and biodiversity studies, but the focus behind DNA barcoding has partially shifted in recent years. In addition to creating reference libraries and uncovering cryptic species or species complexes, DNA barcoding is increasingly being used to answer complex mechanistic ecological questions [e.g. [58–60]. The further shift from single-taxa to whole-community sampling  represents a significant trend in DNA barcoding research and one that is undoubtedly important. As DNA barcoding methods begin to be applied to whole communities to answer ecological questions, it is possible that despite increased taxonomic resolution with the genetic techniques, sampling programs aimed at whole communities may not be effective at capturing total diversity of certain groups. While collection efforts will remain a human effort, further advancement in next-generation sequencing technologies will contribute to the field of environmental DNA analysis , allowing for bulk sample analysis without the need for a sorting protocol. As demonstrated in this analysis, sorting protocol can have a profound impact on the results of a sampling strategy. After much-needed methodological advancements in understudied invertebrate taxa, environmental DNA analysis is poised to greatly contribute to community-level analyses of aquatic microinvertebrates [61, 62].
While our study has shown definite differences in how much of the ostracod community each sampling strategy was able to capture, different strategies still have their merits for different study objectives and research programs. Storey et al.  tested two types of sampling methods for the collection of aquatic macroinvertebrates and noted that one method cost significantly less and was likely appropriate for research objectives such as long-term monitoring. The alternative method that they tested, however more costly and time consuming, proved to be more appropriate for studying new and unexplored areas . This has implications for our earlier discussion of citizen science and the popularity of bioblitzes; while time-based or rapid-blitz approaches may be attractive to this type of citizen scientist sampling campaign because they allow large groups of participants to be involved simultaneously, they may not be the most appropriate depending on the goals of the study.
While the main objective of this study was not to produce rigid rules for ostracod sampling programs conducted by non-experts, we nonetheless think that our findings have a place in current biodiversity science and the design of biodiversity sampling programs. By considering our findings, we hope that freshwater ecologists, and particularly those who engage in large-scale citizen science, can develop a set of best practices for these types of surveys and also recognize the utility of incorporating DNA barcoding into these programs. We found that sampling strategy had a substantial impact on species richness estimates of biodiversity surveys for a freshwater microinvertebrate fauna. Accumulation curves may approach an asymptote indicating completeness of sampling, but this can be attributed to artifacts of a sampling strategy limited not in scope but in methodological considerations. In our study, sampling strategies that were flexible in nature yielded higher species richness estimates than non-flexible sampling strategies, though consideration should be given to the benefits of a fixed-collection protocol in terms of comparative ecological analyses. We found that increasing the number of specimens analyzed during biodiversity surveys through the implementation of a liberal sorting strategy that is not reliant on morphospecies exposed additional diversity that was not originally appreciated by collectors based strictly on external morphological characteristics. This points to why DNA barcoding represents an excellent avenue with which to conduct biodiversity surveys of small-bodied organisms. Finally, strategies with an ostracod-specific or microcrustacean-focused collection mandate were more efficient than whole-community analyses at elucidating ostracod diversity on a per-site/time invested basis. Even with advanced molecular techniques, there should continue to be a balance when designing sampling programs.
We are grateful for support provided by the International Polar Year and Discovery Grant programs of the Natural Sciences and Engineering Research Council of Canada (NSERC) to SJA and to P. Hebert, as well as for support provided by the Government of Canada through Genome Canada and the Ontario Genomics Institute to the International Barcode of Life Project. We also thank the Ontario Ministry of Economic Development and Innovation for funding the ongoing development of BOLD, which was essential for data management and analysis for this project. We thank staff at the Canadian Centre for DNA Barcoding at the University of Guelph for specimen preparation and molecular analysis. We are grateful to D. Steinke and S. Prosser for providing primer sequences. The staff at the Churchill Northern Studies Centre (CNSC) provided valuable logistical support throughout this endeavour as well as financial support through their Northern Research Fund. AKW and EEB received further support from the Northern Scientific Training Program of the Department of Indian Affairs and Northern Development (DIAND) federal agency of Canada. We thank J. Witt and P. Hebert for mentoring and guidance in the field. We also thank E. Noel, J. Wang, M. Young, K. Layton, and E. Chambers, for assistance in the field. Finally, we thank J. Sones, G. Martin, S. Eagle, N. Serrao, R. Hanner, and A. Smith for assistance in successful barcoding. Eight reviewers provided comments that greatly improved the manuscript.
- Havel JE, Hebert PDN, Delorme LD: Genotypic diversity of asexual Ostracoda from a low arctic site. J Evol Biol. 1990, 3: 391-410.View Article
- Little TJ, Hebert PDN: Clonal diversity in high arctic ostracodes. J Evol Biol. 1997, 10: 233-252.View Article
- Schön I, Martens K, Van Doninck K, Butlin RK: Evolution in the slow lane: molecular rates of evolution in sexual and asexual ostracods (Crustacea: Ostracoda). Boil J Linn Soc. 2003, 79: 93-100.View Article
- Tinn O, Oakley TH: Erratic rates of molecular evolution and incongruence of fossil and molecular divergence time estimates in Ostracoda (Crustacea). Mol Phylogenet Evol. 2008, 48: 157-167.View ArticlePubMed
- Wetterich S, Schirrmeister L, Meyer H, Viehberg FA, Mackensen A: Arctic freshwater ostracods from modern periglacial environments in the Lena River Delta (Siberian Arctic, Russia): geochemical applications for palaeoenvironmental reconstructions. J Paleolimnol. 2008, 39: 427-449.View Article
- Bode SNS, Adolfsson S, Lamatsch DK, Martins MJF, Schmit O: Exceptional cryptic diversity and multiple origins of parthenogenesis in a freshwater ostracod. Mol Phylogenet Evol. 2010, 54: 542-552.View ArticlePubMed
- Bellavere C, Benassi G, Calzolari M, Meisch C, Mckenzie KG: Heterocypris (Crustacea, Ostracoda) from the Isole Pelagie (Sicily, Italy): the coexistence of different morphotypes. Ital J Zool. 2002, 69: 53-57.View Article
- Martens K, Schön I, Meish C, Horne DJ: Global diversity of ostracods (Ostracoda, Crustacea) in freshwater. Hydrobiologia. 2008, 595: 185-193.View Article
- Brusca RC, Brusca GJ: Invertebrates. 2003, Sunderland: Sinauer Associates, 2
- Thorp JH, Covich AP: Ecology and classification of North American freshwater invertebrates (3rd ed.). 2002, San Diego, California, USA: Academic Press (Elsevier)
- Athersuch J, Horne DJ, Whittaker JE: Marine and brackish water ostracods. 1989, Bath, England, UK: Bath Press
- Meisch C: Freshwater Ostracoda of Western and Central Europe. 2000, Heidelberg, Germany: Specktrum Akademischer Verlag
- Frenzel P, Keyser D, Viehberg FA: An illustrated key and (palaeo)ecological primer for postglacial to recent Ostracoda (Crustacea) of the Baltic Sea. Boreas. 2010, 39: 567-575.
- Baltanás A, García-Avilés J: New records of freshwater Ostracoda (Crustacea) from the Canary Islands. Bull Soc Nat luxemb. 1993, 94: 219-232.
- Külköylüoğlu O: Ecology and phenology of freshwater ostracods in Lake Gölköy (Bolu, Turkey). Aquat Ecol. 2005, 39: 295-304.View Article
- Meisch C, Malmqvist B, Nilsson AN: Freshwater Ostracoda (Crustacea) collected in Tenerife, Canary Islands. Mitt hamb zool Mus Inst. 1995, 92: 281-293.
- Meisch C, Mary-Sasal N, Colin J-P, Wouters K: Freshwater Ostracoda (Crustacea) collected from the islands of Futuna and Wallis, Pacific Ocean, with a checklist of the non-marine Ostracoda of the Pacific Islands. Bull Soc Nat luxemb. 2007, 108: 89-103.
- Rossetti G, Martens K, Meisch C, Tavernelli S, Pieri V: Small is beautiful: diversity of freshwater ostracods (Crustacea, Ostracoda) in marginal habitats of the province of Parma (Northern Italy). J Limnol. 2006, 65: 121-131.View Article
- Victor R: The taxonomy and distribution of freshwater Ostracods (Crustacea – Ostracoda) of Malaysia, Indonesia and the Philippines. PhD thesis. 1979, University of Waterloo
- Victor R, Fernando CH: Distribution of freshwater Ostracoda (Crustacea) in Southeast Asia. J Biogeogr. 1982, 9: 281-288.View Article
- Viehberg FA: A new and simple method for qualitative sampling of meiobenthos-communities. Limnologica. 2002, 32: 350-351.View Article
- Brinkman MA, Duffy WG: Evaluation of four wetland aquatic invertebrate samplers and four sample sorting methods. J Freshwater Ecol. 1996, 11: 193-200.View Article
- Cao Y, Hawkins CP, Vinson MR: Measuring and controlling data quality in biological assemblage surveys with special reference to stream benthic macroinvertebrates. Freshwater Biol. 2003, 48: 1898-1911.View Article
- Cao Y, Hawkins CP, Storey AW: A method for measuring the comparability of difference sampling methods used in biological surveys: implications for data integration and synthesis. Freshwater Biol. 2005, 50: 1105-1115.View Article
- Cheal F, Davis JA, Growns JE, Bradley JS, Whittles FH: The influence of sampling method on the classification of wetland macroinvertebrate communities. Hydrobiologia. 1993, 257: 47-56.View Article
- Florencio M, Díaz-Paniagua C, Gomez-Mestre I, Serrano L: Sampling macroinvertebrates in a temporary pond: comparing the suitability of two techniques to detect richness, spatial segregation and diel activity. Hydrobiologia. 2011, 10.1007/s10750-011-0690-8.
- Gunzburger M: Evaluation of seven aquatic sampling methods for amphibians and other aquatic fauna. Appl Herpotol. 2007, 4: 47-63.View Article
- Jurado GB, Masterson M, Harrington R, Kelly-Quinn M: Evaluation of sampling methods for macroinvertebrate biodiversity estimation in heavily vegetated ponds. Hydrobiologia. 2008, 597: 97-107.View Article
- Meyer CK, Peterson SD, Whiles MR: Quantitative assessment of yield, precision, and cost-effectiveness of three wetland invertebrate sampling techniques. Wetlands. 2011, 31: 101-112.View Article
- Degerlund M, Huseby S, Zingone A, Sarno D, Landfald B: Functional diversity in cryptic species of Chaetoceros socialis Lauder (Bacillariophyceae). J Plankton Res. 2012, 10.1093/plankt/fbs004.
- Jeffery NW, Elías-Gutiérrez M, Adamowicz SJ: Species diversity and phylogeographical affinities of the Branchiopoda (Crustacea) of Churchill, Manitoba, Canada. PLoS ONE. 2011, 6: e18364-10.1371/journal.pone.0018364.PubMed CentralView ArticlePubMed
- Kucera M, Darling KF: Cryptic species of planktonic foraminifera: their effect on palaeoceanographic reconstructions. Phil Trans R Soc Lond A. 2002, 360: 695-718.View Article
- Lim GS, Blake M, Meier R: Determining species boundaries in a world full of rarity: singletons, species delimitation methods. Syst Biol. 2012, 61: 165-169.View ArticlePubMed
- Cohn JP: Citizen science: can volunteers do real research?. Bioscience. 2008, 38: 192-197.View Article
- Dickinson JL, Zuckerberg B, Bonter DN: Citizen science as an ecological research tool: Challenges and benefits. Annu Rev Ecol Evol S. 2010, 41: 149-172.View Article
- Lepczyk CA, Boyle OD, Vargo TL, Gould P, Jordan R, Liebenberg L, Masi S, Mueller WP, Prysby MD, Vaughan H: Symposium 18: Citizen science in ecology: the intersection of research and education. Bulletin of the Ecological Society of America. 2009, 90: 308-317.View Article
- Miller-Rushing A, Primack R, Bonney R: The history of public participation in ecological research. Front Ecol Environ. 2012, 10: 285-290.View Article
- Hebert PDN, Penton EH, Burns JM, Janzen DH, Hallwachs W: Ten species in one: DNA barcoding reveals cryptic species in the neotropical skipper butterfly Astraptes fulgerator. P Natl Acad Sci USA. 2004, 101: 14812-14817.View Article
- Smith MA, Woodley NE, Janzen DH, Hallwachs W, Hebert PDN: DNA barcodes reveal cryptic host-specificity within the presumed polyphagous members of a genus of parasitoid flies (Diptera: Tachinidae). P Natl Acad Sci USA. 2006, 103: 3657-3662.View Article
- Witt JDS, Threloff DL, Hebert PDN: DNA barcoding reveals extraordinary cryptic diversity in an amphipod genus: implications for desert spring conservation. Mol Ecol. 2006, 15: 3073-3082.View ArticlePubMed
- Kerr KCR, Stoeckle MY, Dove DJ, Weigt LA, Francis CM: Comprehensive DNA barcode coverage of North American birds. Mol Ecol Notes. 2007, 7: 535-543.PubMed CentralView ArticlePubMed
- Smith MA, Fernandez-Triana J, Roughly R, Hebert PDN: DNA barcode accumulation curves for understudied taxa and areas. Mol Ecol Resour. 2009, 9: 208-216.View ArticlePubMed
- Ratnasingham S, Hebert PDN: BOLD: the barcode of life data system. Mol Ecol Notes. 2007, 7: 355-364. (http://www.barcodinglife.org)PubMed CentralView ArticlePubMed
- Elías-Guttiérez M, Jerónimo FM, Ivanova NV, Valdez-Moreno M, Hebert PDN: DNA barcodes for Cladocera and Copepoda from Mexico and Guatemala, highlights and new discoveries. Zootaxa. 1839, 2008: 1-42.
- Bucklin A, Ortman BD, Jennings RM, Nigro LM, Sweetman CJ: A “Rosetta Stone” for metazoan zooplankton: DNA barcode analysis of species diversity of the Sargasso Sea (Northwest Atlantic Ocean). Deep-Sea Res II. 2010, 57: 2234-2247.View Article
- Zhou X, Adamowicz SJ, Jacobus LM, DeWalt RE, Hebert PDN: Towards a comprehensive barcode library for arctic life – Ephemeroptera, Plecoptera, and Trichoptera of Churchill, Manitoba, Canada. Front Zool. 2009, 6: 10.1186/1742-9994-6-30.
- Ivanova NV, DeWaard JR, Hebert PDN: An inexpensive, automation friendly protocol for recovering high-quality DNA. Mol Ecol Notes. 2006, 6: 998-1002.View Article
- Meusnier I, Singer GAC, Landry JF, Hickey DA, Hebert PDN, Hajibabaei M: A universal DNA mini-barcode for biodiversity analysis. BMC Genomics. 2008, 9: 214-PubMed CentralView ArticlePubMed
- Kimura M: A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J Mol Evol. 1980, 16: 111-120.View ArticlePubMed
- Collins RA, Boykin LM, Cruickshank RH, Armstrong KF: Barcoding’s next top model: an evaluation of nucleotide substitution models for specimen identification. MEE. 2012, 3: 457-465.
- Srivathsan A, Meier R: On the inappropriate use of the Kimura-2-parameter (K2P) divergences in the barcoding literature. Cladistics. 2012, 28: 190-194.View Article
- R Core Development Team: The R Project for Statistical Computing. 2009, Available online: http://www.r-project.org/
- Oksanen J: Multivariate Analysis of Ecological Communities in R: vegan tutorial. Comprehensive R Archive Network. 2007, Available online: http://cran.r-project.org/
- Colwell RK: EstimateS: Statistical estimation of species richness and shared species from samples. Version 8.2. User’s Guide and application published at: http://purl.oclc.org/estimates. (Accessed October, 2012)
- Gotelli NJ, Colwell RK: Estimating species richness. Biological Diversity: Frontiers in Measurement and Assessment. Edited by: Magurran AE, McGill BJ. 2011, Oxford, England, UK: Oxford University Press, 39-54.
- Hebert PDN, Cywinska A, Ball SL, de Waard JR: Biological identifications through DNA barcodes. P Roy Soc B-Biol Sci. 2003, 270: 313-321.View Article
- Park DS, Foottit R, Maw E, Hebert PDN: Barcoding bugs: DNA-based identification of the true bugs (Insecta: Hemiptera: Heteroptera). PLoS One. 2011, 6: e18749-10.1371/journal.pone.0018749.PubMed CentralView ArticlePubMed
- Smith MA, Woodley NE, Janzen DH, Hallwachs W, Hebert PDN: DNA barcodes reveal cryptic host-specificity within the presumed polyphagous members of a genus of parasitoid flies (Diptera: Tachinidae). PNAS. 2006, 103: 3657-3662.PubMed CentralView ArticlePubMed
- Smith MA, Wood DM, Janzen DH, Hallwachs W, Hebert PDN: DNA barcodes affirm that 16 species of apparently generalist tropical parasitoid flies (Diptera, Tachinidae) are not all generalists. PNAS. 2007, 104: 4967-4972.PubMed CentralView ArticlePubMed
- Smith MA, Rodriguez JJ, Whitfield JB, Deans AR, Janzen DH, Hallwachs W, Hebert PDN: Extreme diversity of tropical parasitoid wasps exposed by iterative integration of natural history, DNA barcoding, morphology, and collections. PNAS. 2008, 105: 12359-12364.PubMed CentralView ArticlePubMed
- Shokralla S, Spall JL, Gibson JF, Hajibabaei M: Next-generation sequencing technologies for environmental DNA research. Mol Ecol. 2012, 21: 1794-1805.View ArticlePubMed
- Hajibabaei M, Shokralla S, Zhou X, Singer GAC, Baird DJ: Environmental barcoding: A next-generation sequencing approach for biomonitoring applications using river benthos. PLoS One. 2011, 6: e17497-10.1371/journal.pone.0017497.PubMed CentralView ArticlePubMed
- Storey AW, Edward DHD, Gazey P: Suber and kick sampling: a comparison for the assessment of macroinvertebrate community structure in streams of south-western Australia. Hydrobiologia. 1991, 211: 111-121.View Article
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.