RESUMEN
Biodiversity scientists must be fluent across disciplines; they must possess the quantitative, computational, and data skills necessary for working with large, complex data sets, and they must have foundational skills and content knowledge from ecology, evolution, taxonomy, and systematics. To effectively train the emerging workforce, we must teach science as we conduct science and embrace emerging concepts of data acumen alongside the knowledge, tools, and techniques foundational to organismal biology. We present an open education resource that updates the traditional plant collection exercise to incorporate best practices in twenty-first century collecting and to contextualize the activities that build data acumen. Students exposed to this resource gained skills and content knowledge in plant taxonomy and systematics, as well as a nuanced understanding of collections-based data resources. We discuss the importance of the extended specimen in fostering scientific discovery and reinforcing foundational concepts in biodiversity science, taxonomy, and systematics.
RESUMEN
The early twenty-first century has witnessed massive expansions in availability and accessibility of digital data in virtually all domains of the biodiversity sciences. Led by an array of asynchronous digitization activities spanning ecological, environmental, climatological, and biological collections data, these initiatives have resulted in a plethora of mostly disconnected and siloed data, leaving to researchers the tedious and time-consuming manual task of finding and connecting them in usable ways, integrating them into coherent data sets, and making them interoperable. The focus to date has been on elevating analog and physical records to digital replicas in local databases prior to elevating them to ever-growing aggregations of essentially disconnected discipline-specific information. In the present article, we propose a new interconnected network of digital objects on the Internet-the Digital Extended Specimen (DES) network-that transcends existing aggregator technology, augments the DES with third-party data through machine algorithms, and provides a platform for more efficient research and robust interdisciplinary discovery.
RESUMEN
[This corrects the article DOI: 10.1093/biosci/biz140.].
RESUMEN
PREMISE: With digitization and data sharing initiatives underway over the last 15 years, an important need has been prioritizing specimens to digitize. Because duplicate specimens are shared among herbaria in exchange and gift programs, we investigated the extent to which unique biogeographic data are held in small herbaria vs. these data being redundant with those held by larger institutions. We evaluated the unique specimen contributions that small herbaria make to biogeographic understanding at county, locality, and temporal scales. METHODS: We sampled herbarium specimens of 40 plant taxa from each of eight states of the United States of America in four broad status categories: extremely rare, very rare, common native, and introduced. We gathered geographic information from specimens held by large (≥100,000 specimens) and small (<100,000 specimens) herbaria. We built generalized linear mixed models to assess which features of the collections may best predict unique contributions of herbaria and used an Akaike information criterion-based information-theoretic approach for our model selection to choose the best model for each scale. RESULTS: Small herbaria contributed unique specimens at all scales in proportion with their contribution of specimens to our data set. The best models for all scales were the full models that included the factors of species status and herbarium size when accounting for state as a random variable. CONCLUSIONS: We demonstrated that small herbaria contribute unique information for research. It is clear that unique contributions cannot be predicted based on herbarium size alone. We must prioritize digitization and data sharing from herbaria of all sizes.
Asunto(s)
Manejo de EspecímenesRESUMEN
Premise: Species distribution models (SDMs) are widely utilized to guide conservation decisions. The complexity of available data and SDM methodologies necessitates considerations of how data are chosen and processed for modeling to enhance model accuracy and support biological interpretations and ecological applications. Methods: We built SDMs for the invasive aquatic plant European frog-bit using aggregated and field data that span multiple scales, data sources, and data types. We tested how model results were affected by five modeler decision points: the exclusion of (1) missing and (2) correlated data and the (3) scale (large-scale aggregated data or systematic field data), (4) source (specimens or observations), and (5) type (presence-background or presence-absence) of occurrence data. Results: Decisions about the exclusion of missing and correlated data, as well as the scale and type of occurrence data, significantly affected metrics of model performance. The source and type of occurrence data led to differences in the importance of specific explanatory variables as drivers of species distribution and predicted probability of suitable habitat. Discussion: Our findings relative to European frog-bit illustrate how specific data selection and processing decisions can influence the outcomes and interpretation of SDMs. Data-centric protocols that incorporate data exploration into model building can help ensure models are reproducible and can be accurately interpreted in light of biological questions.
RESUMEN
PREMISE: Heterogeneity of biodiversity data from the collections, research, and management communities presents challenges for data findability, accessibility, interoperability, and reusability. Workflows designed with data collection, standards, dissemination, and reuse in mind will generate better information across geopolitical, administrative, and institutional boundaries. Here, we present our data workflow as a case study of how we collected, shared, and used data from multiple sources. METHODS: In 2012, we initiated the collection of biodiversity data relating to Michigan prairie fens, including data on plant communities and the federally endangered Poweshiek skipperling (Oarisma poweshiek). RESULTS: Over 23,000 occurrence records were compiled in a database following Darwin Core standards. The records were linked with media and biological, chemical, and geometric measurements. We published the data as Global Biodiversity Information Facility data sets and in Symbiota SEINet portals. DISCUSSION: We highlight data collection techniques that optimized transcription time, including the use of predetermined and controlled vocabulary, Darwin Core terms, and data dictionaries. The validity and longevity of our data were supported by voucher specimens, metadata with measurement records, and published manuscripts detailing methods and data sets. Key to our data dissemination was cooperation among partners and the utilization of dynamic tools. To increase data interoperability, we need flexible and customizable data collection templates, coding, and enhanced communication among communities using biodiversity data.
RESUMEN
BACKGROUND: Primary biodiversity data records that are open access and available in a standardised format are essential for conservation planning and research on policy-relevant time-scales. We created a dataset to document all known occurrence data for the Federally Endangered Poweshiek skipperling butterfly [Oarisma poweshiek (Parker, 1870; Lepidoptera: Hesperiidae)]. The Poweshiek skipperling was a historically common species in prairie systems across the upper Midwest, United States and Manitoba, Canada. Rapid declines have reduced the number of verified extant sites to six. Aggregating and curating Poweshiek skipperling occurrence records documents and preserves all known distributional data, which can be used to address questions related to Poweshiek skipperling conservation, ecology and biogeography. Over 3500 occurrence records were aggregated over a temporal coverage from 1872 to present. Occurrence records were obtained from 37 data providers in the conservation and natural history collection community using both "HumanObservation" and "PreservedSpecimen" as an acceptable basisOfRecord. Data were obtained in different formats and with differing degrees of quality control. During the data aggregation and cleaning process, we transcribed specimen label data, georeferenced occurrences, adopted a controlled vocabulary, removed duplicates and standardised formatting. We examined the dataset for inconsistencies with known Poweshiek skipperling biogeography and phenology and we verified or removed inconsistencies by working with the original data providers. In total, 12 occurrence records were removed because we identified them to be the western congener Oarisma garita (Reakirt, 1866). This resulting dataset enhances the permanency of Poweshiek skipperling occurrence data in a standardised format. NEW INFORMATION: This is a validated and comprehensive dataset of occurrence records for the Poweshiek skipperling (Oarisma poweshiek) utilising both observation and specimen-based records. Occurrence data are preserved and available for continued research and conservation projects using standardised Darwin Core formatting where possible. Prior to this project, much of these occurrence records were not mobilised and were being stored in individual institutional databases, researcher datasets and personal records. This dataset aggregates presence data from state conservation agencies, natural heritage programmes, natural history collections, citizen scientists, researchers and the U.S. Fish & Wildlife Service. The data include opportunistic observations and collections, research vouchers, observations collected for population monitoring and observations collected using standardised research methodologies. The aggregated occurrence records underwent cleaning efforts that improved data interoperablitity, removed transcription errors and verified or removed uncertain data. This dataset enhances available information on the spatiotemporal distribution of this Federally Endangered species. As part of this aggregation process, we discovered and verified Poweshiek skipperling occurrence records from two previously unknown states, Nebraska and Ohio.