1.
Biodivers Data J ; 11: e109439, 2023.
Article in English | MEDLINE | ID: mdl-38078294

ABSTRACT

Tens of millions of images from biological collections have become available online over the last two decades. In parallel, there has been a dramatic increase in the capabilities of image analysis technologies, especially those involving machine learning and computer vision. While image analysis has become mainstream in consumer applications, it is still used only on an artisanal basis in the biological collections community, largely because the image corpora are dispersed. Yet, there is massive untapped potential for novel applications and research if images of collection objects could be made accessible in a single corpus. In this paper, we make the case for infrastructure that could support image analysis of collection objects. We show that such infrastructure is entirely feasible and well worth investing in.

2.
PLoS One ; 16(12): e0261130, 2021.
Article in English | MEDLINE | ID: mdl-34905557

ABSTRACT

Natural history collection data available digitally on the web have so far only made limited use of the potential of semantic links among themselves and with cross-disciplinary resources. In a pilot study, botanical collections of the Consortium of European Taxonomic Facilities (CETAF) have therefore begun to semantically annotate their collection data, starting with data on people, and to link them via a central index system. As a result, it is now possible to query data on collectors across different collections and automatically link them to a variety of external resources. The system is being continuously developed and is already in production use in an international collection portal.


Subjects
Data Collection , Databases, Factual , Information Storage and Retrieval/methods , Botany , Computational Biology/methods , Humans
3.
Database (Oxford) ; 2020, 2020 11 27.
Article in English | MEDLINE | ID: mdl-33439246

ABSTRACT

People are one of the best known and most stable entities in the biodiversity knowledge graph. The wealth of public information associated with people and the ability to identify them uniquely open up the possibility to make more use of these data in biodiversity science. Person data are almost always associated with entities such as specimens, molecular sequences, taxonomic names, observations, images, traits and publications. For example, the digitization and the aggregation of specimen data from museums and herbaria allow us to view a scientist's specimen collecting in conjunction with the whole corpus of their works. However, the metadata of these entities are also useful in validating data, integrating data across collections and institutional databases and can be the basis of future research into biodiversity and science. In addition, the ability to reliably credit collectors for their work has the potential to change the incentive structure to promote improved curation and maintenance of natural history collections.


Subjects
Biodiversity , Natural History , Databases, Factual , Humans , Museums
4.
Biodivers Data J ; 7: e31817, 2019.
Article in English | MEDLINE | ID: mdl-30833825

ABSTRACT

BACKGROUND: More and more herbaria are digitising their collections. Images of specimens are made available online to facilitate access to them and allow extraction of information from them. Transcription of the data written on specimens is critical for general discoverability and enables incorporation into large aggregated research datasets. Different methods, such as crowdsourcing and artificial intelligence, are being developed to optimise transcription, but herbarium specimens pose difficulties in data extraction for many reasons. NEW INFORMATION: To provide developers of transcription methods with a means of optimisation, we have compiled a benchmark dataset of 1,800 herbarium specimen images with corresponding transcribed data. These images originate from nine different collections and include specimens that reflect the multiple potential obstacles that transcription methods may encounter, such as differences in language, text format (printed or handwritten), specimen age and nomenclatural type status. We are making these specimens available with a Creative Commons Zero licence waiver and with permanent online storage of the data. By doing this, we are minimising the obstacles to the use of these images for transcription training. This benchmark dataset of images may also be used where a defined and documented set of herbarium specimens is needed, such as for the extraction of morphological traits, handwriting recognition and colour analysis of specimens.

5.
Database (Oxford) ; 2018, 2018 01 01.
Article in English | MEDLINE | ID: mdl-30295725

ABSTRACT

Over the past years, herbarium collections worldwide have started to digitize millions of specimens on an industrial scale. Although imaging costs are steadily falling, capturing the accompanying label information is still predominantly done manually and has become the principal cost factor. To streamline the capture of herbarium specimen metadata, we specified a formal, extensible workflow integrating a wide range of automated specimen image analysis services. We implemented the workflow on the basis of OpenRefine together with a plugin for handling service calls and responses. The evolving system presently covers optical character recognition (OCR) of specimen images, the identification of regions of interest in images and the extraction of meaningful information items from the OCR output. These implementations were developed as part of the Deutsche Forschungsgemeinschaft-funded project 'A standardised and optimised process for data acquisition from digital images of herbarium specimens' (StanDAP-Herb).
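
Editorial illustration (not part of the abstract): the last workflow step, extracting information items from OCR text, might look roughly like the Python sketch below. The function name `extract_items`, the two regular expressions and the day-month-year date assumption are hypothetical choices made for this example; the abstract does not describe how the StanDAP-Herb services actually perform the extraction.

```python
import re
from typing import Dict, Optional

# Assumption: dates appear as day.month.year (or with slashes), as on many
# European labels. Real labels vary far more than this pattern allows.
DATE_PATTERN = re.compile(r"\b(\d{1,2})[./](\d{1,2})[./](\d{4})\b")

# Assumption: the collector follows the abbreviation "leg." or "coll.".
COLLECTOR_PATTERN = re.compile(r"(?:leg\.|coll\.)\s*([^\n,;]+)", re.IGNORECASE)


def extract_items(ocr_text: str) -> Dict[str, Optional[str]]:
    """Pull a collecting date and a collector name out of raw OCR text."""
    date = DATE_PATTERN.search(ocr_text)
    collector = COLLECTOR_PATTERN.search(ocr_text)
    iso_date = None
    if date:
        day, month, year = date.groups()
        # Normalise to ISO 8601 (YYYY-MM-DD) with zero padding.
        iso_date = f"{year}-{int(month):02d}-{int(day):02d}"
    return {
        "date": iso_date,
        "collector": collector.group(1).strip() if collector else None,
    }


# Made-up label text, used only to show the call.
label = "Flora von Berlin\nCampanula patula L.\nleg. A. Muster, 12.6.1898"
items = extract_items(label)
```

A rule-based extractor like this is only a starting point; fields that cannot be matched are returned as `None` so that a later manual or automated step can fill them in.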


Subjects
Information Storage and Retrieval , Plants , Workflow , Automation , Internet , Software
7.
Database (Oxford) ; 2017(1), 2017 01 01.
Article in English | MEDLINE | ID: mdl-28365724

ABSTRACT

With biodiversity research activities increasingly shifting to the web, the need for a system of persistent and stable identifiers for physical collection objects is becoming ever more pressing. The Consortium of European Taxonomic Facilities agreed on a common system of HTTP-URI-based stable identifiers, which is now being rolled out to its member organizations. The system follows Linked Open Data principles and implements redirection mechanisms to human-readable and machine-readable representations of specimens, facilitating seamless integration into the growing semantic web. The implementation of stable identifiers across collection organizations is supported by open source provider software scripts, best-practice documentation and recommendations for RDF metadata elements that facilitate harmonized access to collection information in web portals. Database URL: http://cetaf.org/cetaf-stable-identifiers.
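
Editorial illustration (not part of the abstract): the redirection mechanism described above can be sketched in Python as a function that inspects the HTTP `Accept` header and picks a representation. The URL templates, the function name `resolve` and the use of a 303 status code are assumptions made for this example, following common Linked Data practice; the abstract does not specify how individual providers implement their redirects.

```python
from typing import Tuple

# Hypothetical representation URLs; each real provider defines its own.
HTML_TEMPLATE = "https://collection.example.org/object/{id}.html"
RDF_TEMPLATE = "https://collection.example.org/object/{id}.rdf"


def resolve(identifier: str, accept: str) -> Tuple[int, str]:
    """Return (status, Location) for a request to a stable identifier URI.

    Media types are considered in the order the client lists them;
    q-values are ignored to keep the sketch short.
    """
    media_types = [part.split(";")[0].strip().lower() for part in accept.split(",")]
    for media_type in media_types:
        if media_type == "application/rdf+xml":
            # Machine-readable representation for semantic web clients.
            return 303, RDF_TEMPLATE.format(id=identifier)
        if media_type in ("text/html", "application/xhtml+xml"):
            # Human-readable page for browsers.
            return 303, HTML_TEMPLATE.format(id=identifier)
    # Fall back to the human-readable page when nothing matches.
    return 303, HTML_TEMPLATE.format(id=identifier)


# A semantic web client asking for RDF, and a browser asking for HTML.
rdf_response = resolve("B100000001", "application/rdf+xml")
html_response = resolve("B100000001", "text/html,application/xhtml+xml;q=0.9")
```

The point of the design is that the identifier itself never changes: only the `Location` it redirects to depends on who is asking.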


Subjects
Biodiversity , Databases, Factual , Natural Language Processing , Semantic Web , Software
8.
BMC Ecol ; 16(1): 49, 2016 10 20.
Article in English | MEDLINE | ID: mdl-27765035

ABSTRACT

BACKGROUND: Making forecasts about biodiversity and giving support to policy relies increasingly on large collections of data held electronically, and on substantial computational capability and capacity to analyse, model, simulate and predict using such data. However, the physically distributed nature of data resources and of expertise in advanced analytical tools creates many challenges for the modern scientist. Across the wider biological sciences, presenting such capabilities on the Internet (as "Web services") and using scientific workflow systems to compose them for particular tasks is a practical way to carry out robust "in silico" science. However, use of this approach in biodiversity science and ecology has thus far been quite limited. RESULTS: BioVeL is a virtual laboratory for data analysis and modelling in biodiversity science and ecology, freely accessible via the Internet. BioVeL includes functions for accessing and analysing data through curated Web services; for performing complex in silico analysis through exposure of R programs, workflows, and batch processing functions; for on-line collaboration through sharing of workflows and workflow runs; for experiment documentation through reproducibility and repeatability; and for computational support via seamless connections to supporting computing infrastructures. We developed and improved more than 60 Web services with significant potential in many different kinds of data analysis and modelling tasks. We composed reusable workflows using these Web services, also incorporating R programs. Deploying these tools into an easy-to-use and accessible 'virtual laboratory', free via the Internet, we applied the workflows in several diverse case studies. We opened the virtual laboratory for public use and through a programme of external engagement we actively encouraged scientists and third party application and tool developers to try out the services and contribute to the activity. 
CONCLUSIONS: Our work shows we can deliver an operational, scalable and flexible Internet-based virtual laboratory to meet new demands for data processing and analysis in biodiversity science and ecology. In particular, we have successfully integrated existing and popular tools and practices from different scientific disciplines to be used in biodiversity and ecological research.


Subjects
Biodiversity , Ecology/methods , Ecology/instrumentation , Internet , Models, Biological , Software , Workflow
9.
PLoS One ; 10(11): e0142240, 2015.
Article in English | MEDLINE | ID: mdl-26544980

ABSTRACT

With the rapidly growing number of data publishers, the process of harvesting and indexing information to offer advanced search and discovery becomes a critical bottleneck in globally distributed primary biodiversity data infrastructures. The Global Biodiversity Information Facility (GBIF) implemented a Harvesting and Indexing Toolkit (HIT), which largely automates data harvesting activities for hundreds of collection and observational data providers. The team of the Botanic Garden and Botanical Museum Berlin-Dahlem has extended this well-established system with a range of additional functions, including improved processing of multiple taxon identifications, the ability to represent associations between specimen and observation units, new data quality control and new reporting capabilities. The open source software B-HIT can be freely installed and used for setting up thematic networks serving the demands of particular user groups.


Subjects
Biodiversity , Software , Abstracting and Indexing , Classification , Data Mining , Databases, Factual , Internet
10.
Biodivers Data J ; 3: e5848, 2015.
Article in English | MEDLINE | ID: mdl-26491393

ABSTRACT

BACKGROUND: Reliable taxonomy underpins communication in all of biology, not least nature conservation and sustainable use of ecosystem resources. The flexibility of taxonomic interpretations, however, presents a serious challenge for end-users of taxonomic concepts. Users need standardised and continuously harmonised taxonomic reference systems, as well as high-quality and complete taxonomic data sets, but these are generally lacking for non-specialists. The solution is in dynamic, expertly curated web-based taxonomic tools. The Pan-European Species-directories Infrastructure (PESI) worked to solve this key issue by providing a taxonomic e-infrastructure for Europe. It strengthened the relevant social (expertise) and information (standards, data and technical) capacities of five major community networks on taxonomic indexing in Europe, which is essential for proper biodiversity assessment and monitoring activities. The key objectives of PESI were: 1) standardisation in taxonomic reference systems, 2) enhancement of the quality and completeness of taxonomic data sets and 3) creation of integrated access to taxonomic information. NEW INFORMATION: This paper describes the results of PESI and its future prospects, including the involvement in major European biodiversity informatics initiatives and programs.

11.
Article in English | MEDLINE | ID: mdl-26424081

ABSTRACT

We present the model and implementation of a workflow that blazes a trail in systematic biology for the re-usability of character data (data on any kind of characters of pheno- and genotypes of organisms) and their additivity from specimen to taxon level. We take into account that any taxon characterization is based on a limited set of sampled individuals and characters, and that consequently any new individual and any new character may affect the recognition of biological entities and/or the subsequent delimitation and characterization of a taxon. Taxon concepts thus frequently change during the knowledge generation process in systematic biology. Structured character data are therefore not only needed for the knowledge generation process but also for easily adapting characterizations of taxa. We aim to facilitate the construction and reproducibility of taxon characterizations from structured character data of changing sample sets by establishing a stable and unambiguous association between each sampled individual and the data processed from it. Our workflow implementation uses the European Distributed Institute of Taxonomy Platform, a comprehensive taxonomic data management and publication environment to: (i) establish a reproducible connection between sampled individuals and all samples derived from them; (ii) stably link sample-based character data with the metadata of the respective samples; (iii) record and store structured specimen-based character data in formats allowing data exchange; (iv) reversibly assign sample metadata and character datasets to taxa in an editable classification and display them and (v) organize data exchange via standard exchange formats and enable the link between the character datasets and samples in research collections, ensuring high visibility and instant re-usability of the data. The workflow implemented will contribute to organizing the interface between phylogenetic analysis and revisionary taxonomic or monographic work. 
DATABASE URL: http://campanula.e-taxonomy.net/.


Subjects
Classification/methods , Databases, Factual , Electronic Data Processing/methods , Animals , Humans
12.
Biodivers Data J ; 2: e4221, 2014.
Article in English | MEDLINE | ID: mdl-25535486

ABSTRACT

The compilation and cleaning of data needed for analyses and prediction of species distributions is a time consuming process requiring a solid understanding of data formats and service APIs provided by biodiversity informatics infrastructures. We designed and implemented a Taverna-based Data Refinement Workflow which integrates taxonomic data retrieval, data cleaning, and data selection into a consistent, standards-based, and effective system hiding the complexity of underlying service infrastructures. The workflow can be freely used both locally and through a web-portal which does not require additional software installations by users.
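
Editorial illustration (not part of the abstract): a data-cleaning step for species occurrence records could look roughly like the Python sketch below. The specific rules (coordinate range check, dropping 0/0 placeholders, removing duplicates) and the function name `clean_occurrences` are assumptions chosen for this example; the field names follow Darwin Core terms, and the abstract does not list the checks that the Taverna-based workflow actually applies.

```python
from typing import Dict, Iterable, List


def clean_occurrences(records: Iterable[Dict[str, object]]) -> List[Dict[str, object]]:
    """Keep records with plausible coordinates and drop exact duplicates."""
    seen = set()
    cleaned = []
    for record in records:
        try:
            lat = float(record["decimalLatitude"])
            lon = float(record["decimalLongitude"])
        except (KeyError, TypeError, ValueError):
            continue  # missing or non-numeric coordinates
        if not (-90 <= lat <= 90 and -180 <= lon <= 180):
            continue  # outside the valid coordinate range
        if lat == 0 and lon == 0:
            continue  # 0/0 is a common placeholder for "unknown"
        key = (record.get("scientificName"), lat, lon)
        if key in seen:
            continue  # same name at the same point already kept
        seen.add(key)
        cleaned.append({**record, "decimalLatitude": lat, "decimalLongitude": lon})
    return cleaned


# Made-up records, used only to show the call.
raw = [
    {"scientificName": "Campanula patula", "decimalLatitude": "52.45", "decimalLongitude": "13.30"},
    {"scientificName": "Campanula patula", "decimalLatitude": "52.45", "decimalLongitude": "13.3"},
    {"scientificName": "Campanula patula", "decimalLatitude": "95", "decimalLongitude": "13.3"},
    {"scientificName": "Campanula patula", "decimalLatitude": "0", "decimalLongitude": "0"},
    {"scientificName": "Campanula patula", "decimalLatitude": "abc", "decimalLongitude": "1"},
]
result = clean_occurrences(raw)
```

Only the first record survives here: the second is a duplicate, and the remaining three fail the coordinate checks.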

13.
Biodivers Data J ; 2: e4034, 2014.
Article in English | MEDLINE | ID: mdl-25349527

ABSTRACT

Fauna Europaea is Europe's main zoological taxonomic index, making the scientific names and distributions of all living, currently known, multicellular, European land and freshwater animal species integrally available in one authoritative database. Fauna Europaea covers about 260,000 taxon names, including 145,000 accepted (sub)species, assembled by a large network of more than 400 leading specialists, using advanced electronic tools for data collation, with data quality assured through sophisticated validation routines. Fauna Europaea started in 2000 as an EC-funded FP5 project and provides a unique taxonomic reference for many user groups such as scientists, governments, industries, nature conservation communities and educational programs. Fauna Europaea was formally accepted as an INSPIRE standard for Europe, as part of the European Taxonomic Backbone established in PESI. Fauna Europaea provides a public web portal at faunaeur.org with links to other key biodiversity services, is installed as a taxonomic backbone in a wide range of biodiversity services and actively contributes to biodiversity informatics innovations in various initiatives and EC programs.

14.
Zookeys ; (209): 47-54, 2012.
Article in English | MEDLINE | ID: mdl-22859877

ABSTRACT

Multimedia data held by natural history museums and universities are presently not readily accessible, even within the natural history community itself. The EU project OpenUp! is an effort to mobilise scientific biological multimedia resources and open them to a wider audience using the EUROPEANA data standards and portal. The connection between natural history and EUROPEANA is accomplished using well-established BioCASe and GBIF technologies. This is complemented with a system for data quality control, data transformation and semantic enrichment. With this approach, OpenUp! will provide at least 1.1 million multimedia objects to EUROPEANA by 2014. Its lean infrastructure is sustainable within the natural history community and will remain functional and effective in the post-project phase.

15.
Biopreserv Biobank ; 9(1): 51-5, 2011 Mar.
Article in English | MEDLINE | ID: mdl-24850206

ABSTRACT

The explicit aim of the DNA Bank Network is to close the divide between biological specimen collections and molecular sequence databases. It provides a technically optimized DNA and tissue collection service facility in the interest of all biological research, with access to well-documented DNA-containing samples and voucher specimens as well as to corresponding molecular data stored in public sequence databases. The Network enables scientists to (i) query and order DNA samples of organisms collected from natural habitats via a shared Web portal, (ii) store DNA samples for reference under optimal conditions after project completion or data publication, (iii) obtain DNA material to conduct new studies or to extend and complement previous investigations, and (iv) support good scientific practice as the deposition of DNA samples and related specimens facilitates the verification of published results.
