bdc

A toolkit for standardizing, integrating, and cleaning biodiversity data

Overview

Handling biodiversity data from several different sources is not an easy task. Here, we present Biodiversity Data Cleaning (bdc), an R package to address quality issues and improve the fitness-for-use of biodiversity datasets. bdc contains functions to harmonize and integrate data from different sources following common standards and protocols, and implements various tests and tools to flag, document, clean, and correct taxonomic, spatial, and temporal data.

Compared to other available R packages, the main strength of the bdc package is that it brings together available tools (and a series of new ones) for assessing the quality of different dimensions of biodiversity data in a single, flexible toolkit. The functions can be applied to a multitude of taxonomic groups and datasets (including regional or local repositories), at the scale of individual countries or worldwide.

Structure of bdc

The bdc toolkit is organized in thematic modules related to different biodiversity dimensions.


:warning: The modules illustrated, and the functions within them, were linked to form a proposed reproducible workflow (see vignettes). However, all functions can also be executed independently.



1. [Merge databases]

Standardization and integration of different datasets into a standard database.

2. [Pre-filter]

Flagging and removal of invalid or non-interpretable information, followed by data amendments (e.g., correct transposed coordinates and standardize country names).

3. [Taxonomy]

Cleaning, parsing, and harmonization of scientific names against multiple taxonomic references.

4. [Space]

Flagging of erroneous, suspicious, and low-precision geographic coordinates.

5. [Time]

Flagging and, whenever possible, correction of inconsistent collection dates.
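
As a rough illustration of how the flagging modules can be chained, the sketch below runs a few pre-filter checks on a tiny made-up table. The toy records are hypothetical. The function names (`bdc_scientificName_empty`, `bdc_coordinates_empty`, `bdc_coordinates_outOfRange`, `bdc_summary_col`) and their arguments are written here as we understand them from the package reference, so please confirm them on the package website before relying on this. It also assumes the input uses Darwin Core style column names.

```r
library(bdc)

# Hypothetical toy records (not real data), using Darwin Core style column names
records <- data.frame(
  scientificName   = c("Panthera onca", NA, "Ocotea odorifera"),
  decimalLatitude  = c(-15.6, -43.3, 95.2),  # 95.2 lies outside the valid -90..90 range
  decimalLongitude = c(-47.8, -39.6, -45.4)
)

# Each check is assumed to append a logical flag column;
# FALSE marks a record that failed the test
flagged <- bdc_scientificName_empty(records, sci_names = "scientificName")
flagged <- bdc_coordinates_empty(
  flagged,
  lat = "decimalLatitude",
  lon = "decimalLongitude"
)
flagged <- bdc_coordinates_outOfRange(
  flagged,
  lat = "decimalLatitude",
  lon = "decimalLongitude"
)

# Combine the individual flags into a single summary column
flagged <- bdc_summary_col(flagged)

print(flagged)
```

Because the checks only add flag columns and do not drop rows, the decision about which flagged records to remove can be deferred until the flags have been inspected.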

Other functions

To facilitate the documentation, visualization, and interpretation of the results of data-quality tests, the package contains functions for documenting the results of the data-cleaning tests, including functions for saving i) records needing further inspection, ii) figures, and iii) data-quality reports.

Installation

Install the released version from CRAN with:

install.packages("bdc")
library(bdc)

or the development version from GitHub using:

install.packages("remotes")
remotes::install_github("brunobrr/bdc")

Load the package with:

library(bdc)

Package website

See the bdc package website (https://brunobrr.github.io/bdc/) for a detailed explanation of each module.

Getting help

If you encounter a clear bug, please file an issue here. For questions or suggestions, please send us an email (ribeiro.brr@gmail.com).

Citation

Ribeiro, BR; Velazco, SJE; Guidoni-Martins, K; Tessarolo, G; Jardim, L; Bachman, SP; Loyola, R (2022). bdc: A toolkit for standardizing, integrating, and cleaning biodiversity data. Methods in Ecology and Evolution. doi.org/10.1111/2041-210X.13868

