bdc: Biodiversity Data Cleaning

The bdc package brings together several aspects of biodiversity data cleaning in one place. It is organized into thematic modules covering different dimensions of biodiversity data: 1) Standardization and integration of different datasets; 2) Pre-filter: flagging and removal of invalid or non-interpretable information, followed by data amendments; 3) Taxonomy: cleaning, parsing, and harmonization of scientific names from several taxonomic groups against locally stored taxonomic databases, using exact and partial matching algorithms; 4) Space: flagging of erroneous, suspect, and low-precision geographic coordinates; and 5) Time: flagging and, whenever possible, correction of inconsistent collection dates. In addition, the package contains features to visualize, document, and report data quality, which is essential for making data quality assessment transparent and reproducible.

Version: 1.0.0
Imports: CoordinateCleaner, dplyr, DT, foreach, fs, ggplot2, here, magrittr, purrr, qs, readr, rgnparser, rnaturalearth, sf (≥ 1.0.5), stringdist, stringi, stringr, taxadb (≥ 0.1.3), tibble, tidyselect
Suggests: contentid (≥ 0.0.15), countrycode, covr, cowplot, DBI, doParallel, duckdb (≥ 0.3.2), knitr, maps, markdown, rangeBuilder, rappdirs, raster, remotes, rmarkdown, rnaturalearthdata, rvest, sp, testthat (≥ 3.0.0), xml2
Published: 2022-03-01
Author: Bruno Ribeiro [aut, cre], Geiziane Tessarolo [aut], Santiago Velazco [aut], Lucas Jardim [aut], Karlo Guidoni-Martins [aut], Rafael Loyola [ctb], Steven Bachman [ctb]
Maintainer: Bruno Ribeiro <ribeiro.brr at>
License: GPL (≥ 3)
URL: (website)
NeedsCompilation: no
Materials: README, NEWS
CRAN checks: bdc results


Reference manual: bdc.pdf
Vignettes: Standardization and integration of different datasets


Package source: bdc_1.0.0.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
macOS binaries: r-release (arm64): bdc_1.0.0.tgz, r-release (x86_64): bdc_1.0.0.tgz, r-oldrel: bdc_1.0.0.tgz


