Implements an approximate string matching version of R's native 'match' function. Can calculate various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal sting alignment), qgrams (q- gram, cosine, jaccard distance) or heuristic metrics (Jaro, Jaro-Winkler). An implementation of soundex is provided as well. Distances can be computed between character vectors while taking proper care of encoding or between integer vectors representing generic sequences.

Documentation

Manual: stringdist.pdf
Vignette: None available.

Maintainer: Mark van der Loo <mark.vanderloo at gmail.com>

Author(s): Mark van der Loo*, Jan van der Laan*, R Core Team*, Nick Logan*

Install package and any missing dependencies by running this line in your R console:

install.packages("stringdist")

Depends R (>= 2.15.3)
Imports parallel
Suggests testthat
Enhances
Linking to

Package stringdist
Materials
URL https://github.com/markvanderloo/stringdist
Task Views OfficialStatistics
Version 0.9.4.6
Published 2017-07-31
License GPL-3
BugReports https://github.com/markvanderloo/stringdist/issues
SystemRequirements
NeedsCompilation yes
Citation
CRAN checks stringdist check results
Package source stringdist_0.9.4.6.tar.gz